An Evaluation of Efficient Multilabel Classification Algorithms for Large-Scale Problems in the Legal Domain

Loza Mencía, Eneldo ; Fürnkranz, Johannes
Hinneburg, Alexander (ed.) :

An Evaluation of Efficient Multilabel Classification Algorithms for Large-Scale Problems in the Legal Domain.
[Online edition: http://www.ke.informatik.tu-darmstadt.de/publications/papers...]
Proceedings of the LWA 2007: Lernen - Wissen - Adaption
[Conference or workshop contribution], (2007)

Official URL: http://www.ke.informatik.tu-darmstadt.de/publications/papers...

Abstract

In this paper we evaluate the performance of multilabel classification algorithms on two classification tasks based on the EUR-Lex database of legal documents of the European Union. The database permits different settings of large-scale multilabel problems with up to 4000 classes over the same underlying documents. We compare the well-known one-against-all approach (OAA) with its recently proposed improvement, the multiclass multilabel perceptron algorithm (MMP), which modifies the OAA ensemble by respecting dependencies between the base classifiers in the training protocol of the classifier ensemble. Both use the simple but very efficient perceptron algorithm as the underlying classifier, which makes them well suited to large-scale multilabel classification problems, in particular when the number of classes is high. Our results on the EUR-Lex database confirm that the MMP algorithm responds better to an increasing number of classes than the one-against-all approach. We also show that it is possible in principle to handle very large multilabel problems efficiently and effectively.
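
To make the contrast between the two training protocols concrete, the following is a minimal sketch, not the authors' implementation. It assumes dense feature vectors with a trailing bias feature, and it uses a simplified MMP-style update that spreads the correction uniformly over all wrongly ordered (relevant, irrelevant) class pairs. The function names, the toy data, and the uniform weighting are illustrative assumptions that do not come from the paper.

```python
# Illustrative sketch only: names, toy data, and the uniform update
# weighting are assumptions, not taken from the paper.

def dot(w, x):
    return sum(wi * xi for wi, xi in zip(w, x))

def train_oaa(data, n_classes, n_features, epochs=10):
    """One-against-all: each class gets its own independently trained perceptron."""
    weights = [[0.0] * n_features for _ in range(n_classes)]
    for _ in range(epochs):
        for x, labels in data:
            for c in range(n_classes):
                target = 1 if c in labels else -1
                if target * dot(weights[c], x) <= 0:  # perceptron mistake on class c
                    weights[c] = [w + target * xi for w, xi in zip(weights[c], x)]
    return weights

def predict_oaa(weights, x):
    """Predict every class whose perceptron gives a positive score."""
    return {c for c, w in enumerate(weights) if dot(w, x) > 0}

def train_mmp(data, n_classes, n_features, epochs=10):
    """MMP-style: update only when some relevant class is not ranked
    above some irrelevant class, so the perceptrons are trained jointly."""
    weights = [[0.0] * n_features for _ in range(n_classes)]
    for _ in range(epochs):
        for x, labels in data:
            scores = [dot(w, x) for w in weights]
            irrelevant = [c for c in range(n_classes) if c not in labels]
            update = [0.0] * n_classes
            n_errors = 0
            for r in labels:
                for s in irrelevant:
                    if scores[r] <= scores[s]:  # wrongly ordered pair
                        update[r] += 1.0
                        update[s] -= 1.0
                        n_errors += 1
            if n_errors:
                for c in range(n_classes):
                    step = update[c] / n_errors  # uniform weighting (assumption)
                    if step:
                        weights[c] = [w + step * xi for w, xi in zip(weights[c], x)]
    return weights

def rank(weights, x):
    """Return class indices ordered from highest to lowest score."""
    return sorted(range(len(weights)), key=lambda c: dot(weights[c], x), reverse=True)

# Hypothetical toy data: 3 features plus a constant bias feature, 3 classes.
toy_data = [
    ([1, 0, 0, 1], {0}),
    ([0, 1, 0, 1], {1}),
    ([0, 0, 1, 1], {2}),
    ([1, 1, 0, 1], {0, 1}),
]

oaa_weights = train_oaa(toy_data, n_classes=3, n_features=4)
mmp_weights = train_mmp(toy_data, n_classes=3, n_features=4)
```

In this sketch the OAA perceptrons never see each other's scores, whereas the MMP-style loop leaves the weights untouched whenever the ranking of relevant over irrelevant classes is already correct. This is one way to read "respecting dependencies between the base classifiers" in the abstract.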

Item type: Conference or workshop contribution (not specified)
Published: 2007
Editor: Hinneburg, Alexander
Author(s): Loza Mencía, Eneldo ; Fürnkranz, Johannes
Title: An Evaluation of Efficient Multilabel Classification Algorithms for Large-Scale Problems in the Legal Domain
Language: English
Book title: Proceedings of the LWA 2007: Lernen - Wissen - Adaption
Department(s)/Division(s): Department of Computer Science > Knowledge Engineering
Department of Computer Science
Date deposited: 24 Jun 2011 15:28