Sulzmann, Jan-Nikolas ; Fürnkranz, Johannes (2010)
Probability Estimation and Aggregation for Rule Learning.
Report, Bibliographie
Kurzbeschreibung (Abstract)
Rule learning is known for its descriptive and therefore comprehensible classification models which also yield good class predictions. For different classification models, such as decision trees, a variety of techniques for obtaining good probability estimates have been proposed and evaluated. However, so far, there has been no systematic empirical study of how these techniques can be adapted to probabilistic rules and how these methods affect the probability-based rankings. In this paper we apply several basic methods for the estimation of class membership probabilities to classification rules. We also study the effect of a shrinkage technique for merging the probability estimates of rules with those of their generalizations. Finally, we compare different ways of combining probability estimates from an ensemble of rules. Our results show that for probability estimation it is beneficial to exploit the fact that rules overlap (i.e., rule averaging is preferred over rule sorting), and that individual probabilities should be combined at the level of rules and not at the level of theories.
Typ des Eintrags: | Report |
---|---|
Erschienen: | 2010 |
Autor(en): | Sulzmann, Jan-Nikolas ; Fürnkranz, Johannes |
Art des Eintrags: | Bibliographie |
Titel: | Probability Estimation and Aggregation for Rule Learning |
Sprache: | Englisch |
Publikationsjahr: | 2010 |
URL / URN: | http://www.ke.tu-darmstadt.de/publications/reports/tud-ke-20... |
Kurzbeschreibung (Abstract): | Rule learning is known for its descriptive and therefore comprehensible classification models which also yield good class predictions. For different classification models, such as decision trees, a variety of techniques for obtaining good probability estimates have been proposed and evaluated. However, so far, there has been no systematic empirical study of how these techniques can be adapted to probabilistic rules and how these methods affect the probability-based rankings. In this paper we apply several basic methods for the estimation of class membership probabilities to classification rules. We also study the effect of a shrinkage technique for merging the probability estimates of rules with those of their generalizations. Finally, we compare different ways of combining probability estimates from an ensemble of rules. Our results show that for probability estimation it is beneficial to exploit the fact that rules overlap (i.e., rule averaging is preferred over rule sorting), and that individual probabilities should be combined at the level of rules and not at the level of theories. |
ID-Nummer: | TUD-KE-2010-03 |
Fachbereich(e)/-gebiet(e): | 20 Fachbereich Informatik > Knowledge Engineering 20 Fachbereich Informatik |
Hinterlegungsdatum: | 24 Jun 2011 14:09 |
Letzte Änderung: | 05 Mär 2013 09:49 |
PPN: | |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |