Crémilleux, Bruno ; Fürnkranz, Johannes ; Knobbe, Arno J. ; Scholz, Martin (2007)
From Local Patterns to Global Models: The LeGo Approach to Data Mining.
Report, Bibliographie
Kurzbeschreibung (Abstract)
In this paper we present LeGo, a generic framework that utilizes existing local pattern mining techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well known KDD process models, our work identifies different phases within the data mining step, each of which is formulated in terms of different formal constraints. It starts with a phase of mining patterns that are individually promising. Later phases establish the context given by the global data mining task by selecting groups of diverse and highly informative patterns, which are finally combined to one or more global models that address the overall data mining task(s). The paper discusses the connection to various learning techniques, and illustrates that our framework is broad enough to cover and leverage frequent pattern mining, subgroup discovery, pattern teams, multi-view learning, and several other popular algorithms. The Safarii learning toolbox serves as a proof-of-concept of its high potential for practical data mining applications. Finally, we point out several challenging open research questions that naturally emerge in a constraint-based local-to-global pattern mining, selection, and combination framework.
Typ des Eintrags: | Report |
---|---|
Erschienen: | 2007 |
Autor(en): | Crémilleux, Bruno ; Fürnkranz, Johannes ; Knobbe, Arno J. ; Scholz, Martin |
Art des Eintrags: | Bibliographie |
Titel: | From Local Patterns to Global Models: The LeGo Approach to Data Mining |
Sprache: | Englisch |
Publikationsjahr: | 2007 |
URL / URN: | http://www.ke.informatik.tu-darmstadt.de/publications/report... |
Kurzbeschreibung (Abstract): | In this paper we present LeGo, a generic framework that utilizes existing local pattern mining techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well known KDD process models, our work identifies different phases within the data mining step, each of which is formulated in terms of different formal constraints. It starts with a phase of mining patterns that are individually promising. Later phases establish the context given by the global data mining task by selecting groups of diverse and highly informative patterns, which are finally combined to one or more global models that address the overall data mining task(s). The paper discusses the connection to various learning techniques, and illustrates that our framework is broad enough to cover and leverage frequent pattern mining, subgroup discovery, pattern teams, multi-view learning, and several other popular algorithms. The Safarii learning toolbox serves as a proof-of-concept of its high potential for practical data mining applications. Finally, we point out several challenging open research questions that naturally emerge in a constraint-based local-to-global pattern mining, selection, and combination framework. |
ID-Nummer: | TUD-KE-2007-06 |
Fachbereich(e)/-gebiet(e): | 20 Fachbereich Informatik 20 Fachbereich Informatik > Knowledge Engineering |
Hinterlegungsdatum: | 24 Jun 2011 15:25 |
Letzte Änderung: | 03 Jun 2018 21:24 |
PPN: | |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |