TU Darmstadt / ULB / TUbiblio

From Local Patterns to Global Models: The LeGo Approach to Data Mining

Crémilleux, Bruno ; Fürnkranz, Johannes ; Knobbe, Arno J. ; Scholz, Martin (2007)
From Local Patterns to Global Models: The LeGo Approach to Data Mining.
Report, Bibliographie

Kurzbeschreibung (Abstract)

In this paper we present LeGo, a generic framework that utilizes existing local pattern mining techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well known KDD process models, our work identifies different phases within the data mining step, each of which is formulated in terms of different formal constraints. It starts with a phase of mining patterns that are individually promising. Later phases establish the context given by the global data mining task by selecting groups of diverse and highly informative patterns, which are finally combined to one or more global models that address the overall data mining task(s). The paper discusses the connection to various learning techniques, and illustrates that our framework is broad enough to cover and leverage frequent pattern mining, subgroup discovery, pattern teams, multi-view learning, and several other popular algorithms. The Safarii learning toolbox serves as a proof-of-concept of its high potential for practical data mining applications. Finally, we point out several challenging open research questions that naturally emerge in a constraint-based local-to-global pattern mining, selection, and combination framework.

Typ des Eintrags: Report
Erschienen: 2007
Autor(en): Crémilleux, Bruno ; Fürnkranz, Johannes ; Knobbe, Arno J. ; Scholz, Martin
Art des Eintrags: Bibliographie
Titel: From Local Patterns to Global Models: The LeGo Approach to Data Mining
Sprache: Englisch
Publikationsjahr: 2007
URL / URN: http://www.ke.informatik.tu-darmstadt.de/publications/report...
Kurzbeschreibung (Abstract):

In this paper we present LeGo, a generic framework that utilizes existing local pattern mining techniques for global modeling in a variety of diverse data mining tasks. In the spirit of well known KDD process models, our work identifies different phases within the data mining step, each of which is formulated in terms of different formal constraints. It starts with a phase of mining patterns that are individually promising. Later phases establish the context given by the global data mining task by selecting groups of diverse and highly informative patterns, which are finally combined to one or more global models that address the overall data mining task(s). The paper discusses the connection to various learning techniques, and illustrates that our framework is broad enough to cover and leverage frequent pattern mining, subgroup discovery, pattern teams, multi-view learning, and several other popular algorithms. The Safarii learning toolbox serves as a proof-of-concept of its high potential for practical data mining applications. Finally, we point out several challenging open research questions that naturally emerge in a constraint-based local-to-global pattern mining, selection, and combination framework.

ID-Nummer: TUD-KE-2007-06
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Knowledge Engineering
Hinterlegungsdatum: 24 Jun 2011 15:25
Letzte Änderung: 03 Jun 2018 21:24
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen