TU Darmstadt / ULB / TUbiblio

Using compound lists for German decompounding in a back-off scenario

Santos, Pedro Bispo
Hrsg.: Henrich, Verena ; Hinrichs, Erhard (2014)
Using compound lists for German decompounding in a back-off scenario.
Tuebingen, Germany
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

Lexical resources like GermaNet offer compound lists of reasonable size. These lists can be used as a prior step to existing decompounding algorithms, wherein decompounding algorithms would function as a back-off mechanism. We investigate whether the use of compound lists can enhance dictionary and corpus-based decompounding algorithms. We analyze the effect of using an initial decompounding step based on a compound list derived from GermaNet with a gold standard in German. The obtained results show that applying information from GermaNet can significantly improve all tested decompounding approaches across all metrics. Precision and recall increases statistically significant by .004-.018 and .011- .022 respectively.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2014
Herausgeber: Henrich, Verena ; Hinrichs, Erhard
Autor(en): Santos, Pedro Bispo
Art des Eintrags: Bibliographie
Titel: Using compound lists for German decompounding in a back-off scenario
Sprache: Englisch
Publikationsjahr: August 2014
Verlag: Department of Linguistics (SfS), University of Tübingen and Collaborative Research Center: Emergence of Meaning (SFB 833), University of Tübingen
Buchtitel: Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014)
Veranstaltungsort: Tuebingen, Germany
URL / URN: http://www.sfs.uni-tuebingen.de/~vhenrich/cclcc_2014/CCLCC_2...
Kurzbeschreibung (Abstract):

Lexical resources like GermaNet offer compound lists of reasonable size. These lists can be used as a prior step to existing decompounding algorithms, wherein decompounding algorithms would function as a back-off mechanism. We investigate whether the use of compound lists can enhance dictionary and corpus-based decompounding algorithms. We analyze the effect of using an initial decompounding step based on a compound list derived from GermaNet with a gold standard in German. The obtained results show that applying information from GermaNet can significantly improve all tested decompounding approaches across all metrics. Precision and recall increases statistically significant by .004-.018 and .011- .022 respectively.

Freie Schlagworte: UKP_a_ENLP;UKP_a_NLP4Wikis
ID-Nummer: TUD-CS-2014-0105
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung
Hinterlegungsdatum: 31 Dez 2016 14:29
Letzte Änderung: 24 Aug 2018 08:42
PPN:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen