Santos, Pedro Bispo
Eds.: Henrich, Verena; Hinrichs, Erhard (2014)
Using compound lists for German decompounding in a back-off scenario.
Tuebingen, Germany
Conference publication, Bibliography
Abstract
Lexical resources like GermaNet offer compound lists of reasonable size. These lists can be used as a prior step to existing decompounding algorithms, with the algorithms serving as a back-off mechanism. We investigate whether the use of compound lists can enhance dictionary-based and corpus-based decompounding algorithms. We analyze the effect of an initial decompounding step based on a compound list derived from GermaNet, using a German gold standard. The results show that applying information from GermaNet significantly improves all tested decompounding approaches across all metrics. Precision and recall increase by a statistically significant .004-.018 and .011-.022, respectively.
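The back-off scenario described in the abstract can be illustrated with a minimal Python sketch. It is not the authors' implementation: the compound list entries, the toy vocabulary, and the function names `dictionary_split` and `decompound` are hypothetical and chosen only for illustration, and the simple dictionary splitter stands in for whichever decompounding algorithm is used as the back-off.

```python
from typing import Callable, Dict, List

# Hypothetical compound list mapping a compound to its known split.
# These entries are illustrative and are not taken from GermaNet.
COMPOUND_LIST: Dict[str, List[str]] = {
    "apfelbaum": ["apfel", "baum"],
    "haustür": ["haus", "tür"],
}

# Hypothetical toy vocabulary used by the back-off splitter.
VOCABULARY = {"apfel", "baum", "haus", "tür", "garten", "zaun"}


def dictionary_split(word: str) -> List[str]:
    """Toy back-off: split at the first position where both halves are known words."""
    for i in range(1, len(word)):
        head, tail = word[:i], word[i:]
        if head in VOCABULARY and tail in VOCABULARY:
            return [head, tail]
    return [word]  # no split found; treat the word as a non-compound


def decompound(
    word: str, backoff: Callable[[str], List[str]] = dictionary_split
) -> List[str]:
    """Consult the compound list first; back off to the algorithm only on a miss."""
    key = word.lower()
    if key in COMPOUND_LIST:
        return list(COMPOUND_LIST[key])
    return backoff(key)
```

Here `decompound("Apfelbaum")` is answered directly from the list, while `decompound("Gartenzaun")` is not in the list and falls through to the back-off splitter.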
Publication type: | Conference publication |
---|---|
Published: | 2014 |
Editors: | Henrich, Verena; Hinrichs, Erhard |
Author(s): | Santos, Pedro Bispo |
Entry type: | Bibliography |
Title: | Using compound lists for German decompounding in a back-off scenario |
Language: | English |
Publication date: | August 2014 |
Publisher: | Department of Linguistics (SfS), University of Tübingen and Collaborative Research Center: Emergence of Meaning (SFB 833), University of Tübingen |
Book title: | Workshop on Computational, Cognitive, and Linguistic Approaches to the Analysis of Complex Words and Collocations (CCLCC 2014) |
Event location: | Tuebingen, Germany |
URL / URN: | http://www.sfs.uni-tuebingen.de/~vhenrich/cclcc_2014/CCLCC_2... |
Uncontrolled keywords: | UKP_a_ENLP;UKP_a_NLP4Wikis |
ID number: | TUD-CS-2014-0105 |
Department(s)/division(s): | 20 Department of Computer Science; 20 Department of Computer Science > Ubiquitous Knowledge Processing |
Date deposited: | 31 Dec 2016 14:29 |
Last modified: | 24 Aug 2018 08:42 |