TU Darmstadt / ULB / TUbiblio

Sense-annotating a lexical substitution data set with Ubyline

Miller, Tristan ; Khemakhem, Mohamed ; Eckart de Castilho, Richard ; Gurevych, Iryna
Hrsg.: Calzolari, Nicoletta (2016)
Sense-annotating a lexical substitution data set with Ubyline.
Tenth International Conference on Language Resources and Evaluation (LREC 2016). Portorož, Slovenia (May 23-28, 2016)
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

We describe the construction of GLASS, a newly sense-annotated version of the German lexical substitution data set used at the GermEval 2015: LexSub shared task. Using the two annotation layers, we conduct the first known empirical study of the relationship between manually applied word senses and lexical substitutions. We find that synonymy and hypernymy/hyponymy are the only semantic relations directly linking targets to their substitutes, and that substitutes in the target's hypernymy/hyponymy taxonomy closely align with the synonyms of a single GermaNet synset. Despite this, these substitutes account for a minority of those provided by the annotators. The results of our analysis accord with those of a previous study on English-language data (albeit with automatically induced word senses), leading us to suspect that the sense–substitution relations we discovered may be of a universal nature. We also tentatively conclude that relatively cheap lexical substitution annotations can be used as a knowledge source for automatic WSD. Also introduced in this paper is Ubyline, the web application used to produce the sense annotations. Ubyline presents an intuitive user interface optimized for annotating lexical sample data, and is readily adaptable to sense inventories other than GermaNet.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2016
Herausgeber: Calzolari, Nicoletta
Autor(en): Miller, Tristan ; Khemakhem, Mohamed ; Eckart de Castilho, Richard ; Gurevych, Iryna
Art des Eintrags: Bibliographie
Titel: Sense-annotating a lexical substitution data set with Ubyline
Sprache: Englisch
Publikationsjahr: Mai 2016
Ort: Paris
Verlag: European Language Resources Association (ELRA)
Buchtitel: LREC 2016, Tenth International Conference on Language Resources and Evaluation : May 23-28, 2016, Grand Hotel Bernardin Conference Center, Portorož, Slovenia
Veranstaltungstitel: Tenth International Conference on Language Resources and Evaluation (LREC 2016)
Veranstaltungsort: Portorož, Slovenia
Veranstaltungsdatum: May 23-28, 2016
URL / URN: http://www.lrec-conf.org/proceedings/lrec2016/summaries/108....
Zugehörige Links:
Kurzbeschreibung (Abstract):

We describe the construction of GLASS, a newly sense-annotated version of the German lexical substitution data set used at the GermEval 2015: LexSub shared task. Using the two annotation layers, we conduct the first known empirical study of the relationship between manually applied word senses and lexical substitutions. We find that synonymy and hypernymy/hyponymy are the only semantic relations directly linking targets to their substitutes, and that substitutes in the target's hypernymy/hyponymy taxonomy closely align with the synonyms of a single GermaNet synset. Despite this, these substitutes account for a minority of those provided by the annotators. The results of our analysis accord with those of a previous study on English-language data (albeit with automatically induced word senses), leading us to suspect that the sense–substitution relations we discovered may be of a universal nature. We also tentatively conclude that relatively cheap lexical substitution annotations can be used as a knowledge source for automatic WSD. Also introduced in this paper is Ubyline, the web application used to produce the sense annotations. Ubyline presents an intuitive user interface optimized for annotating lexical sample data, and is readily adaptable to sense inventories other than GermaNet.

Freie Schlagworte: reviewed;UKP_a_LangTech4eHum;UKP_a_LSRA;UKP_reviewed
ID-Nummer: TUD-CS-2016-0022
Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung
DFG-Graduiertenkollegs
DFG-Graduiertenkollegs > Graduiertenkolleg 1994 Adaptive Informationsaufbereitung aus heterogenen Quellen
Hinterlegungsdatum: 31 Dez 2016 14:29
Letzte Änderung: 09 Feb 2024 13:29
PPN:
Zugehörige Links:
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen