TU Darmstadt / ULB / TUbiblio

Sense-annotating a lexical substitution data set with Ubyline

Miller, Tristan and Khemakhem, Mohamed and Eckart de Castilho, Richard and Gurevych, Iryna (2016):
Sense-annotating a lexical substitution data set with Ubyline.
In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), European Language Resources Association (ELRA), ISBN 978-2-9517408-9-1,
[Online-Edition: http://www.lrec-conf.org/proceedings/lrec2016/pdf/108_Paper....],
[Conference or Workshop Item]

Abstract

We describe the construction of GLASS, a newly sense-annotated version of the German lexical substitution data set used at the GermEval 2015: LexSub shared task. Using the two annotation layers, we conduct the first known empirical study of the relationship between manually applied word senses and lexical substitutions. We find that synonymy and hypernymy/hyponymy are the only semantic relations directly linking targets to their substitutes, and that substitutes in the target's hypernymy/hyponymy taxonomy closely align with the synonyms of a single GermaNet synset. Despite this, these substitutes account for a minority of those provided by the annotators. The results of our analysis accord with those of a previous study on English-language data (albeit with automatically induced word senses), leading us to suspect that the sense–substitution relations we discovered may be of a universal nature. We also tentatively conclude that relatively cheap lexical substitution annotations can be used as a knowledge source for automatic WSD. Also introduced in this paper is Ubyline, the web application used to produce the sense annotations. Ubyline presents an intuitive user interface optimized for annotating lexical sample data, and is readily adaptable to sense inventories other than GermaNet.

Item Type: Conference or Workshop Item
Erschienen: 2016
Creators: Miller, Tristan and Khemakhem, Mohamed and Eckart de Castilho, Richard and Gurevych, Iryna
Title: Sense-annotating a lexical substitution data set with Ubyline
Language: English
Abstract:

We describe the construction of GLASS, a newly sense-annotated version of the German lexical substitution data set used at the GermEval 2015: LexSub shared task. Using the two annotation layers, we conduct the first known empirical study of the relationship between manually applied word senses and lexical substitutions. We find that synonymy and hypernymy/hyponymy are the only semantic relations directly linking targets to their substitutes, and that substitutes in the target's hypernymy/hyponymy taxonomy closely align with the synonyms of a single GermaNet synset. Despite this, these substitutes account for a minority of those provided by the annotators. The results of our analysis accord with those of a previous study on English-language data (albeit with automatically induced word senses), leading us to suspect that the sense–substitution relations we discovered may be of a universal nature. We also tentatively conclude that relatively cheap lexical substitution annotations can be used as a knowledge source for automatic WSD. Also introduced in this paper is Ubyline, the web application used to produce the sense annotations. Ubyline presents an intuitive user interface optimized for annotating lexical sample data, and is readily adaptable to sense inventories other than GermaNet.

Title of Book: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
Publisher: European Language Resources Association (ELRA)
ISBN: 978-2-9517408-9-1
Uncontrolled Keywords: reviewed;UKP_a_LangTech4eHum;UKP_a_LSRA;UKP_reviewed
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Ubiquitous Knowledge Processing
DFG-Graduiertenkollegs
DFG-Graduiertenkollegs > Research Training Group 1994 Adaptive Preparation of Information from Heterogeneous Sources
Date Deposited: 31 Dec 2016 14:29
Official URL: http://www.lrec-conf.org/proceedings/lrec2016/pdf/108_Paper....
Identification Number: TUD-CS-2016-0022
Related URLs:
Export:

Optionen (nur für Redakteure)

View Item View Item