Szarvas, György ; Vincze, Veronika ; Farkas, Richárd ; Móra, György ; Gurevych, Iryna (2012)
Cross-Genre and Cross-Domain Detection of Semantic Uncertainty.
In: Computational Linguistics, 38 (2)
Artikel, Bibliographie
Kurzbeschreibung (Abstract)
Uncertainty is an important linguistic phenomenon that is relevant in various Natural Language Processing applications, in diverse genres from medical to community generated, newswire or scientific discourse and domains from science to humanities. The semantic uncertainty of a proposition can be identified in most cases by using a finite dictionary — i.e. lexical cues — and the key steps of uncertainty detection in an application include the steps of locating the (genre- and domain-specific) lexical cues, disambiguating them, and linking them with the units of interest for the particular application (e.g. identified events in information extraction). In this study, we focus on the genre and domain differences of the context-dependent semantic uncertainty cue recognition task.
We introduce a unified subcategorization of semantic uncertainty as different domain applications can apply different uncertainty categories. Based on this categorization, we normalized the annotation of three corpora and present results with a state-of-the-art uncertainty cue recognition model for four fine-grained categories of semantic uncertainty.
Our results reveal the domain and genre dependence of the problem; nevertheless, we also show that even a distant source domain dataset can contribute to the recognition and disambiguation of uncertainty cues, efficiently reducing the annotation costs needed to cover a new domain. Thus, the unified subcategorization and domain adaptation for training the models offer an efficient solution for cross-domain and cross-genre semantic uncertainty recognition.
Typ des Eintrags: | Artikel |
---|---|
Erschienen: | 2012 |
Autor(en): | Szarvas, György ; Vincze, Veronika ; Farkas, Richárd ; Móra, György ; Gurevych, Iryna |
Art des Eintrags: | Bibliographie |
Titel: | Cross-Genre and Cross-Domain Detection of Semantic Uncertainty |
Sprache: | Englisch |
Publikationsjahr: | Juni 2012 |
Titel der Zeitschrift, Zeitung oder Schriftenreihe: | Computational Linguistics |
Jahrgang/Volume einer Zeitschrift: | 38 |
(Heft-)Nummer: | 2 |
URL / URN: | https://www.mitpressjournals.org/doi/pdf/10.1162/COLI_a_0009... |
Kurzbeschreibung (Abstract): | Uncertainty is an important linguistic phenomenon that is relevant in various Natural Language Processing applications, in diverse genres from medical to community generated, newswire or scientific discourse and domains from science to humanities. The semantic uncertainty of a proposition can be identified in most cases by using a finite dictionary — i.e. lexical cues — and the key steps of uncertainty detection in an application include the steps of locating the (genre- and domain-specific) lexical cues, disambiguating them, and linking them with the units of interest for the particular application (e.g. identified events in information extraction). In this study, we focus on the genre and domain differences of the context-dependent semantic uncertainty cue recognition task. We introduce a unified subcategorization of semantic uncertainty as different domain applications can apply different uncertainty categories. Based on this categorization, we normalized the annotation of three corpora and present results with a state-of-the-art uncertainty cue recognition model for four fine-grained categories of semantic uncertainty. Our results reveal the domain and genre dependence of the problem; nevertheless, we also show that even a distant source domain dataset can contribute to the recognition and disambiguation of uncertainty cues, efficiently reducing the annotation costs needed to cover a new domain. Thus, the unified subcategorization and domain adaptation for training the models offer an efficient solution for cross-domain and cross-genre semantic uncertainty recognition. |
Freie Schlagworte: | UKP_p_SIDIM;UKP_a_WALL |
ID-Nummer: | TUD-CS-2012-0035 |
Fachbereich(e)/-gebiet(e): | 20 Fachbereich Informatik 20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung |
Hinterlegungsdatum: | 31 Dez 2016 14:29 |
Letzte Änderung: | 24 Jan 2020 12:03 |
PPN: | |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |