Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations

Sukhareva, Maria and Eckle-Kohler, Judith and Habernal, Ivan and Gurevych, Iryna (2016):
Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations.
In: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016), European Language Resources Association (ELRA), Portoroz, Slovenia, [Online-Edition: http://www.lrec-conf.org/proceedings/lrec2016/pdf/494_Paper....],
[Conference or Workshop Item]

Abstract

We present a new large dataset of 12403 context-sensitive verb relations manually annotated via crowdsourcing. These relations capture fine-grained semantic information between verb-centric propositions, such as temporal or entailment relations. We propose a novel semantic verb relation scheme and design a multi-step annotation approach for scaling up the annotations using crowdsourcing. We employ several quality measures and report on agreement scores. The resulting dataset is available under a permissive Creative Commons license. It represents a valuable resource for various applications, such as automatic information consolidation or automatic summarization.

Item Type: Conference or Workshop Item
Published: 2016
Creators: Sukhareva, Maria and Eckle-Kohler, Judith and Habernal, Ivan and Gurevych, Iryna
Title: Crowdsourcing a Large Dataset of Domain-Specific Context-Sensitive Semantic Verb Relations
Language: English
Title of Book: Proceedings of the 10th International Conference on Language Resources and Evaluation (LREC 2016)
Publisher: European Language Resources Association (ELRA)
Uncontrolled Keywords: UKP_reviewed;UKP_p_DIP;Crowdsourcing, Semantic relations, dataset
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Ubiquitous Knowledge Processing
DFG-Graduiertenkollegs
DFG-Graduiertenkollegs > Research Training Group 1994 Adaptive Preparation of Information from Heterogeneous Sources
Event Location: Portoroz, Slovenia
Date Deposited: 31 Dec 2016 14:29
Official URL: http://www.lrec-conf.org/proceedings/lrec2016/pdf/494_Paper....
Identification Number: TUD-CS-2016-0021