TU Darmstadt / ULB / TUbiblio

Representation and Interchange of Linguistic Annotation. An In-Depth, Side-by-Side Comparison of Three Designs

Eckart de Castilho, Richard and Ide, Nancy and Lapponi, Emanuele and Oepen, Stephan and Suderman, Keith and Velldal, Erik and Verhagen, Marc :
Representation and Interchange of Linguistic Annotation. An In-Depth, Side-by-Side Comparison of Three Designs.
[Online-Edition: http://www.aclweb.org/anthology/W17-0808]
Proceedings of the 11th Linguistics Annotation Workshop (LAW XI) at EACL 2017 Association for Computational Linguistics
[Conference or Workshop Item] , (2017)

Official URL: http://www.aclweb.org/anthology/W17-0808

Abstract

For decades, most self-respecting linguistic engineering initiatives have designed and implemented custom representations for various layers of, for example, morphological, syntactic, and semantic analysis. Despite occasional efforts at harmonization or even standardization, our field today is blessed with a multitude of ways of encoding and exchanging linguistic annotations of these types, both at the levels of ‘abstract syntax’, naming choices, and of course file formats. To a large degree, it is possible to work within and across design plurality by conversion, and often there may be good reasons for divergent design reflecting differences in use. However, it is likely that some abstract commonalities across choices of representation are obscured by more superficial differences, and conversely there is no obvious procedure to tease apart what actually constitute contentful vs. mere technical divergences. In this study, we seek to conceptually align three representations for common types of morpho-syntactic analysis, pinpoint what in our view constitute contentful differences, and reflect on the underlying principles and specific requirements that led to individual choices. We expect that a more in-depth understanding of these choices across designs may led to increased harmonization, or at least to more informed design of future representations.

Item Type: Conference or Workshop Item
Erschienen: 2017
Creators: Eckart de Castilho, Richard and Ide, Nancy and Lapponi, Emanuele and Oepen, Stephan and Suderman, Keith and Velldal, Erik and Verhagen, Marc
Title: Representation and Interchange of Linguistic Annotation. An In-Depth, Side-by-Side Comparison of Three Designs
Language: English
Abstract:

For decades, most self-respecting linguistic engineering initiatives have designed and implemented custom representations for various layers of, for example, morphological, syntactic, and semantic analysis. Despite occasional efforts at harmonization or even standardization, our field today is blessed with a multitude of ways of encoding and exchanging linguistic annotations of these types, both at the levels of ‘abstract syntax’, naming choices, and of course file formats. To a large degree, it is possible to work within and across design plurality by conversion, and often there may be good reasons for divergent design reflecting differences in use. However, it is likely that some abstract commonalities across choices of representation are obscured by more superficial differences, and conversely there is no obvious procedure to tease apart what actually constitute contentful vs. mere technical divergences. In this study, we seek to conceptually align three representations for common types of morpho-syntactic analysis, pinpoint what in our view constitute contentful differences, and reflect on the underlying principles and specific requirements that led to individual choices. We expect that a more in-depth understanding of these choices across designs may led to increased harmonization, or at least to more informed design of future representations.

Title of Book: Proceedings of the 11th Linguistics Annotation Workshop (LAW XI) at EACL 2017
Publisher: Association for Computational Linguistics
Uncontrolled Keywords: reviewed;UKP_p_DKPro;UKP_reviewed;UKP_s_DKPro_Core;CEDIFOR;UKP_p_OpenMinTeD
Divisions: Department of Computer Science
Department of Computer Science > Ubiquitous Knowledge Processing
DFG-Graduiertenkollegs
DFG-Graduiertenkollegs > Research Training Group 1994 Adaptive Preparation of Information from Heterogeneous Sources
Date Deposited: 22 Feb 2017 13:44
Official URL: http://www.aclweb.org/anthology/W17-0808
Identification Number: TUD-CS-2017-0042
Export:

Optionen (nur für Redakteure)

View Item View Item