TU Darmstadt / ULB / TUbiblio

Multilingual Knowledge in Aligned Wiktionary and OmegaWiki for Translation Applications

Matuschek, Michael and Meyer, Christian M. and Gurevych, Iryna
Rehm, Georg and Stein, Daniel and Sasaki, Felix and Witt, Andreas (eds.) (2018):
Multilingual Knowledge in Aligned Wiktionary and OmegaWiki for Translation Applications.
In: Language technologies for a multilingual Europe. TC 3 III, Berlin, Language Science Press, pp. 139-180, DOI: 10.5281/zenodo.1291940,
[Online-Edition: http://langsci-press.org//catalog/book/106],
[Book Section]

Abstract

Multilingual lexical-semantic resources play an important role in translation applications. However, multilingual resources with sufficient quality and coverage are rare as the effort of manually constructing such a resource is substantial. In recent years, the emergence of Web 2.0 has opened new possibilities for constructing large-scale lexical-semantic resources. We identified Wiktionary and OmegaWiki as two important multilingual initiatives where a community of users (“crowd”) collaboratively edits and refines the lexical information. They seem especially appropriate in the multilingual domain as users from all languages and cultures can easily contribute. However, despite their advantages such as open access and coverage of multiple languages, these resources have hardly been systematically investigated and utilized until now. Therefore, the goals of our contribution are threefold: (1) We analyze how these resources emerged and characterize their content and structure; (2) We propose an alignment at the word sense level to exploit the complementary information contained in both resources for increased coverage; (3) We describe a mapping of the resources to a standardized, unified model (UBY-LMF) thus creating a large freely available multilingual resource designed for easy integration into applications such as machine translation or computer-aided translation environments.

Item Type: Book Section
Erschienen: 2018
Editors: Rehm, Georg and Stein, Daniel and Sasaki, Felix and Witt, Andreas
Creators: Matuschek, Michael and Meyer, Christian M. and Gurevych, Iryna
Title: Multilingual Knowledge in Aligned Wiktionary and OmegaWiki for Translation Applications
Language: English
Abstract:

Multilingual lexical-semantic resources play an important role in translation applications. However, multilingual resources with sufficient quality and coverage are rare as the effort of manually constructing such a resource is substantial. In recent years, the emergence of Web 2.0 has opened new possibilities for constructing large-scale lexical-semantic resources. We identified Wiktionary and OmegaWiki as two important multilingual initiatives where a community of users (“crowd”) collaboratively edits and refines the lexical information. They seem especially appropriate in the multilingual domain as users from all languages and cultures can easily contribute. However, despite their advantages such as open access and coverage of multiple languages, these resources have hardly been systematically investigated and utilized until now. Therefore, the goals of our contribution are threefold: (1) We analyze how these resources emerged and characterize their content and structure; (2) We propose an alignment at the word sense level to exploit the complementary information contained in both resources for increased coverage; (3) We describe a mapping of the resources to a standardized, unified model (UBY-LMF) thus creating a large freely available multilingual resource designed for easy integration into applications such as machine translation or computer-aided translation environments.

Title of Book: Language technologies for a multilingual Europe. TC 3 III
Series Name: Translation and Multilingual Natural Language Processing
Volume: 4
Place of Publication: Berlin
Publisher: Language Science Press
ISBN: 978-3-946234-73-9
Uncontrolled Keywords: UKP_s_JOWKL;UKP_p_UBY;UKP_a_LangTech4eHum;reviewed;UKP_reviewed
Divisions: 20 Department of Computer Science
20 Department of Computer Science > Ubiquitous Knowledge Processing
Date Deposited: 31 Dec 2016 14:29
DOI: 10.5281/zenodo.1291940
Official URL: http://langsci-press.org//catalog/book/106
Additional Information:

Reprint of Matuschek et al. (2013).

Export:
Suche nach Titel in: TUfind oder in Google

Optionen (nur für Redakteure)

View Item View Item