How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models

Rust, Phillip ; Pfeiffer, Jonas ; Vulić, Ivan ; Ruder, Sebastian ; Gurevych, Iryna (2021):
How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models.
In: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pp. 3118-3135,
Association for Computational Linguistics, 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021), virtual conference, 1-6 August 2021, [Conference or Workshop Item]

Item Type: Conference or Workshop Item
Published: 2021
Creators: Rust, Phillip ; Pfeiffer, Jonas ; Vulić, Ivan ; Ruder, Sebastian ; Gurevych, Iryna
Title: How Good is Your Tokenizer? On the Monolingual Performance of Multilingual Language Models
Language: English
Book Title: Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers)
Publisher: Association for Computational Linguistics
Uncontrolled Keywords: UKP_p_FAMULUS, emergenCITY_INF
Divisions: 20 Department of Computer Science
    20 Department of Computer Science > Ubiquitous Knowledge Processing
    LOEWE
    LOEWE > LOEWE-Zentren
    LOEWE > LOEWE-Zentren > emergenCITY
Event Title: 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (ACL-IJCNLP 2021)
Event Location: virtual conference
Event Dates: 1-6 August 2021
Date Deposited: 10 May 2021 07:07
URL / URN: https://aclanthology.org/2021.acl-long.243