Stowe, Kevin ; Utama, Prasetya ; Gurevych, Iryna (2022)
IMPLI: Investigating NLI Models’ Performance on Figurative Language.
60th Annual Meeting of the Association for Computational Linguistics. Dublin, Ireland (22.05.2022-27.05.2022)
Konferenzveröffentlichung, Bibliographie
Kurzbeschreibung (Abstract)
Natural language inference (NLI) has been widely used as a task to train and evaluate models for language understanding. However, the ability of NLI models to perform inferences requiring understanding of figurative language such as idioms and metaphors remains understudied. We introduce the IMPLI (Idiomatic and Metaphoric Paired Language Inference) dataset, an English dataset consisting of paired sentences spanning idioms and metaphors. We develop novel methods to generate 24k semiautomatic pairs as well as manually creating 1.8k gold pairs. We use IMPLI to evaluate NLI models based on RoBERTa fine-tuned on the widely used MNLI dataset. We then show that while they can reliably detect entailment relationship between figurative phrases with their literal counterparts, they perform poorly on similarly structured examples where pairs are designed to be non-entailing. This suggests the limits of current NLI models with regard to understanding figurative language and this dataset serves as a benchmark for future improvements in this direction.
Typ des Eintrags: | Konferenzveröffentlichung |
---|---|
Erschienen: | 2022 |
Autor(en): | Stowe, Kevin ; Utama, Prasetya ; Gurevych, Iryna |
Art des Eintrags: | Bibliographie |
Titel: | IMPLI: Investigating NLI Models’ Performance on Figurative Language |
Sprache: | Englisch |
Publikationsjahr: | 17 Mai 2022 |
Verlag: | ACL |
Buchtitel: | Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers) |
Veranstaltungstitel: | 60th Annual Meeting of the Association for Computational Linguistics |
Veranstaltungsort: | Dublin, Ireland |
Veranstaltungsdatum: | 22.05.2022-27.05.2022 |
URL / URN: | https://aclanthology.org/2022.acl-long.369/ |
Kurzbeschreibung (Abstract): | Natural language inference (NLI) has been widely used as a task to train and evaluate models for language understanding. However, the ability of NLI models to perform inferences requiring understanding of figurative language such as idioms and metaphors remains understudied. We introduce the IMPLI (Idiomatic and Metaphoric Paired Language Inference) dataset, an English dataset consisting of paired sentences spanning idioms and metaphors. We develop novel methods to generate 24k semiautomatic pairs as well as manually creating 1.8k gold pairs. We use IMPLI to evaluate NLI models based on RoBERTa fine-tuned on the widely used MNLI dataset. We then show that while they can reliably detect entailment relationship between figurative phrases with their literal counterparts, they perform poorly on similarly structured examples where pairs are designed to be non-entailing. This suggests the limits of current NLI models with regard to understanding figurative language and this dataset serves as a benchmark for future improvements in this direction. |
Fachbereich(e)/-gebiet(e): | 20 Fachbereich Informatik 20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung |
Hinterlegungsdatum: | 19 Mai 2022 10:01 |
Letzte Änderung: | 14 Nov 2022 14:31 |
PPN: | 501650954 |
Export: | |
Suche nach Titel in: | TUfind oder in Google |
Frage zum Eintrag |
Optionen (nur für Redakteure)
Redaktionelle Details anzeigen |