TU Darmstadt / ULB / TUbiblio

IMPLI: Investigating NLI Models’ Performance on Figurative Language

Stowe, Kevin ; Utama, Prasetya ; Gurevych, Iryna (2022)
IMPLI: Investigating NLI Models’ Performance on Figurative Language.
60th Annual Meeting of the Association for Computational Linguistics. Dublin, Ireland (22.-27.05.2022)
Konferenzveröffentlichung, Bibliographie

Kurzbeschreibung (Abstract)

Natural language inference (NLI) has been widely used as a task to train and evaluate models for language understanding. However, the ability of NLI models to perform inferences requiring understanding of figurative language such as idioms and metaphors remains understudied. We introduce the IMPLI (Idiomatic and Metaphoric Paired Language Inference) dataset, an English dataset consisting of paired sentences spanning idioms and metaphors. We develop novel methods to generate 24k semiautomatic pairs as well as manually creating 1.8k gold pairs. We use IMPLI to evaluate NLI models based on RoBERTa fine-tuned on the widely used MNLI dataset. We then show that while they can reliably detect entailment relationship between figurative phrases with their literal counterparts, they perform poorly on similarly structured examples where pairs are designed to be non-entailing. This suggests the limits of current NLI models with regard to understanding figurative language and this dataset serves as a benchmark for future improvements in this direction.

Typ des Eintrags: Konferenzveröffentlichung
Erschienen: 2022
Autor(en): Stowe, Kevin ; Utama, Prasetya ; Gurevych, Iryna
Art des Eintrags: Bibliographie
Titel: IMPLI: Investigating NLI Models’ Performance on Figurative Language
Sprache: Englisch
Publikationsjahr: 17 Mai 2022
Verlag: ACL
Buchtitel: Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers)
Veranstaltungstitel: 60th Annual Meeting of the Association for Computational Linguistics
Veranstaltungsort: Dublin, Ireland
Veranstaltungsdatum: 22.-27.05.2022
URL / URN: https://aclanthology.org/2022.acl-long.369/
Kurzbeschreibung (Abstract):

Natural language inference (NLI) has been widely used as a task to train and evaluate models for language understanding. However, the ability of NLI models to perform inferences requiring understanding of figurative language such as idioms and metaphors remains understudied. We introduce the IMPLI (Idiomatic and Metaphoric Paired Language Inference) dataset, an English dataset consisting of paired sentences spanning idioms and metaphors. We develop novel methods to generate 24k semiautomatic pairs as well as manually creating 1.8k gold pairs. We use IMPLI to evaluate NLI models based on RoBERTa fine-tuned on the widely used MNLI dataset. We then show that while they can reliably detect entailment relationship between figurative phrases with their literal counterparts, they perform poorly on similarly structured examples where pairs are designed to be non-entailing. This suggests the limits of current NLI models with regard to understanding figurative language and this dataset serves as a benchmark for future improvements in this direction.

Fachbereich(e)/-gebiet(e): 20 Fachbereich Informatik
20 Fachbereich Informatik > Ubiquitäre Wissensverarbeitung
Hinterlegungsdatum: 19 Mai 2022 10:01
Letzte Änderung: 14 Nov 2022 14:31
PPN: 501650954
Export:
Suche nach Titel in: TUfind oder in Google
Frage zum Eintrag Frage zum Eintrag

Optionen (nur für Redakteure)
Redaktionelle Details anzeigen Redaktionelle Details anzeigen