Meta4XNLI: A Crosslingual Parallel Corpus for Metaphor Detection and Interpretation (2404.07053v1)
Abstract: Metaphors, although occasionally unnoticed, are ubiquitous in everyday language. Thus, it is crucial for LLMs to be able to grasp the underlying meaning of this kind of figurative language. In this work, we present Meta4XNLI, a novel parallel dataset for the tasks of metaphor detection and interpretation that contains metaphor annotations in both Spanish and English. We investigate LLMs' metaphor identification and understanding abilities through a series of monolingual and cross-lingual experiments that leverage our proposed corpus. To understand how these non-literal expressions affect models' performance, we examine the results and perform an error analysis. Additionally, parallel data offers many opportunities to investigate metaphor transferability between these languages and the impact of translation on the development of multilingual annotated resources.