
The Effectiveness of Intermediate-Task Training for Code-Switched Natural Language Understanding (2107.09931v1)

Published 21 Jul 2021 in cs.CL and cs.LG

Abstract: While recent benchmarks have spurred a lot of new work on improving the generalization of pretrained multilingual language models on multilingual tasks, techniques to improve code-switched natural language understanding tasks have been far less explored. In this work, we propose the use of bilingual intermediate pretraining as a reliable technique to derive large and consistent performance gains on three different NLP tasks using code-switched text. We achieve substantial absolute improvements of 7.87%, 20.15%, and 10.99%, on the mean accuracies and F1 scores over previous state-of-the-art systems for Hindi-English Natural Language Inference (NLI), Question Answering (QA) tasks, and Spanish-English Sentiment Analysis (SA) respectively. We show consistent performance gains on four different code-switched language-pairs (Hindi-English, Spanish-English, Tamil-English and Malayalam-English) for SA. We also present a code-switched masked language modeling (MLM) pretraining technique that consistently benefits SA compared to standard MLM pretraining using real code-switched text.
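
The core recipe described here is an intermediate pretraining stage, e.g. masked language modeling on code-switched text, applied to a multilingual encoder before fine-tuning on the downstream task. The sketch below is a minimal illustration of that setup using Hugging Face Transformers; the model choice, data file name, and hyperparameters are assumptions for illustration, not the authors' exact configuration.

```python
# Minimal sketch (not the authors' code): intermediate MLM pretraining on
# code-switched text, followed by downstream fine-tuning in a separate step.
from transformers import (
    AutoTokenizer,
    AutoModelForMaskedLM,
    DataCollatorForLanguageModeling,
    Trainer,
    TrainingArguments,
)
from datasets import load_dataset

MODEL_NAME = "bert-base-multilingual-cased"  # assumed multilingual encoder

tokenizer = AutoTokenizer.from_pretrained(MODEL_NAME)
model = AutoModelForMaskedLM.from_pretrained(MODEL_NAME)

# Hypothetical text file of code-switched sentences (one sentence per line).
raw = load_dataset("text", data_files={"train": "hinglish_codeswitched.txt"})

def tokenize(batch):
    return tokenizer(batch["text"], truncation=True, max_length=128)

tokenized = raw.map(tokenize, batched=True, remove_columns=["text"])

# Standard MLM objective: mask 15% of input tokens.
collator = DataCollatorForLanguageModeling(
    tokenizer=tokenizer, mlm=True, mlm_probability=0.15
)

args = TrainingArguments(
    output_dir="cs-mlm-intermediate",
    per_device_train_batch_size=32,
    num_train_epochs=3,
    learning_rate=5e-5,
)

Trainer(
    model=model,
    args=args,
    train_dataset=tokenized["train"],
    data_collator=collator,
).train()

# The resulting checkpoint is then fine-tuned on the code-switched
# NLI / QA / SA task of interest.
```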

Authors (4)
  1. Archiki Prasad (18 papers)
  2. Mohammad Ali Rehan (2 papers)
  3. Shreya Pathak (12 papers)
  4. Preethi Jyothi (51 papers)
Citations (9)