Multilingual Few-Shot Learning via Language Model Retrieval (2306.10964v1)
Abstract: Transformer-based large language models (LLMs) have achieved remarkable success in few-shot in-context learning and have attracted substantial research interest. However, their performance depends heavily on the choice of example prompts and varies widely with how those examples are selected. In this paper, we conduct a comprehensive study of retrieving semantically similar few-shot examples and using them as context, which helps the model predict the correct label without any gradient updates in multilingual and cross-lingual settings. We evaluate the proposed method on five natural language understanding datasets covering intent detection, question classification, sentiment analysis, and topic classification. The proposed method consistently outperforms random sampling on both monolingual and cross-lingual tasks in non-English languages.
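As a rough illustration of the retrieval step described in the abstract, the sketch below embeds a labeled example pool with a multilingual sentence encoder, scores candidates by cosine similarity to the query, and formats the top-k matches as the in-context prompt for an LLM to complete. The encoder name, example data, value of k, and prompt template are illustrative assumptions, not the paper's exact setup.

```python
# Minimal sketch of retrieval-based example selection for few-shot prompting.
# Assumptions (not from the paper): sentence-transformers with the
# "paraphrase-multilingual-MiniLM-L12-v2" encoder, cosine similarity, k = 4.
import numpy as np
from sentence_transformers import SentenceTransformer

encoder = SentenceTransformer("paraphrase-multilingual-MiniLM-L12-v2")

# Labeled pool that examples are retrieved from (placeholder data).
pool = [
    ("set an alarm for 7 am", "alarm"),
    ("play some jazz music", "music"),
    ("what's the weather in Jakarta", "weather"),
    ("remind me to call mom tonight", "reminder"),
]
pool_texts = [text for text, _ in pool]
pool_emb = encoder.encode(pool_texts, normalize_embeddings=True)

def build_prompt(query: str, k: int = 4) -> str:
    """Retrieve the k most similar labeled examples and format a prompt."""
    q_emb = encoder.encode([query], normalize_embeddings=True)[0]
    scores = pool_emb @ q_emb                 # cosine similarity (vectors are normalized)
    top = np.argsort(-scores)[:k]
    lines = [f"Input: {pool_texts[i]}\nLabel: {pool[i][1]}" for i in top]
    lines.append(f"Input: {query}\nLabel:")   # the LLM completes this label
    return "\n\n".join(lines)

print(build_prompt("wake me up at six tomorrow"))
```

The resulting prompt is passed to a frozen language model, so label prediction happens purely in context, without any gradient updates.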
- Genta Indra Winata
- Liang-Kang Huang
- Soumya Vadlamannati
- Yash Chandarana