Crosslingual Retrieval Augmented In-context Learning for Bangla (2311.00587v2)
Published 1 Nov 2023 in cs.CL
Abstract: The promise of LLMs in Natural Language Processing has often been overshadowed by their limited performance in low-resource languages such as Bangla. To address this, our paper presents a pioneering approach that uses cross-lingual retrieval-augmented in-context learning. By strategically sourcing semantically similar prompts from a high-resource language, we enable multilingual pretrained LLMs (MPLMs), especially the generative model BLOOMZ, to successfully boost performance on Bangla tasks. Our extensive evaluation shows that cross-lingual retrieval-augmented prompts bring steady improvements to MPLMs over their zero-shot performance.
- Xiaoqian Li
- Ercong Nie
- Sheng Liang
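
The retrieval step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the function names, the prompt template, the sentiment task, and the three-dimensional vectors are all hypothetical. In practice the vectors would come from a multilingual sentence encoder that places Bangla and high-resource-language text in a shared space, and the finished prompt would be passed to a model such as BLOOMZ.

```python
import math

def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0

def retrieve(query_vec, pool, k=1):
    """Return the k pool examples whose vectors are most similar to the query."""
    ranked = sorted(pool, key=lambda ex: cosine(query_vec, ex["vec"]), reverse=True)
    return ranked[:k]

def build_prompt(query_text, examples):
    """Prepend retrieved labeled examples to the unlabeled query (hypothetical template)."""
    parts = [f"Review: {ex['text']}\nSentiment: {ex['label']}" for ex in examples]
    parts.append(f"Review: {query_text}\nSentiment:")
    return "\n\n".join(parts)

# Hypothetical labeled pool in a high-resource language (English).
# The "vec" values are made-up stand-ins for multilingual sentence embeddings.
POOL = [
    {"text": "The movie was wonderful.", "label": "positive", "vec": [0.9, 0.1, 0.0]},
    {"text": "The food was terrible.", "label": "negative", "vec": [0.1, 0.9, 0.0]},
]

# Bangla query ("The movie was wonderful.") with a made-up embedding.
query_text = "সিনেমাটি চমৎকার ছিল।"
query_vec = [0.8, 0.2, 0.1]

examples = retrieve(query_vec, POOL, k=1)
prompt = build_prompt(query_text, examples)
print(prompt)
```

The zero-shot baseline mentioned in the abstract corresponds to calling `build_prompt(query_text, [])`, which yields the query with no retrieved examples in front of it.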