Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Crosslingual Retrieval Augmented In-context Learning for Bangla (2311.00587v2)

Published 1 Nov 2023 in cs.CL

Abstract: The promise of LLMs in Natural Language Processing has often been overshadowed by their limited performance in low-resource languages such as Bangla. To address this, our paper presents a pioneering approach that utilizes cross-lingual retrieval augmented in-context learning. By strategically sourcing semantically similar prompts from high-resource language, we enable multilingual pretrained LLMs (MPLMs), especially the generative model BLOOMZ, to successfully boost performance on Bangla tasks. Our extensive evaluation highlights that the cross-lingual retrieval augmented prompts bring steady improvements to MPLMs over the zero-shot performance.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Xiaoqian Li (10 papers)
  2. Ercong Nie (25 papers)
  3. Sheng Liang (11 papers)
Citations (6)