LEIA: Facilitating Cross-lingual Knowledge Transfer in Language Models with Entity-based Data Augmentation (2402.11485v2)

Published 18 Feb 2024 in cs.CL, cs.AI, and cs.LG

Abstract: Adapting English-based LLMs to other languages has become increasingly popular due to the efficiency and potential of cross-lingual transfer. However, existing language adaptation methods often overlook the benefits of cross-lingual supervision. In this study, we introduce LEIA, a language adaptation tuning method that utilizes Wikipedia entity names aligned across languages. This method involves augmenting the target language corpus with English entity names and training the model using left-to-right language modeling. We assess LEIA on diverse question answering datasets using 7B-parameter LLMs, demonstrating significant performance gains across various non-English languages. The source code is available at https://github.com/studio-ousia/leia.
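To make the augmentation step concrete, the sketch below shows one way target-language text annotated with Wikipedia entity mentions could be augmented with aligned English entity names before left-to-right language-model training. This is a minimal illustration, not the authors' implementation (see the repository linked above); the special tokens, data structures, and function names are assumptions made for the example.

# Hedged sketch of entity-based data augmentation in the spirit of LEIA.
# The "<translate>"/"</translate>" tokens and all helper names below are
# illustrative assumptions, not taken from the paper or its codebase.

from dataclasses import dataclass

@dataclass
class EntityMention:
    start: int           # character offset where the mention begins
    end: int             # character offset one past the mention
    target_title: str    # Wikipedia title in the target language

def augment_with_english_entities(
    text: str,
    mentions: list[EntityMention],
    interlanguage_links: dict[str, str],   # target-language title -> English title
    open_tok: str = "<translate>",
    close_tok: str = "</translate>",
) -> str:
    """Insert the aligned English entity name right after each mention."""
    out, cursor = [], 0
    for m in sorted(mentions, key=lambda m: m.start):
        english_title = interlanguage_links.get(m.target_title)
        out.append(text[cursor:m.end])
        if english_title is not None:
            out.append(f" {open_tok}{english_title}{close_tok}")
        cursor = m.end
    out.append(text[cursor:])
    return "".join(out)

# Example: a Japanese sentence whose mention "東京" is aligned to the English title "Tokyo".
sentence = "東京は日本の首都です。"
mentions = [EntityMention(start=0, end=2, target_title="東京")]
links = {"東京": "Tokyo"}
print(augment_with_english_entities(sentence, mentions, links))
# -> 東京 <translate>Tokyo</translate>は日本の首都です。

Wrapping the inserted English name in dedicated tokens keeps it distinguishable from the surrounding target-language text during training; the resulting augmented corpus can then be fed to a standard left-to-right language-modeling objective.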
