
Walia-LLM: Enhancing Amharic-LLaMA by Integrating Task-Specific and Generative Datasets (2402.08015v5)

Published 12 Feb 2024 in cs.CL

Abstract: LLMs have received a lot of attention in NLP research because of their exceptional performance in understanding and generating human languages. However, low-resource languages are left behind due to the unavailability of resources. In this work, we focus on enhancing the LLaMA-2-Amharic model by integrating task-specific and generative datasets to improve LLM performance for Amharic. We compile an Amharic instruction fine-tuning dataset and fine-tune the LLaMA-2-Amharic model on it. The fine-tuned model shows promising results on different NLP tasks. We open-source our dataset creation pipeline, instruction datasets, trained models, and evaluation outputs to promote language-specific studies on these models.
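The setup described in the abstract, supervised instruction tuning of a LLaMA-2-Amharic base model on a compiled instruction dataset, can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' released pipeline: the base checkpoint identifier, the Alpaca-style prompt template, the JSONL file name, the use of LoRA, and all hyperparameters are assumptions made for the example.

```python
# Minimal sketch: instruction fine-tuning of a LLaMA-2-Amharic base with LoRA.
# Checkpoint ID, dataset path, prompt template, and hyperparameters are illustrative.
from datasets import load_dataset
from peft import LoraConfig, get_peft_model
from transformers import (AutoModelForCausalLM, AutoTokenizer,
                          DataCollatorForLanguageModeling,
                          Trainer, TrainingArguments)

BASE = "iocuydi/llama-2-amharic-3784m"  # assumed Hub ID; substitute the actual base checkpoint

tokenizer = AutoTokenizer.from_pretrained(BASE)
tokenizer.pad_token = tokenizer.eos_token

def to_features(example):
    # Assumed Alpaca-style template: instruction + optional input -> response.
    text = (f"### Instruction:\n{example['instruction']}\n\n"
            f"### Input:\n{example.get('input', '')}\n\n"
            f"### Response:\n{example['output']}{tokenizer.eos_token}")
    return tokenizer(text, truncation=True, max_length=1024)

# Hypothetical JSONL file with instruction / input / output fields.
data = load_dataset("json", data_files="amharic_instructions.jsonl")["train"]
data = data.map(to_features, remove_columns=data.column_names)

model = AutoModelForCausalLM.from_pretrained(BASE)
model = get_peft_model(model, LoraConfig(
    r=16, lora_alpha=32, lora_dropout=0.05,
    target_modules=["q_proj", "v_proj"], task_type="CAUSAL_LM",
))

trainer = Trainer(
    model=model,
    args=TrainingArguments(output_dir="walia-sft", num_train_epochs=3,
                           per_device_train_batch_size=4, learning_rate=2e-4),
    train_dataset=data,
    # mlm=False gives standard causal-LM labels (shifted next-token prediction).
    data_collator=DataCollatorForLanguageModeling(tokenizer, mlm=False),
)
trainer.train()
```

In practice one would point this at the released instruction datasets and checkpoint and adjust the adapter rank, sequence length, and batch size to the available hardware.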

Authors (9)