MAPLE: Multilingual Evaluation of Parameter Efficient Finetuning of Large Language Models (2401.07598v3)

Published 15 Jan 2024 in cs.CL

Abstract: Parameter Efficient Finetuning (PEFT) has emerged as a viable solution for improving the performance of LLMs without requiring massive resources and compute. Prior work on multilingual evaluation has shown that there is a large gap between the performance of LLMs on English and other languages. Further, there is also a large gap between the performance of smaller open-source models and larger LLMs. Finetuning can be an effective way to bridge this gap and make LLMs more equitable. In this work, we finetune the Llama-2-7B and Mistral-7B models on two synthetic multilingual instruction tuning datasets to determine the effect on model performance on six downstream tasks covering forty languages in all. Additionally, we experiment with various parameters, such as rank for low-rank adaptation and values of quantisation, to determine their effects on downstream performance, and find that higher rank and higher quantisation values benefit low-resource languages. We find that PEFT of smaller open-source models sometimes bridges the gap between the performance of these models and the larger ones; however, English performance can take a hit. We also find that finetuning sometimes improves performance on low-resource languages while degrading performance on high-resource languages.
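
The setup described in the abstract (LoRA rank and quantisation as the key PEFT knobs on a 7B base model) corresponds to a QLoRA-style configuration. The sketch below is illustrative only, not the authors' training code, and assumes the Hugging Face transformers, peft, and bitsandbytes libraries; the model name, rank, and quantisation settings are placeholder choices.

```python
# Minimal QLoRA-style PEFT sketch (illustrative; not the paper's exact configuration).
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig
from peft import LoraConfig, get_peft_model

model_name = "meta-llama/Llama-2-7b-hf"  # or "mistralai/Mistral-7B-v0.1"

# 4-bit quantisation of the frozen base model; the paper sweeps quantisation values.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name, quantization_config=bnb_config)

# LoRA adapter; the rank r is one of the hyperparameters varied in the paper.
lora_config = LoraConfig(
    r=64,                       # low-rank adaptation rank (illustrative value)
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)

model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only the adapter weights are trainable
```

Only the low-rank adapter matrices are updated during instruction tuning, which is what keeps the resource and compute requirements far below full finetuning.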

Authors (4)
  1. Divyanshu Aggarwal (9 papers)
  2. Ashutosh Sathe (9 papers)
  3. Sunayana Sitaram (54 papers)
  4. Ishaan Watts (4 papers)
Citations (1)