
NativQA: Multilingual Culturally-Aligned Natural Query for LLMs (2407.09823v2)

Published 13 Jul 2024 in cs.CL and cs.AI

Abstract: Natural Question Answering (QA) datasets play a crucial role in evaluating the capabilities of LLMs and ensuring their effectiveness in real-world applications. Although numerous QA datasets have been developed, there is a notable lack of region-specific datasets generated by native users in their own languages. This gap hinders the effective benchmarking of LLMs for regional and cultural specificities, and it also limits the development of fine-tuned models. In this study, we propose a scalable, language-independent framework, NativQA, to seamlessly construct culturally and regionally aligned QA datasets in native languages for LLM evaluation and tuning. We demonstrate the efficacy of the proposed framework by designing a multilingual natural QA dataset, MultiNativQA, consisting of ~64k manually annotated QA pairs in seven languages, ranging from high to extremely low resource, based on queries from native speakers from 9 regions covering 18 topics. We benchmark open- and closed-source LLMs with the MultiNativQA dataset. We also showcase the framework's efficacy in constructing fine-tuning data, especially for low-resource and dialectally rich languages. We have made both the NativQA framework and the MultiNativQA dataset publicly available to the community (https://nativqa.gitlab.io).

Authors (9)
  1. Md. Arid Hasan (13 papers)
  2. Maram Hasanain (24 papers)
  3. Fatema Ahmad (4 papers)
  4. Sahinur Rahman Laskar (3 papers)
  5. Sunaya Upadhyay (1 paper)
  6. Mucahid Kutlu (23 papers)
  7. Shammur Absar Chowdhury (31 papers)
  8. Firoj Alam (75 papers)
  9. Vrunda N Sukhadia (2 papers)
Citations (2)