Türkçe Dil Modellerinin Performans Karşılaştırması (Performance Comparison of Turkish Language Models) (2404.17010v1)
Abstract: The capabilities that large language models (LLMs) have demonstrated across almost all kinds of tasks have attracted the attention not only of researchers but also of society at large, and have enabled these models to become products. Commercially successful LLMs are available; however, users may prefer open-source LLMs for reasons of cost, data privacy, or regulation. Despite the growing number of such models, there is no comprehensive comparison of their performance for Turkish. This study aims to fill this gap in the literature. Seven selected LLMs are compared on their in-context learning and question-answering abilities. Turkish datasets for in-context learning and question answering were prepared, and both automatic and human evaluations were conducted. The results show that, for question answering, continuing pretraining before fine-tuning with instruction datasets is more successful at adapting multilingual models to Turkish, and that in-context learning performance is not strongly correlated with question-answering performance.
Authors:
- Eren Dogan
- M. Egemen Uzun
- Atahan Uz
- H. Emre Seyrek
- Ahmed Zeer
- Ezgi Sevi
- H. Toprak Kesgin
- M. Kaan Yuce
- M. Fatih Amasyali