Taqyim: Evaluating Arabic NLP Tasks Using ChatGPT Models (2306.16322v1)
Abstract: Large language models (LLMs), including ChatGPT, a chat-based model built on top of LLMs such as GPT-3.5 and GPT-4, have demonstrated impressive performance on various downstream tasks without requiring fine-tuning. Although languages other than English account for a smaller share of the training data, these models also exhibit remarkable capabilities in those languages. In this study, we assess the performance of GPT-3.5 and GPT-4 on seven distinct Arabic NLP tasks: sentiment analysis, translation, transliteration, paraphrasing, part-of-speech tagging, summarization, and diacritization. Our findings reveal that GPT-4 outperforms GPT-3.5 on five of the seven tasks. Furthermore, we conduct an extensive analysis of the sentiment analysis task, providing insights into how LLMs achieve exceptional results on a challenging dialectal dataset. Additionally, we introduce a new Python interface, https://github.com/ARBML/Taqyim, that makes it easy to evaluate these tasks.
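The evaluations described in the abstract amount to prompting OpenAI chat models on each task and scoring the responses. Below is a minimal sketch of how one task (zero-shot sentiment analysis) could be run through the OpenAI Python client; the prompt wording, model name, and label set are illustrative assumptions, not the exact configuration used in the paper or exposed by the Taqyim interface.

```python
# Minimal sketch: zero-shot Arabic sentiment classification with an OpenAI chat model.
# The prompt, model name, and label set are illustrative; the actual Taqyim pipeline
# (https://github.com/ARBML/Taqyim) wraps this kind of call behind its own interface.
from openai import OpenAI

client = OpenAI()  # reads OPENAI_API_KEY from the environment

LABELS = ["positive", "negative"]  # assumed binary label set for the dialectal dataset


def classify_sentiment(text: str, model: str = "gpt-3.5-turbo") -> str:
    """Ask the chat model for a one-word sentiment label for an Arabic sentence."""
    response = client.chat.completions.create(
        model=model,
        temperature=0,  # deterministic output for evaluation
        messages=[
            {
                "role": "system",
                "content": (
                    "You are a sentiment classifier for Arabic text. "
                    f"Answer with exactly one word from: {', '.join(LABELS)}."
                ),
            },
            {"role": "user", "content": text},
        ],
    )
    return response.choices[0].message.content.strip().lower()


if __name__ == "__main__":
    print(classify_sentiment("الخدمة كانت ممتازة والموظفين متعاونين"))  # expected: positive
```

Running such a prompt over a labeled test set and comparing the returned labels against the gold annotations gives the per-task scores that the paper reports for GPT-3.5 and GPT-4.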
- Zaid Alyafeai (21 papers)
- Maged S. Alshaibani (2 papers)
- Badr AlKhamissi (24 papers)
- Hamzah Luqman (12 papers)
- Ebrahim Alareqi (3 papers)
- Ali Fadel (5 papers)