Panda LLM: Training Data and Evaluation for Open-Sourced Chinese Instruction-Following Large Language Models (2305.03025v1)

Published 4 May 2023 in cs.CL and cs.AI

Abstract: This project focuses on enhancing open-source LLMs through instruction-tuning and providing comprehensive evaluations of their performance. We explore how training data factors such as quantity, quality, and linguistic distribution influence the performance of models instruction-tuned on publicly accessible, high-quality instruction datasets in both English and Chinese. Our goal is to supplement evaluation with quantitative analyses, providing valuable insights for the continued advancement of open-source chat models. Our model, data, and code are publicly available for others to use and build upon.
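
The core technique the abstract describes, instruction-tuning an open-source LLM on instruction-response pairs, can be sketched as a standard supervised fine-tuning loop. The sketch below is illustrative only, not the authors' released code: the base model, prompt template, toy data, and hyperparameters are all assumptions.

```python
# Minimal sketch of instruction-tuning (supervised fine-tuning) on
# instruction-response pairs. Not the Panda LLM training code: the base
# model, prompt template, and hyperparameters below are assumptions.
import torch
from torch.utils.data import DataLoader
from transformers import AutoModelForCausalLM, AutoTokenizer

model_name = "bigscience/bloom-560m"  # assumed small multilingual base model
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# Toy bilingual instruction data; real corpora vary in quantity, quality,
# and language mix, the training-data factors the paper studies.
examples = [
    {"instruction": "Translate to English: 你好，世界",
     "response": "Hello, world"},
    {"instruction": "Name one use of instruction-tuning.",
     "response": "Aligning a base LLM with user instructions."},
]

def format_example(ex):
    # Simple prompt template; the template used by the paper may differ.
    return f"Instruction: {ex['instruction']}\nResponse: {ex['response']}{tokenizer.eos_token}"

def collate(batch):
    texts = [format_example(ex) for ex in batch]
    enc = tokenizer(texts, return_tensors="pt", padding=True,
                    truncation=True, max_length=512)
    # Standard causal-LM objective: labels mirror the inputs, with
    # padding positions masked out via the -100 ignore index.
    labels = enc["input_ids"].clone()
    labels[enc["attention_mask"] == 0] = -100
    enc["labels"] = labels
    return enc

loader = DataLoader(examples, batch_size=2, collate_fn=collate)
optimizer = torch.optim.AdamW(model.parameters(), lr=2e-5)

model.train()
for batch in loader:
    loss = model(**batch).loss
    loss.backward()
    optimizer.step()
    optimizer.zero_grad()
    print(f"loss: {loss.item():.4f}")
```

In practice, varying the size, filtering, and English/Chinese proportion of `examples` while holding this loop fixed is one way to probe the data factors the abstract enumerates.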

Authors (4)
  1. Fangkai Jiao
  2. Bosheng Ding
  3. Tianze Luo
  4. Zhanfeng Mo