PrOnto: Language Model Evaluations for 859 Languages (2305.12612v2)
Abstract: Evaluation datasets are critical resources for measuring the quality of pretrained language models. However, due to the high cost of dataset annotation, these resources are scarce for most languages other than English, making it difficult to assess the quality of language models. In this work, we present a new method for evaluation dataset construction which enables any language with a New Testament translation to receive a suite of evaluation datasets suitable for pretrained language model evaluation. The method critically involves aligning verses with those in the New Testament portion of English OntoNotes, and then projecting annotations from English to the target language, with no manual annotation required. We apply this method to 1051 New Testament translations in 859 languages and make them publicly available. Additionally, we conduct experiments which demonstrate the efficacy of our method for creating evaluation tasks which can assess language model quality.
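Because the alignment is at the verse level, any label that holds of a whole English verse (derived from its OntoNotes annotations) can be carried over to the corresponding verse of a target translation simply by matching verse IDs, with no word alignment required. Below is a minimal Python sketch of this projection step, assuming both sides index verses by (book, chapter, verse); the function name and the hypothetical sentence-mood labels are illustrative assumptions, not the paper's released implementation.

```python
# Minimal sketch of verse-aligned annotation projection (illustrative only).
from typing import Dict, List, Tuple

VerseID = Tuple[str, int, int]  # (book, chapter, verse)

def project_labels(
    english_labels: Dict[VerseID, str],
    target_verses: Dict[VerseID, str],
) -> List[Tuple[str, str]]:
    """Pair each target-language verse with the label derived from its
    verse-aligned English counterpart, producing (text, label) examples
    with no manual annotation of the target language."""
    examples = []
    for verse_id, text in target_verses.items():
        label = english_labels.get(verse_id)
        if label is not None:  # skip verses absent from this translation
            examples.append((text, label))
    return examples

# Toy usage with hypothetical verse-level labels (e.g., sentence mood
# read off the English OntoNotes annotation of the aligned verse).
english_labels = {("JHN", 3, 16): "declarative", ("MAT", 16, 26): "interrogative"}
target_verses = {("JHN", 3, 16): "Porque de tal manera amó Dios al mundo..."}
print(project_labels(english_labels, target_verses))
# [('Porque de tal manera amó Dios al mundo...', 'declarative')]
```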
- David Ifeoluwa Adelani et al. 2021. MasakhaNER: Named entity recognition for African languages. Transactions of the Association for Computational Linguistics, 9:1116–1131.
- Ehsaneddin Asgari and Hinrich Schütze. 2017. Past, present, future: A computational investigation of the typology of tense in 1000 languages. In Proceedings of the 2017 Conference on Empirical Methods in Natural Language Processing, pages 113–124, Copenhagen, Denmark. Association for Computational Linguistics.
- BIG-bench collaboration. 2021. Beyond the imitation game: Measuring and extrapolating the capabilities of language models. In preparation.
- Ethan C. Chau, Lucy H. Lin, and Noah A. Smith. 2020. Parsing with multilingual BERT, a small corpus, and a small treebank. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 1324–1334, Online. Association for Computational Linguistics.
- Ethan C. Chau and Noah A. Smith. 2021. Specializing multilingual language models: An empirical study. In Proceedings of the 1st Workshop on Multilingual Representation Learning, pages 51–61, Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Yiming Cui, Wanxiang Che, Ting Liu, Bing Qin, Shijin Wang, and Guoping Hu. 2020. Revisiting pre-trained models for Chinese natural language processing. In Findings of the Association for Computational Linguistics: EMNLP 2020, pages 657–668, Online. Association for Computational Linguistics.
- Jacob Devlin, Ming-Wei Chang, Kenton Lee, and Kristina Toutanova. 2019. BERT: Pre-training of deep bidirectional transformers for language understanding. In Proceedings of the 2019 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long and Short Papers), pages 4171–4186, Minneapolis, Minnesota. Association for Computational Linguistics.
- Jan Vium Enghoff, Søren Harrison, and Željko Agić. 2018. Low-resource named entity recognition via multi-source projection: Not quite there yet? In Proceedings of the 2018 EMNLP Workshop W-NUT: The 4th Workshop on Noisy User-generated Text, pages 195–201, Brussels, Belgium. Association for Computational Linguistics.
- Luke Gessler and Amir Zeldes. 2022. MicroBERT: Effective training of low-resource monolingual BERTs through parameter reduction and multitask learning. In Proceedings of the 2nd Workshop on Multi-lingual Representation Learning (MRL), pages 86–99, Abu Dhabi, United Arab Emirates (Hybrid). Association for Computational Linguistics.
- Yinhan Liu, Myle Ott, Naman Goyal, Jingfei Du, Mandar Joshi, Danqi Chen, Omer Levy, Mike Lewis, Luke Zettlemoyer, and Veselin Stoyanov. 2019. RoBERTa: A robustly optimized BERT pretraining approach. arXiv preprint arXiv:1907.11692.
- Louis Martin, Benjamin Muller, Pedro Javier Ortiz Suárez, Yoann Dupont, Laurent Romary, Éric de la Clergerie, Djamé Seddah, and Benoît Sagot. 2020. CamemBERT: a tasty French language model. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7203–7219, Online. Association for Computational Linguistics.
- Arya D. McCarthy, Rachel Wicks, Dylan Lewis, Aaron Mueller, Winston Wu, Oliver Adams, Garrett Nicolai, Matt Post, and David Yarowsky. 2020. The Johns Hopkins University Bible corpus: 1600+ tongues for typological exploration. In Proceedings of the Twelfth Language Resources and Evaluation Conference, pages 2884–2892, Marseille, France. European Language Resources Association.
- Benjamin Muller, Antonios Anastasopoulos, Benoît Sagot, and Djamé Seddah. 2021. When being unseen from mBERT is just the beginning: Handling new languages with multilingual language models. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 448–462, Online. Association for Computational Linguistics.
- Joakim Nivre et al. 2016. Universal Dependencies v1: A multilingual treebank collection. In Proceedings of the Tenth International Conference on Language Resources and Evaluation (LREC'16), pages 1659–1666, Portorož, Slovenia. European Language Resources Association (ELRA).
- Kelechi Ogueji, Yuxin Zhu, and Jimmy Lin. 2021. Small data? No problem! Exploring the viability of pretrained multilingual language models for low-resourced languages. In Proceedings of the 1st Workshop on Multilingual Representation Learning, pages 116–126, Punta Cana, Dominican Republic. Association for Computational Linguistics.
- Sebastian Padó and Mirella Lapata. 2009. Cross-lingual annotation projection of semantic roles. Journal of Artificial Intelligence Research, 36(1):307–340.
- Xiaoman Pan, Boliang Zhang, Jonathan May, Joel Nothman, Kevin Knight, and Heng Ji. 2017. Cross-lingual name tagging and linking for 282 languages. In Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 1946–1958, Vancouver, Canada. Association for Computational Linguistics.
- Tom Pelsmaeker and Wilker Aziz. 2020. Effective estimation of deep generative language models. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics, pages 7220–7236, Online. Association for Computational Linguistics.
- John T. Platts. 1884. A dictionary of Urdu, classical Hindi, and English. W. H. Allen & Co., London.
- Paul Portner. 2018. Mood. Oxford Surveys in Semantics and Pragmatics. Oxford University Press, Oxford, New York.
- Peng Qi, Yuhao Zhang, Yuhui Zhang, Jason Bolton, and Christopher D. Manning. 2020. Stanza: A Python natural language processing toolkit for many human languages. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 101–108, Online. Association for Computational Linguistics.
- Victor Sanh, Lysandre Debut, Julien Chaumond, and Thomas Wolf. 2019. DistilBERT, a distilled version of BERT: smaller, faster, cheaper and lighter. arXiv preprint arXiv:1910.01108.
- Ashish Vaswani, Noam Shazeer, Niki Parmar, Jakob Uszkoreit, Llion Jones, Aidan N. Gomez, Łukasz Kaiser, and Illia Polosukhin. 2017. Attention is all you need. In Advances in Neural Information Processing Systems, volume 30.
- Alex Wang, Yada Pruksachatkun, Nikita Nangia, Amanpreet Singh, Julian Michael, Felix Hill, Omer Levy, and Samuel R. Bowman. 2019. SuperGLUE: A stickier benchmark for general-purpose language understanding systems. In Advances in Neural Information Processing Systems, volume 32. Curran Associates Inc., Red Hook, NY, USA.
- Alex Warstadt, Alicia Parrish, Haokun Liu, Anhad Mohananey, Wei Peng, Sheng-Fu Wang, and Samuel R. Bowman. 2020. BLiMP: The benchmark of linguistic minimal pairs for English. Transactions of the Association for Computational Linguistics, 8:377–392.
- Thomas Wolf et al. 2019. HuggingFace's Transformers: State-of-the-art natural language processing. arXiv preprint arXiv:1910.03771.
- David Yarowsky and Grace Ngai. 2001. Inducing multilingual POS taggers and NP bracketers via robust projection across aligned corpora. In Second Meeting of the North American Chapter of the Association for Computational Linguistics.
- Bryan Zhang. 2022. Improve MT for search with selected translation memory using search signals. In Proceedings of the 15th Biennial Conference of the Association for Machine Translation in the Americas (Volume 2: Users and Providers Track and Government Track), pages 123–131, Orlando, USA. Association for Machine Translation in the Americas.
- Ralph Weischedel et al. 2013. OntoNotes 5.0. LDC2013T19, Linguistic Data Consortium, Philadelphia, PA. ISLRN 151-738-649-048-2.