2000 character limit reached
Ace-CEFR -- A Dataset for Automated Evaluation of the Linguistic Difficulty of Conversational Texts for LLM Applications (2506.14046v1)
Published 16 Jun 2025 in cs.CL and cs.AI
Abstract: There is an unmet need to evaluate the language difficulty of short, conversational passages of text, particularly for training and filtering LLMs. We introduce Ace-CEFR, a dataset of English conversational text passages expert-annotated with their corresponding level of text difficulty. We experiment with several models on Ace-CEFR, including Transformer-based models and LLMs. We show that models trained on Ace-CEFR can measure text difficulty more accurately than human experts and have latency appropriate to production environments. Finally, we release the Ace-CEFR dataset to the public for research and development.