Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CLEME2.0: Towards More Interpretable Evaluation by Disentangling Edits for Grammatical Error Correction (2407.00934v1)

Published 1 Jul 2024 in cs.CL

Abstract: The paper focuses on improving the interpretability of Grammatical Error Correction (GEC) metrics, which receives little attention in previous studies. To bridge the gap, we propose CLEME2.0, a reference-based evaluation strategy that can describe four elementary dimensions of GEC systems, namely hit-correction, error-correction, under-correction, and over-correction. They collectively contribute to revealing the critical characteristics and locating drawbacks of GEC systems. Evaluating systems by Combining these dimensions leads to high human consistency over other reference-based and reference-less metrics. Extensive experiments on 2 human judgement datasets and 6 reference datasets demonstrate the effectiveness and robustness of our method. All the codes will be released after the peer review.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Jingheng Ye (15 papers)
  2. Zishan Xu (8 papers)
  3. Yinghui Li (65 papers)
  4. Xuxin Cheng (42 papers)
  5. Linlin Song (1 paper)
  6. Qingyu Zhou (28 papers)
  7. Hai-Tao Zheng (94 papers)
  8. Ying Shen (76 papers)
  9. Xin Su (67 papers)