Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation (2304.01746v1)

Published 4 Apr 2023 in cs.CL

Abstract: ChatGPT, a large-scale LLM based on the advanced GPT-3.5 architecture, has shown remarkable potential in various NLP tasks. However, there is currently a dearth of comprehensive study exploring its potential in the area of Grammatical Error Correction (GEC). To showcase its capabilities in GEC, we design zero-shot chain-of-thought (CoT) and few-shot CoT settings using in-context learning for ChatGPT. Our evaluation involves assessing ChatGPT's performance on five official test sets in three different languages, along with three document-level GEC test sets in English. Our experimental results and human evaluations demonstrate that ChatGPT has excellent error detection capabilities and can freely correct errors to make the corrected sentences very fluent, possibly due to its over-correction tendencies and not adhering to the principle of minimal edits. Additionally, its performance in non-English and low-resource settings highlights its potential in multilingual GEC tasks. However, further analysis of various types of errors at the document-level has shown that ChatGPT cannot effectively correct agreement, coreference, tense errors across sentences, and cross-sentence boundary errors.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Tao Fang (19 papers)
  2. Shu Yang (178 papers)
  3. Kaixin Lan (2 papers)
  4. Derek F. Wong (69 papers)
  5. Jinpeng Hu (10 papers)
  6. Lidia S. Chao (41 papers)
  7. Yue Zhang (618 papers)
Citations (91)