
Who Wrote This? The Key to Zero-Shot LLM-Generated Text Detection Is GECScore (2405.04286v1)

Published 7 May 2024 in cs.CL

Abstract: The efficacy of an LLM-generated text detector depends substantially on the availability of sizable training data. White-box zero-shot detectors, which require no such data, are nonetheless limited by the accessibility of the source model of the LLM-generated text. In this paper, we propose a simple but effective black-box zero-shot detection approach, predicated on the observation that human-written texts typically contain more grammatical errors than LLM-generated texts. This approach entails computing the Grammar Error Correction Score (GECScore) for the given text to distinguish between human-written and LLM-generated text. Extensive experimental results show that our method outperforms current state-of-the-art (SOTA) zero-shot and supervised methods, achieving an average AUROC of 98.7% and showing strong robustness against paraphrase and adversarial perturbation attacks.
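The idea in the abstract can be illustrated with a minimal sketch. The abstract does not say how GECScore is computed, so everything below is an assumption for illustration: the score is taken to be the token-level similarity between a text and its grammar-corrected version, the corrector is a hypothetical lookup table (`toy_correct`) standing in for a real grammar-correction model, and the similarity metric (`difflib.SequenceMatcher`) and the `threshold` value are placeholders rather than the authors' choices.

```python
from difflib import SequenceMatcher
from typing import Callable

# Hypothetical stand-in for a grammar-correction model: a tiny lookup of fixes.
# A real detector would call an actual grammar error correction system here.
TOY_FIXES = {"an simple": "a simple", "dont": "don't", "recieve": "receive"}


def toy_correct(text: str) -> str:
    """Apply the toy fixes above; illustrative only."""
    for wrong, right in TOY_FIXES.items():
        text = text.replace(wrong, right)
    return text


def gec_score(text: str, correct: Callable[[str], str] = toy_correct) -> float:
    """Similarity in [0, 1] between a text and its corrected version.

    A score of 1.0 means the corrector changed nothing. Lower scores mean
    more corrections, which under the paper's observation suggests a human
    author. The similarity metric here is an assumption.
    """
    corrected = correct(text)
    return SequenceMatcher(None, text.split(), corrected.split()).ratio()


def looks_llm_generated(
    text: str,
    threshold: float = 0.95,  # hypothetical cutoff, not taken from the paper
    correct: Callable[[str], str] = toy_correct,
) -> bool:
    """Flag a text as likely LLM-generated when few corrections were needed."""
    return gec_score(text, correct) >= threshold


if __name__ == "__main__":
    for sample in ("This is a clean sentence.", "I dont recieve an simple reply"):
        print(f"{gec_score(sample):.2f}  {looks_llm_generated(sample)}  {sample}")
```

Because the sketch only compares a text with its corrected form, it needs neither labeled training data nor access to the generating model, which is the sense in which the abstract calls the approach zero-shot and black-box.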

Authors (7)
  1. Junchao Wu (9 papers)
  2. Runzhe Zhan (12 papers)
  3. Derek F. Wong (69 papers)
  4. Shu Yang (178 papers)
  5. Xuebo Liu (54 papers)
  6. Lidia S. Chao (41 papers)
  7. Min Zhang (630 papers)
Citations (3)