Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DUPE: Detection Undermining via Prompt Engineering for Deepfake Text (2404.11408v1)

Published 17 Apr 2024 in cs.AI

Abstract: As LLMs become increasingly commonplace, concern about distinguishing between human and AI text increases as well. The growing power of these models is of particular concern to teachers, who may worry that students will use LLMs to write school assignments. Facing a technology with which they are unfamiliar, teachers may turn to publicly-available AI text detectors. Yet the accuracy of many of these detectors has not been thoroughly verified, posing potential harm to students who are falsely accused of academic dishonesty. In this paper, we evaluate three different AI text detectors-Kirchenbauer et al. watermarks, ZeroGPT, and GPTZero-against human and AI-generated essays. We find that watermarking results in a high false positive rate, and that ZeroGPT has both high false positive and false negative rates. Further, we are able to significantly increase the false negative rate of all detectors by using ChatGPT 3.5 to paraphrase the original AI-generated texts, thereby effectively bypassing the detectors.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (12)
  1. Harald Baayen. Statistical Models for Word Frequency Distributions: A Linguistic Evaluation. Computers and the Humanities, 1992(26):347–363, December 1992.
  2. Universal Sentence Encoder, March 2018.
  3. On the Possibilities of AI-Generated Text Detection, April 2023.
  4. A Watermark for Large Language Models. In Proceedings of the 40th International Conference on Machine Learning, 2023.
  5. On the Reliability of Watermarks for Large Language Models, June 2023.
  6. GPT Detectors Are Biased Against Non-Native English Writers, April 2023.
  7. DetectGPT: Zero-Shot Machine-Generated Text Detection using Probability Curvature, January 2023.
  8. The Regents of the University of Michigan. Michigan Corpus of Upper-level Student Papers, 2009.
  9. Deepfake Text Detection: Limitations and Opportunities. 2023.
  10. Can AI-Generated Text be Reliably Detected?, March 2023.
  11. Generalizing to Unseen Domains: A Survey on Domain Generalization. In Proceedings of the Thirtieth, pages 4627–4635, Montreal, Canada, 2021.
  12. Testing of Detection Tools for AI-Generated Text, June 2023.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. James Weichert (4 papers)
  2. Chinecherem Dimobi (1 paper)
X Twitter Logo Streamline Icon: https://streamlinehq.com