SeqXGPT: Sentence-Level AI-Generated Text Detection (2310.08903v2)
Abstract: Widely deployed LLMs can generate human-like content, raising concerns about their abuse. It is therefore important to build strong AI-generated text (AIGT) detectors. Existing work considers only document-level AIGT detection, so in this paper we first introduce a sentence-level detection challenge by synthesizing a dataset of documents polished with LLMs, i.e., documents that contain both sentences written by humans and sentences modified by LLMs. We then propose **Seq**uence **X** (Check) **GPT**, a novel method that uses lists of token log probabilities from white-box LLMs as features for sentence-level AIGT detection. These features are composed like *waves* in speech processing and cannot be studied directly by LLMs themselves, so we build SeqXGPT on convolution and self-attention networks. We evaluate it on both sentence- and document-level detection challenges. Experimental results show that previous methods struggle with sentence-level AIGT detection, while our method not only significantly surpasses the baselines on both sentence- and document-level detection but also exhibits strong generalization capabilities.
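Below is a minimal, hypothetical sketch of the idea described in the abstract, not the authors' released implementation: it extracts a per-token log-probability "wave" from one white-box LLM (GPT-2 is assumed here as the scoring model) and feeds it to a small convolution + self-attention network that emits a human/AI label for every token. The layer sizes, the use of a standard Transformer encoder, and the `token_log_probs`/`WaveDetector` names are illustrative assumptions; SeqXGPT itself draws waves from several white-box LLMs.

```python
# Sketch: per-token log-prob "wave" from a white-box LM -> conv + self-attention token classifier.
import torch
import torch.nn as nn
from transformers import GPT2LMHeadModel, GPT2TokenizerFast

tokenizer = GPT2TokenizerFast.from_pretrained("gpt2")
lm = GPT2LMHeadModel.from_pretrained("gpt2").eval()

@torch.no_grad()
def token_log_probs(text: str) -> torch.Tensor:
    """Log probability the LM assigns to each token of `text` (the 'wave' feature)."""
    ids = tokenizer(text, return_tensors="pt").input_ids            # (1, T)
    log_probs = lm(ids).logits.log_softmax(dim=-1)                  # (1, T, V)
    # The probability of token t is read from the model's prediction at position t-1.
    return log_probs[0, :-1].gather(1, ids[0, 1:, None]).squeeze(-1)  # (T-1,)

class WaveDetector(nn.Module):
    """Convolutional front-end + Transformer encoder over the wave; one label per token."""
    def __init__(self, n_llms: int = 1, d_model: int = 64, n_classes: int = 2):
        super().__init__()
        self.conv = nn.Sequential(
            nn.Conv1d(n_llms, d_model, kernel_size=5, padding=2), nn.GELU(),
            nn.Conv1d(d_model, d_model, kernel_size=3, padding=1), nn.GELU(),
        )
        layer = nn.TransformerEncoderLayer(d_model, nhead=4, batch_first=True)
        self.encoder = nn.TransformerEncoder(layer, num_layers=2)
        self.head = nn.Linear(d_model, n_classes)

    def forward(self, waves: torch.Tensor) -> torch.Tensor:
        # waves: (batch, n_llms, seq_len) -> per-token class logits (batch, seq_len, n_classes)
        h = self.conv(waves).transpose(1, 2)
        return self.head(self.encoder(h))

wave = token_log_probs("The quick brown fox jumps over the lazy dog.")
logits = WaveDetector()(wave[None, None, :])  # (1, seq_len, 2) human/AI logits per token
print(logits.shape)
```

Per-token predictions would then have to be pooled into sentence-level decisions, for instance by majority vote over each sentence's tokens, which is how a sentence-level label could be read off this kind of token classifier.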