Threads of Subtlety: Detecting Machine-Generated Texts Through Discourse Motifs
Abstract: With the advent of LLMs (LLM), the line between human-crafted and machine-generated texts has become increasingly blurred. This paper delves into the inquiry of identifying discernible and unique linguistic properties in texts that were written by humans, particularly uncovering the underlying discourse structures of texts beyond their surface structures. Introducing a novel methodology, we leverage hierarchical parse trees and recursive hypergraphs to unveil distinctive discourse patterns in texts produced by both LLMs and humans. Empirical findings demonstrate that, although both LLMs and humans generate distinct discourse patterns influenced by specific domains, human-written texts exhibit more structural variability, reflecting the nuanced nature of human writing in different domains. Notably, incorporating hierarchical discourse features enhances binary classifiers' overall performance in distinguishing between human-written and machine-generated texts, even on out-of-distribution and paraphrased samples. This underscores the significance of incorporating hierarchical discourse features in the analysis of text patterns. The code and dataset are available at https://github.com/minnesotanlp/threads-of-subtlety.
- Graphframex: Towards systematic evaluation of explainability methods for graph neural networks.
- Towards a robust detection of language model-generated text: Is ChatGPT that easy to detect? In Actes de CORIA-TALN 2023. Actes de la 30e Conférence sur le Traitement Automatique des Langues Naturelles (TALN), volume 1 : travaux de recherche originaux – articles longs, pages 14–27, Paris, France. ATALA.
- Don’t lose the message while paraphrasing: A study on content preserving style transfer. In Natural Language Processing and Information Systems, pages 47–61, Cham. Springer Nature Switzerland.
- Longformer: The long-document transformer.
- Daria Beresneva. 2016. Computer-generated text detection using machine learning: A systematic review. In International Conference on Applications of Natural Language to Data Bases.
- Conda: Contrastive domain adaptation for ai-generated text detection.
- Amrita Bhattacharjee and Huan Liu. 2023. Fighting fire with fire: Can chatgpt detect ai-generated text?
- Leo Breiman. 2001. Random forests. Machine Learning, 45(1):5–32.
- Counter Turing test (CT2): AI-generated text detection is not as easy as you may think - introducing AI detectability index (ADI). In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing, pages 2206–2239, Singapore. Association for Computational Linguistics.
- A machine learning approach to the automatic evaluation of machine translation. In Proceedings of the 39th Annual Meeting of the Association for Computational Linguistics, pages 148–155, Toulouse, France. Association for Computational Linguistics.
- Doraid Dalalah and Osama M.A. Dalalah. 2023. The false positives and false negatives of generative ai detection tools in education and academic research: The case of chatgpt. The International Journal of Management Education, 21(2):100822.
- Is GPT-3 text indistinguishable from human text? scarecrow: A framework for scrutinizing machine text. In Proceedings of the 60th Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 7250–7274, Dublin, Ireland. Association for Computational Linguistics.
- Real or fake text?: Investigating human ability to detect boundaries between human-written and machine-generated text. Proceedings of the AAAI Conference on Artificial Intelligence, 37(11):12763–12771.
- Andy Extance. 2023. Chatgpt has entered the classroom: how llms could transform education. Nature, 623(7987):474–477.
- Edyta Gołąb-Andrzejak. 2023. The impact of generative ai and chatgpt on creating digital advertising campaigns. Cybernetics and Systems, page 1–15.
- How close is chatgpt to human experts? comparison corpus, evaluation, and detection.
- Mgtbench: Benchmarking machine-generated text detection.
- Patrick Huber and Giuseppe Carenini. 2022. Towards understanding large-scale discourse structures in pre-trained and fine-tuned language models. In Proceedings of the 2022 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 2376–2394, Seattle, United States. Association for Computational Linguistics.
- Cliff A Joslyn and Kathleen E. Nowak. 2017. Ubergraphs: A definition of a recursive hypergraph structure. ArXiv, abs/1704.05547.
- A watermark for large language models. In Proceedings of the 40th International Conference on Machine Learning, volume 202 of Proceedings of Machine Learning Research, pages 17061–17084. PMLR.
- Outfox: Llm-generated essay detection through in-context learning with adversarially generated examples. ArXiv, abs/2307.11729.
- Openassistant conversations - democratizing large language model alignment. ArXiv, abs/2304.07327.
- Paraphrasing evades detectors of AI-generated text, but retrieval is an effective defense. In Thirty-seventh Conference on Neural Information Processing Systems.
- Alex Lascarides and Nicholas Asher. 2007. Segmented discourse representation theory: Dynamic semantics with discourse structure. In Computing meaning, pages 87–124. Springer.
- Deepfake text detection in the wild.
- Coco: Coherence-enhanced machine-generated text detection under data limitation with contrastive learning.
- Argugpt: evaluating, understanding and identifying argumentative essays generated by gpt models.
- Check me if you can: Detecting chatgpt-generated academic writing using checkgpt.
- DMRST: A joint framework for document-level multilingual RST discourse segmentation and parsing. In Proceedings of the 2nd Workshop on Computational Approaches to Discourse, pages 154–164, Punta Cana, Dominican Republic and Online. Association for Computational Linguistics.
- William C Mann and Sandra A Thompson. 1987. Rhetorical structure theory: A theory of text organization. University of Southern California, Information Sciences Institute Los Angeles.
- Network motifs: Simple building blocks of complex networks. Science, 298(5594):824–827.
- The Penn Discourse Treebank. In Proceedings of the Fourth International Conference on Language Resources and Evaluation (LREC’04), Lisbon, Portugal. European Language Resources Association (ELRA).
- Crosslingual generalization through multitask finetuning. In Proceedings of the 61st Annual Meeting of the Association for Computational Linguistics (Volume 1: Long Papers), pages 15991–16111, Toronto, Canada. Association for Computational Linguistics.
- Training language models to follow instructions with human feedback. In Advances in Neural Information Processing Systems, volume 35, pages 27730–27744. Curran Associates, Inc.
- Sudha Rao and Joel Tetreault. 2018. Dear sir or madam, may I introduce the GYAFC dataset: Corpus, benchmarks and metrics for formality style transfer. In Proceedings of the 2018 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, Volume 1 (Long Papers), pages 129–140, New Orleans, Louisiana. Association for Computational Linguistics.
- Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3982–3992, Hong Kong, China. Association for Computational Linguistics.
- Can ai-generated text be reliably detected?
- Weisfeiler-lehman graph kernels. J. Mach. Learn. Res., 12:2539–2561.
- Red teaming language model detectors with language models.
- Karen Sparck Jones. 1972. A statistical interpretation of term specificity and its application in retrieval. Journal of documentation, 28(1):11–21.
- DetectLLM: Leveraging log rank information for zero-shot detection of machine-generated text. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 12395–12412, Singapore. Association for Computational Linguistics.
- Multiplex network motifs as building blocks of corporate networks. Applied Network Science, 3.
- Does human collaboration enhance the accuracy of identifying llm-generated deepfake texts?
- Howkgpt: Investigating the detection of chatgpt-generated university student homework through context-aware perplexity analysis.
- Graph Attention Networks. International Conference on Learning Representations.
- Ghostbuster: Detecting text ghostwritten by large language models.
- A survey on llm-generated text detection: Necessity, methods, and future directions.
- Predicting discourse trees from transformer-based neural summarizers. In Proceedings of the 2021 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, pages 4139–4152, Online. Association for Computational Linguistics.
- Is chatgpt involved in texts? measure the polish ratio to detect chatgpt-generated text.
- Gnnexplainer: Generating explanations for graph neural networks. In Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc.
- Gpt paternity test: Gpt generated text detection with gpt genetic inheritance. ArXiv, abs/2305.12519.
- Amir Zeldes. 2016. rstweb-a browser-based annotation interface for rhetorical structure theory and discourse relations. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Demonstrations, pages 1–5.
- Defending against neural fake news. ArXiv, abs/1905.12616.
- Pegasus: pre-training with extracted gap-sentences for abstractive summarization. In Proceedings of the 37th International Conference on Machine Learning, ICML’20. JMLR.org.
- Distillation-resistant watermarking for model protection in NLP. In Findings of the Association for Computational Linguistics: EMNLP 2022, pages 5044–5055, Abu Dhabi, United Arab Emirates. Association for Computational Linguistics.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.