
Towards an Understanding and Explanation for Mixed-Initiative Artificial Scientific Text Detection (2304.05011v1)

Published 11 Apr 2023 in cs.HC and cs.CL

Abstract: Large language models (LLMs) have gained popularity in various fields for their exceptional capability of generating human-like text. Their potential misuse has raised social concerns about plagiarism in academic contexts. However, effective artificial scientific text detection is a non-trivial task due to several challenges, including 1) the lack of a clear understanding of the differences between machine-generated and human-written scientific text, 2) the poor generalization performance of existing methods caused by out-of-distribution issues, and 3) the limited support for human-machine collaboration with sufficient interpretability during the detection process. In this paper, we first identify the critical distinctions between machine-generated and human-written scientific text through a quantitative experiment. Then, we propose a mixed-initiative workflow that combines human experts' prior knowledge with machine intelligence, along with a visual analytics prototype to facilitate efficient and trustworthy scientific text detection. Finally, we demonstrate the effectiveness of our approach through two case studies and a controlled user study with proficient researchers. We also provide design implications for interactive artificial text detection tools in high-stakes decision-making scenarios.

Authors (8)
  1. Luoxuan Weng (6 papers)
  2. Minfeng Zhu (25 papers)
  3. Kam Kwai Wong (7 papers)
  4. Shi Liu (75 papers)
  5. Jiashun Sun (1 paper)
  6. Hang Zhu (16 papers)
  7. Dongming Han (5 papers)
  8. Wei Chen (1290 papers)
Citations (8)