
SemEval-2024 Task 8: Multidomain, Multimodel and Multilingual Machine-Generated Text Detection (2404.14183v1)

Published 22 Apr 2024 in cs.CL

Abstract: We present the results and the main findings of SemEval-2024 Task 8: Multigenerator, Multidomain, and Multilingual Machine-Generated Text Detection. The task featured three subtasks. Subtask A is a binary classification task determining whether a text is written by a human or generated by a machine. This subtask has two tracks: a monolingual track focused solely on English texts and a multilingual track. Subtask B is to detect the exact source of a text, discerning whether it is written by a human or generated by a specific LLM. Subtask C aims to identify the changing point within a text, at which the authorship transitions from human to machine. The task attracted a large number of participants: subtask A monolingual (126), subtask A multilingual (59), subtask B (70), and subtask C (30). In this paper, we present the task, analyze the results, and discuss the system submissions and the methods they used. For all subtasks, the best systems used LLMs.

Authors (15)
  1. Yuxia Wang (41 papers)
  2. Jonibek Mansurov (14 papers)
  3. Petar Ivanov (4 papers)
  4. Jinyan Su (20 papers)
  5. Artem Shelmanov (29 papers)
  6. Akim Tsvigun (12 papers)
  7. Osama Mohammed Afzal (9 papers)
  8. Tarek Mahmoud (7 papers)
  9. Giovanni Puccetti (12 papers)
  10. Thomas Arnold (13 papers)
  11. Chenxi Whitehouse (17 papers)
  12. Alham Fikri Aji (94 papers)
  13. Nizar Habash (66 papers)
  14. Iryna Gurevych (264 papers)
  15. Preslav Nakov (253 papers)
Citations (30)