Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
51 tokens/sec
GPT-4o
60 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
8 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MULTITuDE: Large-Scale Multilingual Machine-Generated Text Detection Benchmark (2310.13606v1)

Published 20 Oct 2023 in cs.CL and cs.AI

Abstract: There is a lack of research into capabilities of recent LLMs to generate convincing text in languages other than English and into performance of detectors of machine-generated text in multilingual settings. This is also reflected in the available benchmarks which lack authentic texts in languages other than English and predominantly cover older generators. To fill this gap, we introduce MULTITuDE, a novel benchmarking dataset for multilingual machine-generated text detection comprising of 74,081 authentic and machine-generated texts in 11 languages (ar, ca, cs, de, en, es, nl, pt, ru, uk, and zh) generated by 8 multilingual LLMs. Using this benchmark, we compare the performance of zero-shot (statistical and black-box) and fine-tuned detectors. Considering the multilinguality, we evaluate 1) how these detectors generalize to unseen languages (linguistically similar as well as dissimilar) and unseen LLMs and 2) whether the detectors improve their performance when trained on multiple languages.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (11)
  1. Dominik Macko (13 papers)
  2. Robert Moro (22 papers)
  3. Adaku Uchendu (16 papers)
  4. Jason Samuel Lucas (3 papers)
  5. Michiharu Yamashita (8 papers)
  6. Matúš Pikuliak (12 papers)
  7. Ivan Srba (28 papers)
  8. Thai Le (38 papers)
  9. Dongwon Lee (65 papers)
  10. Jakub Simko (18 papers)
  11. Maria Bielikova (27 papers)