Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RTP-LX: Can LLMs Evaluate Toxicity in Multilingual Scenarios? (2404.14397v2)

Published 22 Apr 2024 in cs.CL, cs.CY, and cs.LG

Abstract: LLMs and small LLMs (SLMs) are being adopted at remarkable speed, although their safety still remains a serious concern. With the advent of multilingual S/LLMs, the question now becomes a matter of scale: can we expand multilingual safety evaluations of these models with the same velocity at which they are deployed? To this end, we introduce RTP-LX, a human-transcreated and human-annotated corpus of toxic prompts and outputs in 28 languages. RTP-LX follows participatory design practices, and a portion of the corpus is especially designed to detect culturally-specific toxic language. We evaluate 10 S/LLMs on their ability to detect toxic content in a culturally-sensitive, multilingual scenario. We find that, although they typically score acceptably in terms of accuracy, they have low agreement with human judges when scoring holistically the toxicity of a prompt; and have difficulty discerning harm in context-dependent scenarios, particularly with subtle-yet-harmful content (e.g. microaggressions, bias). We release this dataset to contribute to further reduce harmful uses of these models and improve their safe deployment.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (33)
  1. Adrian de Wynter (20 papers)
  2. Ishaan Watts (4 papers)
  3. Nektar Ege Altıntoprak (1 paper)
  4. Tua Wongsangaroonsri (1 paper)
  5. Minghui Zhang (42 papers)
  6. Noura Farra (6 papers)
  7. Lena Baur (1 paper)
  8. Samantha Claudet (1 paper)
  9. Pavel Gajdusek (1 paper)
  10. Can Gören (1 paper)
  11. Qilong Gu (8 papers)
  12. Anna Kaminska (15 papers)
  13. Ruby Kuo (1 paper)
  14. Akiko Kyuba (1 paper)
  15. Jongho Lee (38 papers)
  16. Kartik Mathur (1 paper)
  17. Petter Merok (1 paper)
  18. Nani Paananen (1 paper)
  19. Vesa-Matti Paananen (1 paper)
  20. Anna Pavlenko (4 papers)
Citations (9)