Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

RECAST: Interactive Auditing of Automatic Toxicity Detection Models (2001.01819v2)

Published 7 Jan 2020 in cs.CL, cs.CY, and cs.LG

Abstract: As toxic language becomes nearly pervasive online, there has been increasing interest in leveraging the advancements in NLP, from very large transformer models to automatically detecting and removing toxic comments. Despite the fairness concerns, lack of adversarial robustness, and limited prediction explainability for deep learning systems, there is currently little work for auditing these systems and understanding how they work for both developers and users. We present our ongoing work, RECAST, an interactive tool for examining toxicity detection models by visualizing explanations for predictions and providing alternative wordings for detected toxic speech.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Austin P. Wright (13 papers)
  2. Omar Shaikh (23 papers)
  3. Haekyu Park (21 papers)
  4. Will Epperson (9 papers)
  5. Muhammed Ahmed (2 papers)
  6. Stephane Pinel (2 papers)
  7. Diyi Yang (151 papers)
  8. Duen Horng Chau (109 papers)
Citations (6)