Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

BiasAlert: A Plug-and-play Tool for Social Bias Detection in LLMs (2407.10241v2)

Published 14 Jul 2024 in cs.CL

Abstract: Evaluating the bias in LLMs becomes increasingly crucial with their rapid development. However, existing evaluation methods rely on fixed-form outputs and cannot adapt to the flexible open-text generation scenarios of LLMs (e.g., sentence completion and question answering). To address this, we introduce BiasAlert, a plug-and-play tool designed to detect social bias in open-text generations of LLMs. BiasAlert integrates external human knowledge with inherent reasoning capabilities to detect bias reliably. Extensive experiments demonstrate that BiasAlert significantly outperforms existing state-of-the-art methods like GPT4-as-A-Judge in detecting bias. Furthermore, through application studies, we demonstrate the utility of BiasAlert in reliable LLM bias evaluation and bias mitigation across various scenarios. Model and code will be publicly released.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Zhiting Fan (4 papers)
  2. Ruizhe Chen (32 papers)
  3. Ruiling Xu (1 paper)
  4. Zuozhu Liu (78 papers)
Citations (6)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets