Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Steering LLMs Towards Unbiased Responses: A Causality-Guided Debiasing Framework (2403.08743v1)

Published 13 Mar 2024 in cs.CL, cs.AI, and cs.LG

Abstract: LLMs can easily generate biased and discriminative responses. As LLMs tap into consequential decision-making (e.g., hiring and healthcare), it is of crucial importance to develop strategies to mitigate these biases. This paper focuses on social bias, tackling the association between demographic information and LLM outputs. We propose a causality-guided debiasing framework that utilizes causal understandings of (1) the data-generating process of the training corpus fed to LLMs, and (2) the internal reasoning process of LLM inference, to guide the design of prompts for debiasing LLM outputs through selection mechanisms. Our framework unifies existing de-biasing prompting approaches such as inhibitive instructions and in-context contrastive examples, and sheds light on new ways of debiasing by encouraging bias-free reasoning. Our strong empirical performance on real-world datasets demonstrates that our framework provides principled guidelines on debiasing LLM outputs even with only the black-box access.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Jingling Li (12 papers)
  2. Zeyu Tang (20 papers)
  3. Xiaoyu Liu (138 papers)
  4. Peter Spirtes (23 papers)
  5. Kun Zhang (353 papers)
  6. Liu Leqi (26 papers)
  7. Yang Liu (2253 papers)
Citations (10)