Leveraging Prototypical Representations for Mitigating Social Bias without Demographic Information (2403.09516v3)
Abstract: Mitigating social biases typically requires identifying the social groups associated with each data sample. In this paper, we present DAFair, a novel approach to address social bias in LLMs. Unlike traditional methods that rely on explicit demographic labels, our approach does not require any such information. Instead, we leverage predefined prototypical demographic texts and incorporate a regularization term during the fine-tuning process to mitigate bias in the model's representations. Our empirical results across two tasks and two models demonstrate the effectiveness of our method compared to previous approaches that do not rely on labeled data. Moreover, with limited demographic-annotated data, our approach outperforms common debiasing approaches.
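The abstract only names the ingredients of DAFair (predefined prototypical demographic texts plus a regularization term added during fine-tuning), so the snippet below is a minimal illustrative sketch rather than the paper's exact loss. It assumes, purely for illustration, that the regularizer is a KL term pushing the model's similarity distribution over prototype representations toward uniform, so the task representation is equally close to every demographic prototype. The backbone (`bert-base-uncased`), the prototype sentences, and the helper names (`encode`, `dafair_style_regularizer`, `lambda_reg`) are placeholders introduced here, not taken from the paper.

```python
# Sketch only: assumes a KL-to-uniform regularizer over prototype similarities,
# which is one plausible reading of "a regularization term during fine-tuning".
import torch
import torch.nn.functional as F
from transformers import AutoModel, AutoTokenizer

model_name = "bert-base-uncased"  # illustrative backbone, not necessarily the paper's
tokenizer = AutoTokenizer.from_pretrained(model_name)
encoder = AutoModel.from_pretrained(model_name)

# Hypothetical prototypical demographic texts (placeholders, not the paper's prompts).
prototype_texts = [
    "He is a man who works as a professional.",
    "She is a woman who works as a professional.",
]

def encode(texts):
    """Return [CLS] representations for a batch of texts."""
    batch = tokenizer(texts, padding=True, truncation=True, return_tensors="pt")
    return encoder(**batch).last_hidden_state[:, 0]  # shape: (num_texts, hidden_dim)

def dafair_style_regularizer(sample_reprs, prototype_reprs, temperature=1.0):
    """KL divergence between a uniform target and the softmax-normalized
    similarities of each sample to the prototype representations.
    The term is zero when a sample is equally similar to all prototypes."""
    sims = sample_reprs @ prototype_reprs.T / temperature   # (batch, num_prototypes)
    log_p = F.log_softmax(sims, dim=-1)                     # model's similarity distribution (log space)
    uniform = torch.full_like(log_p, 1.0 / log_p.size(-1))  # uniform target over prototypes
    return F.kl_div(log_p, uniform, reduction="batchmean")

# Hypothetical use inside a fine-tuning loop, where task_loss comes from the
# downstream classification head and lambda_reg weights the bias regularizer:
#   proto_reprs = encode(prototype_texts)
#   reg = dafair_style_regularizer(cls_reprs, proto_reprs)
#   loss = task_loss + lambda_reg * reg
```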