Measuring Social Biases in Masked Language Models by Proxy of Prediction Quality (2402.13954v2)
Abstract: Innovative transformer-based language models produce contextually-aware token embeddings and have achieved state-of-the-art performance on a variety of natural language tasks, but have been shown to encode unwanted biases for downstream applications. In this paper, we evaluate the social biases encoded by transformers trained with the masked language modeling objective, using proposed proxy functions within an iterative masking experiment to measure the quality of transformer models' predictions and to assess the preference of MLMs toward disadvantaged and advantaged groups. We compare our bias estimations with those produced by other evaluation methods on benchmark datasets and assess their alignment with human-annotated biases. We find relatively high religious and disability biases across the considered MLMs, and low gender bias in one dataset relative to another. We extend previous work by evaluating the social biases introduced after retraining an MLM under the masked language modeling objective, and find that the proposed measures produce more accurate estimates of biases introduced by retraining than other measures based on the relative preference for biased sentences between models.
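The iterative masking experiment described above can be sketched as a pseudo-log-likelihood computation: each token in a sentence is masked in turn, and the log-probability the MLM assigns to the true token at the masked position is accumulated. The sketch below is a minimal, hypothetical illustration; `toy_prob` stands in for an actual MLM's softmax output (a real experiment would query a transformer such as BERT), and is not part of the paper's method.

```python
import math

def pseudo_log_likelihood(tokens, prob_fn, mask_token="[MASK]"):
    """Iteratively mask each position and sum the log-probability
    that the model assigns to the true token at that position."""
    total = 0.0
    for i, true_tok in enumerate(tokens):
        # Replace position i with the mask token, keeping all other context.
        masked = tokens[:i] + [mask_token] + tokens[i + 1:]
        p = prob_fn(masked, i, true_tok)
        total += math.log(p)
    return total

# Hypothetical toy scorer standing in for an MLM's predicted distribution.
def toy_prob(masked_tokens, i, candidate):
    return 0.5 if candidate in {"the", "is"} else 0.1

sentence = ["the", "doctor", "is", "kind"]
score = pseudo_log_likelihood(sentence, toy_prob)
```

Comparing such scores between minimally different sentence pairs (one mentioning a disadvantaged group, one an advantaged group) yields a preference-based bias estimate.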
Authors: Rahul Zalkikar, Kanchan Chandra