Selection Bias Induced Spurious Correlations in Large Language Models

Published 18 Jul 2022 in cs.CL and cs.AI | (2207.08982v1)

Abstract: In this work we show how LLMs can learn statistical dependencies between otherwise unconditionally independent variables due to dataset selection bias. To demonstrate the effect, we developed a masked gender task that can be applied to BERT-family models to reveal spurious correlations between predicted gender pronouns and a variety of seemingly gender-neutral variables like date and location, on pre-trained (unmodified) BERT and RoBERTa large models. Finally, we provide an online demo, inviting readers to experiment further.