
Understanding the Interplay of Scale, Data, and Bias in Language Models: A Case Study with BERT (2407.21058v1)

Published 25 Jul 2024 in cs.CL and cs.AI

Abstract: In the current landscape of language model research, larger models, larger datasets and more compute seem to be the only way to advance towards intelligence. While there have been extensive studies of scaling laws and models' scaling behaviors, the effect of scale on a model's social biases and stereotyping tendencies has received less attention. In this study, we explore the influence of model scale and pre-training data on its learnt social biases. We focus on BERT -- an extremely popular language model -- and investigate biases as they show up during language modeling (upstream), as well as during classification applications after fine-tuning (downstream). Our experiments on four architecture sizes of BERT demonstrate that pre-training data substantially influences how upstream biases evolve with model scale. With increasing scale, models pre-trained on large internet scrapes like Common Crawl exhibit higher toxicity, whereas models pre-trained on moderated data sources like Wikipedia show greater gender stereotypes. However, downstream biases generally decrease with increasing model scale, irrespective of the pre-training data. Our results highlight the qualitative role of pre-training data in the biased behavior of language models, an often overlooked aspect in the study of scale. Through a detailed case study of BERT, we shed light on the complex interplay of data and model scale, and investigate how it translates to concrete biases.
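To make the upstream/downstream distinction concrete, the sketch below shows one common way to probe upstream bias in a masked language model such as BERT: comparing the probabilities the model assigns to gendered pronouns in occupation templates. This is a minimal illustration, not the paper's exact metric; the model name, templates, and pronoun candidates are assumptions for the example, and the same probe could be repeated across BERT architecture sizes to study the effect of scale.

```python
# Minimal sketch (assumed setup, not the paper's exact method): probe a masked
# language model for gendered associations at the [MASK] position.
import torch
from transformers import AutoTokenizer, AutoModelForMaskedLM

model_name = "bert-base-uncased"  # repeat with other BERT sizes to study scale
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForMaskedLM.from_pretrained(model_name).eval()

def pronoun_probs(template: str, candidates=("he", "she")):
    """Return P(candidate | template) at the [MASK] position for each candidate."""
    inputs = tokenizer(template, return_tensors="pt")
    # Locate the [MASK] token in the encoded sequence.
    mask_pos = (inputs.input_ids == tokenizer.mask_token_id).nonzero()[0, 1]
    with torch.no_grad():
        logits = model(**inputs).logits[0, mask_pos]
    probs = logits.softmax(dim=-1)
    return {c: probs[tokenizer.convert_tokens_to_ids(c)].item() for c in candidates}

# Hypothetical occupation templates; a skewed he/she ratio indicates a
# stereotyped association learned during pre-training.
print(pronoun_probs("[MASK] is a nurse."))
print(pronoun_probs("[MASK] is an engineer."))
```

Downstream bias, by contrast, would be measured after fine-tuning the same checkpoints on a classification task and comparing predictions across demographic groups.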

Authors (5)
  1. Muhammad Ali (37 papers)
  2. Swetasudha Panda (1 paper)
  3. Qinlan Shen (6 papers)
  4. Michael Wick (12 papers)
  5. Ari Kobren (9 papers)
Citations (2)