A Trip Towards Fairness: Bias and De-Biasing in Large Language Models (2305.13862v2)

Published 23 May 2023 in cs.CL

Abstract: Cheap-to-Build Very Large Language Models (CtB-LLMs) with affordable training are emerging as the next big revolution in natural language processing and understanding. These CtB-LLMs are democratizing access to trainable Very Large Language Models (VLLMs) and may therefore become the building blocks of many NLP systems solving downstream tasks. Hence, even a small bias in CtB-LLMs can cause significant harm once it propagates. In this paper, we conducted a large-scale investigation of bias in three families of CtB-LLMs and showed that debiasing techniques are effective and practical. According to current tests, the LLaMA and OPT families exhibit significant bias in gender, race, religion, and profession. In contrast to analyses of other LLMs, we found that bias depends not on the number of parameters but on perplexity. Finally, debiasing OPT with LoRA reduces bias by up to 4.12 points on the normalized stereotype score.
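The debiasing result above relies on LoRA (low-rank adaptation), which trains a small set of injected adapter weights while the base model stays frozen, making the procedure cheap. Below is a minimal sketch of what such a setup might look like with the HuggingFace `transformers` and `peft` libraries; the OPT checkpoint, adapter rank, target modules, and the fine-tuning corpus are illustrative assumptions, not the authors' exact configuration.

```python
# Minimal LoRA debiasing sketch, assuming HuggingFace transformers + peft.
# Checkpoint size and hyperparameters below are assumptions for illustration.
from transformers import AutoModelForCausalLM, AutoTokenizer
from peft import LoraConfig, get_peft_model

model_name = "facebook/opt-1.3b"  # any OPT checkpoint; size is an assumption
tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForCausalLM.from_pretrained(model_name)

# LoRA injects small trainable low-rank adapters into the attention
# projections; all base weights remain frozen during fine-tuning.
lora_config = LoraConfig(
    r=8,                                    # adapter rank (assumption)
    lora_alpha=16,
    target_modules=["q_proj", "v_proj"],    # OPT attention projections
    lora_dropout=0.05,
    task_type="CAUSAL_LM",
)
model = get_peft_model(model, lora_config)
model.print_trainable_parameters()  # only adapter weights are trainable

# From here, fine-tune with the standard causal-LM loss on a balanced or
# anti-stereotypical corpus, then re-run a stereotype benchmark to measure
# the change in the normalized stereotype score.
```

After fine-tuning, only the adapter weights need to be saved and merged, which is what makes LoRA attractive for debiasing large checkpoints on modest hardware.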

Authors (5)
  1. Leonardo Ranaldi (18 papers)
  2. Elena Sofia Ruzzetti (11 papers)
  3. Davide Venditti (4 papers)
  4. Dario Onorati (3 papers)
  5. Fabio Massimo Zanzotto (25 papers)
Citations (28)
