Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

The power of Prompts: Evaluating and Mitigating Gender Bias in MT with LLMs (2407.18786v1)

Published 26 Jul 2024 in cs.CL

Abstract: This paper studies gender bias in machine translation through the lens of LLMs. Four widely-used test sets are employed to benchmark various base LLMs, comparing their translation quality and gender bias against state-of-the-art Neural Machine Translation (NMT) models for English to Catalan (En $\rightarrow$ Ca) and English to Spanish (En $\rightarrow$ Es) translation directions. Our findings reveal pervasive gender bias across all models, with base LLMs exhibiting a higher degree of bias compared to NMT models. To combat this bias, we explore prompting engineering techniques applied to an instruction-tuned LLM. We identify a prompt structure that significantly reduces gender bias by up to 12% on the WinoMT evaluation dataset compared to more straightforward prompts. These results significantly reduce the gender bias accuracy gap between LLMs and traditional NMT systems.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Aleix Sant (2 papers)
  2. Carlos Escolano (20 papers)
  3. Audrey Mash (3 papers)
  4. Francesca De Luca Fornaciari (3 papers)
  5. Maite Melero (9 papers)