Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Evaluating Gender Bias in Hindi-English Machine Translation (2106.08680v1)

Published 16 Jun 2021 in cs.CL, cs.AI, cs.CY, and cs.LG

Abstract: With LLMs being deployed increasingly in the real world, it is essential to address the issue of the fairness of their outputs. The word embedding representations of these LLMs often implicitly draw unwanted associations that form a social bias within the model. The nature of gendered languages like Hindi, poses an additional problem to the quantification and mitigation of bias, owing to the change in the form of the words in the sentence, based on the gender of the subject. Additionally, there is sparse work done in the realm of measuring and debiasing systems for Indic languages. In our work, we attempt to evaluate and quantify the gender bias within a Hindi-English machine translation system. We implement a modified version of the existing TGBI metric based on the grammatical considerations for Hindi. We also compare and contrast the resulting bias measurements across multiple metrics for pre-trained embeddings and the ones learned by our machine translation model.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Gauri Gupta (8 papers)
  2. Krithika Ramesh (7 papers)
  3. Sanjay Singh (69 papers)
Citations (20)