Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Examining Gender Bias in Languages with Grammatical Gender (1909.02224v2)

Published 5 Sep 2019 in cs.CL

Abstract: Recent studies have shown that word embeddings exhibit gender bias inherited from the training corpora. However, most studies to date have focused on quantifying and mitigating such bias only in English. These analyses cannot be directly extended to languages that exhibit morphological agreement on gender, such as Spanish and French. In this paper, we propose new metrics for evaluating gender bias in word embeddings of these languages and further demonstrate evidence of gender bias in bilingual embeddings which align these languages with English. Finally, we extend an existing approach to mitigate gender bias in word embeddings under both monolingual and bilingual settings. Experiments on modified Word Embedding Association Test, word similarity, word translation, and word pair translation tasks show that the proposed approaches effectively reduce the gender bias while preserving the utility of the embeddings.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Pei Zhou (30 papers)
  2. Weijia Shi (55 papers)
  3. Jieyu Zhao (54 papers)
  4. Kuan-Hao Huang (33 papers)
  5. Muhao Chen (159 papers)
  6. Ryan Cotterell (226 papers)
  7. Kai-Wei Chang (292 papers)
Citations (99)