Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

GATE: A Challenge Set for Gender-Ambiguous Translation Examples (2303.03975v1)

Published 7 Mar 2023 in cs.CL

Abstract: Although recent years have brought significant progress in improving translation of unambiguously gendered sentences, translation of ambiguously gendered input remains relatively unexplored. When source gender is ambiguous, machine translation models typically default to stereotypical gender roles, perpetuating harmful bias. Recent work has led to the development of "gender rewriters" that generate alternative gender translations on such ambiguous inputs, but such systems are plagued by poor linguistic coverage. To encourage better performance on this task we present and release GATE, a linguistically diverse corpus of gender-ambiguous source sentences along with multiple alternative target language translations. We also provide tools for evaluation and system analysis when using GATE and use them to evaluate our translation rewriter system.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Spencer Rarrick (4 papers)
  2. Ranjita Naik (8 papers)
  3. Varun Mathur (6 papers)
  4. Sundar Poudel (3 papers)
  5. Vishal Chowdhary (7 papers)
Citations (14)