Gender Neutralization for an Inclusive Machine Translation: from Theoretical Foundations to Open Challenges (2301.10075v3)
Abstract: Gender inclusivity in language technologies has become a prominent research topic. In this study, we explore gender-neutral translation (GNT) as a form of gender inclusivity and a goal to be achieved by machine translation (MT) models, which have been found to perpetuate gender bias and discrimination. Specifically, we focus on translation from English into Italian, a language pair representative of salient gender-related linguistic transfer problems. To define GNT, we review a selection of relevant institutional guidelines for gender-inclusive language, discuss its scenarios of use, and examine the technical challenges of performing GNT in MT, concluding with a discussion of potential solutions to encourage advancements toward greater inclusivity in MT.
Authors: Andrea Piergentili, Dennis Fucci, Beatrice Savoldi, Luisa Bentivogli, Matteo Negri