Stop! In the Name of Flaws: Disentangling Personal Names and Sociodemographic Attributes in NLP (2405.17159v2)
Abstract: Personal names simultaneously differentiate individuals and categorize them in ways that are important in a given society. While the natural language processing community has thus associated personal names with sociodemographic characteristics in a variety of tasks, researchers have engaged to varying degrees with the established methodological problems in doing so. To guide future work that uses names and sociodemographic characteristics, we provide an overview of relevant research: first, we present an interdisciplinary background on names and naming. We then survey the issues inherent to associating names with sociodemographic attributes, covering problems of validity (e.g., systematic error, construct validity), as well as ethical concerns (e.g., harms, differential impact, cultural insensitivity). Finally, we provide guiding questions along with normative recommendations to avoid validity and ethical pitfalls when dealing with names and sociodemographic characteristics in natural language processing.