Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Gender inference: can chatGPT outperform common commercial tools? (2312.00805v1)

Published 24 Nov 2023 in cs.CL and cs.AI

Abstract: An increasing number of studies use gender information to understand phenomena such as gender bias, inequity in access and participation, or the impact of the Covid pandemic response. Unfortunately, most datasets do not include self-reported gender information, making it necessary for researchers to infer gender from other information, such as names or names and country information. An important limitation of these tools is that they fail to appropriately capture the fact that gender exists on a non-binary scale, however, it remains important to evaluate and compare how well these tools perform in a variety of contexts. In this paper, we compare the performance of a generative AI tool ChatGPT with three commercially available list-based and machine learning-based gender inference tools (Namsor, Gender-API, and genderize.io) on a unique dataset. Specifically, we use a large Olympic athlete dataset and report how variations in the input (e.g., first name and first and last name, with and without country information) impact the accuracy of their predictions. We report results for the full set, as well as for the subsets: medal versus non-medal winners, athletes from the largest English-speaking countries, and athletes from East Asia. On these sets, we find that Namsor is the best traditional commercially available tool. However, ChatGPT performs at least as well as Namsor and often outperforms it, especially for the female sample when country and/or last name information is available. All tools perform better on medalists versus non-medalists and on names from English-speaking countries. Although not designed for this purpose, ChatGPT may be a cost-effective tool for gender prediction. In the future, it might even be possible for ChatGPT or other large scale LLMs to better identify self-reported gender rather than report gender on a binary scale.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (51)
  1. Gender trends in computer science authorship. Communications of the ACM, 64(3):78–84, March 2021. ISSN 0001-0782, 1557-7317. doi:10.1145/3430803.
  2. Women are underrepresented in computational biology: An analysis of the scholarly literature in biology, computer science and computational biology. PLOS Computational Biology, 13(10):e1005134, October 2017. ISSN 1553-7358. doi:10.1371/journal.pcbi.1005134.
  3. Trends in Gender Disparities in Authorship of Arthroplasty Research. JBJS, 102(23):e131, December 2020. ISSN 0021-9355. doi:10.2106/JBJS.20.00258.
  4. Gender Differences in Publication Authorship During COVID-19: A Bibliometric Analysis of High-Impact Cardiology Journals. Journal of the American Heart Association, 10(5):e019005, March 2021. doi:10.1161/JAHA.120.019005.
  5. Gender (Im)balance in Citation Practices in Cognitive Neuroscience. Journal of Cognitive Neuroscience, 33(1):3–7, January 2021. ISSN 0898-929X. doi:10.1162/jocn_a_01643.
  6. The extent and drivers of gender imbalance in neuroscience reference lists. Nature Neuroscience, 23(8):918–926, August 2020. ISSN 1546-1726. doi:10.1038/s41593-020-0658-y.
  7. Quantitative evaluation of gender bias in astronomical publications from citation counts. Nature Astronomy, 1(6):1–5, May 2017. ISSN 2397-3366. doi:10.1038/s41550-017-0141.
  8. Gender diversity in the management field: Does it matter for research outcomes? Research Policy, 48(7):1617–1632, September 2019. ISSN 0048-7333. doi:10.1016/j.respol.2019.03.006.
  9. One and a half million medical papers reveal a link between author gender and attention to gender and sex analysis. Nature Human Behaviour, 1(11):791–796, November 2017. ISSN 2397-3374. doi:10.1038/s41562-017-0235-x.
  10. Gender Representation on Journal Editorial Boards in the Mathematical Sciences. PLOS ONE, 11(8):e0161357, August 2016. ISSN 1932-6203. doi:10.1371/journal.pone.0161357.
  11. Women Representation on Editorial Boards in Latin America Journals: Promoting Gender Equity in Academic Surgery, Anesthesia, and Obstetrics. World Journal of Surgery, 47(4):845–853, April 2023. ISSN 1432-2323. doi:10.1007/s00268-022-06872-8.
  12. Author Gender Inequality in Medical Imaging Journals and the COVID-19 Pandemic. Radiology, 300(1):E301–E307, July 2021. ISSN 0033-8419. doi:10.1148/radiol.2021204417.
  13. Melina R. Kibbe. Consequences of the COVID-19 Pandemic on Manuscript Submissions by Women. JAMA Surgery, 155(9):803–804, September 2020. ISSN 2168-6254. doi:10.1001/jamasurg.2020.3917.
  14. Impact of the Coronavirus Disease 2019 Pandemic on Authorship Gender in The Journal of Pediatrics: Disproportionate Productivity by International Male Researchers. The Journal of Pediatrics, 231:50–54, April 2021. ISSN 0022-3476. doi:10.1016/j.jpeds.2020.12.032.
  15. Gender Differences in First and Corresponding Authorship in Public Health Research Submissions During the COVID-19 Pandemic. American Journal of Public Health, 111(1):159–163, January 2021. ISSN 0090-0036. doi:10.2105/AJPH.2020.305975.
  16. The impact of COVID-19 on academic productivity by female physicians and researchers in transfusion medicine. Transfusion, 61(6):1690–1693, 2021. ISSN 1537-2995. doi:10.1111/trf.16306.
  17. On the value of encouraging gender tolerance and inclusiveness in software engineering communities. Information and Software Technology, 139:106667, November 2021. ISSN 0950-5849. doi:10.1016/j.infsof.2021.106667.
  18. From Seeker Side to Investor Side: Gender Dynamics in UK Equity Crowdfunding Investments. In Elisabetta Gualandri, Valeria Venturelli, and Alex Sclip, editors, Frontier Topics in Banking: Investigating New Trends and Recent Developments in the Financial Industry, Palgrave Macmillan Studies in Banking and Financial Institutions, pages 97–115. Springer International Publishing, Cham, 2019. ISBN 978-3-030-16295-5. doi:10.1007/978-3-030-16295-5_4.
  19. Does equity crowdfunding democratize entrepreneurial finance? Small Business Economics, 56(2):533–552, February 2021. ISSN 1573-0913. doi:10.1007/s11187-019-00188-z.
  20. Gender, Race, and Entrepreneurship: A Randomized Field Experiment on Venture Capitalists and Angels, December 2020.
  21. A study of gender in user reviews on the google play store. Empirical Software Engineering, 27(2):34, 2022.
  22. Marysia Szymkowiak. Genderizing fisheries: Assessing over thirty years of women’s participation in Alaska fisheries. Marine Policy, 115:103846, May 2020. ISSN 0308-597X. doi:10.1016/j.marpol.2020.103846.
  23. Inferring Gender from Names on the Web: A Comparative Evaluation of Gender Detection Methods. In Proceedings of the 25th International Conference Companion on World Wide Web - WWW ’16 Companion, pages 53–54, Montréal, Québec, Canada, 2016. ACM Press. ISBN 978-1-4503-4144-8. doi:10.1145/2872518.2889385.
  24. Comparison and benchmark of name-to-gender inference services. PeerJ Computer Science, 4:e156, 2018.
  25. Automatic classification of single facial images. IEEE Transactions on Pattern Analysis and Machine Intelligence, 21(12):1357–1362, December 1999. ISSN 1939-3539. doi:10.1109/34.817413.
  26. Gender identification using frontal facial images. In 2005 IEEE International Conference on Multimedia and Expo, pages 4 pp.–, July 2005. doi:10.1109/ICME.2005.1521613.
  27. What’s in a name?–gender classification of names with character based machine learning models. Data Mining and Knowledge Discovery, 35(4):1537–1563, 2021.
  28. Voice based gender classification using machine learning. IOP Conference Series: Materials Science and Engineering, 263(4):042083, November 2017. ISSN 1757-899X. doi:10.1088/1757-899X/263/4/042083.
  29. Gender Recognition by Voice Using an Improved Self-Labeled Algorithm. Machine Learning and Knowledge Extraction, 1(1):492–503, March 2019. ISSN 2504-4990. doi:10.3390/make1010030.
  30. “So what if ChatGPT wrote it?” Multidisciplinary perspectives on opportunities, challenges and implications of generative conversational AI for research, practice and policy. International Journal of Information Management, 71:102642, August 2023. ISSN 0268-4012. doi:10.1016/j.ijinfomgt.2023.102642.
  31. ChatGPT and a new academic reality: Artificial Intelligence-written research papers and the ethics of the large language models in scholarly publishing. Journal of the Association for Information Science and Technology, 74(5):570–581, 2023. ISSN 2330-1643. doi:10.1002/asi.24750.
  32. What ChatGPT and generative AI mean for science. Nature, 614(7947):214–216, February 2023. doi:10.1038/d41586-023-00340-6.
  33. Chatting about ChatGPT: How may AI and GPT impact academia and libraries? Library Hi Tech News, ahead-of-print(ahead-of-print), January 2023. ISSN 0741-9058. doi:10.1108/LHTN-01-2023-0009.
  34. Viriya Taecharungroj. “What Can ChatGPT Do?” Analyzing Early Reactions to the Innovative AI Chatbot on Twitter. Big Data and Cognitive Computing, 7(1):35, March 2023. ISSN 2504-2289. doi:10.3390/bdcc7010035.
  35. Bibliometrics: Global gender disparities in science. Nature, 504(7479):211–213, 2013.
  36. Naive-deep face recognition: Touching the limit of lfw benchmark or not? arXiv preprint arXiv:1501.04690, 2015.
  37. H Mihaljević and Lucía Santamaría. Evaluation of name-based gender inference methods, 2018. URL https://github.com/GenderGapSTEM-PublicationAnalysis/name_gender_inference.
  38. Paul Sebo. Performance of gender detection tools: A comparative study of name-to-gender inference services. Journal of the Medical Library Association : JMLA, 109(3):414–421, 2021a. ISSN 1536-5050. doi:10.5195/jmla.2021.1185.
  39. Paul Sebo. Using genderize.io to infer the gender of first names: How to improve the accuracy of the inference. Journal of the Medical Library Association : JMLA, 109(4):609–612, 2021b. ISSN 1536-5050. doi:10.5195/jmla.2021.1252.
  40. Context-sensitive gender inference of named entities in text. Information Processing & Management, 58(1):102423, 2021.
  41. Paul Sebo. Are Accuracy Parameters Useful for Improving the Performance of Gender Detection Tools? A Comparative Study with Western and Chinese Names. Journal of General Internal Medicine, 37(15):4024–4027, November 2022. ISSN 1525-1497. doi:10.1007/s11606-022-07469-6.
  42. Gender identification in Chinese names. Lingua, 234:102759, January 2020. ISSN 0024-3841. doi:10.1016/j.lingua.2019.102759.
  43. Science-Metrix. Analytical support for bibliometrics indicators: development of bibliometric indicators to measure women’s contribution to scientific publications, 2018. URL http://www.science-metrix.com/sites/default/files/science-metrix/publications/science-metrix_bibliometric_indicators_womens_contribution_to_science_report.pdf.
  44. Language models are few-shot learners. In Proceedings of the 34th International Conference on Neural Information Processing Systems, NIPS’20, Red Hook, NY, USA, 2020. Curran Associates Inc. ISBN 9781713829546.
  45. ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning, April 2023.
  46. Namsor. Determine the gender of a name. https://namsor.app/features/gender-name.
  47. Historical comparison of gender inequality in scientific careers across countries and disciplines. Proceedings of the National Academy of Sciences, 117(9):4609–4616, 2020.
  48. Are gender gaps due to evaluations of the applicant or the science? a natural experiment at a national funding agency. The Lancet, 393(10171):531–540, 2019.
  49. Gender and Employment in the COVID-19 Recession: Evidence on “She-cessions”. International Monetary Fund, 2021.
  50. Unequal effects of the covid-19 pandemic on scientists. Nature human behaviour, 4(9):880–883, 2020.
  51. Gender gap in journal submissions and peer review during the first wave of the covid-19 pandemic. a study on 2329 elsevier journals. PloS one, 16(10):e0257919, 2021.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
Citations (4)