Classist Tools: Social Class Correlates with Performance in NLP (2403.04445v1)
Abstract: Since the foundational work of William Labov on the social stratification of language (Labov, 1964), linguistics has made concentrated efforts to explore the links between sociodemographic characteristics and language production and perception. But while there is strong evidence for sociodemographic characteristics in language, they are infrequently used in NLP. Age and gender are somewhat well represented, but Labov's original target, socioeconomic status, is noticeably absent. And yet it matters. We show empirically that NLP disadvantages less-privileged socioeconomic groups. We annotate a corpus of 95K utterances from movies with social class, ethnicity and geographical language variety and measure the performance of NLP systems on three tasks: language modelling, automatic speech recognition, and grammar error correction. We find significant performance disparities that can be attributed to socioeconomic status as well as ethnicity and geographical differences. With NLP technologies becoming ever more ubiquitous and quotidian, they must accommodate all language varieties to avoid disadvantaging already marginalised groups. We argue for the inclusion of socioeconomic class in future language technologies.
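As a rough illustration of what measuring a performance disparity across groups can look like, the sketch below computes a mean word error rate per speaker group for a speech recogniser. It is a minimal sketch, not the authors' evaluation code: the abstract does not name the metric used, so word error rate is an assumption, and the group labels, transcripts, and function names (`word_error_rate`, `mean_wer_by_group`) are invented for the example.

```python
from collections import defaultdict


def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level edit distance divided by the number of reference words."""
    ref, hyp = reference.split(), hypothesis.split()
    # prev[j] is the edit distance between the reference prefix processed
    # so far and the first j words of the hypothesis.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i]
        for j, hyp_word in enumerate(hyp, start=1):
            cost = 0 if ref_word == hyp_word else 1
            curr.append(min(
                prev[j] + 1,         # reference word missing from hypothesis
                curr[j - 1] + 1,     # extra word in hypothesis
                prev[j - 1] + cost,  # match or substitution
            ))
        prev = curr
    return prev[-1] / max(len(ref), 1)


def mean_wer_by_group(samples):
    """samples: iterable of (group_label, reference, hypothesis) triples."""
    scores = defaultdict(list)
    for group, reference, hypothesis in samples:
        scores[group].append(word_error_rate(reference, hypothesis))
    return {group: sum(vals) / len(vals) for group, vals in scores.items()}


# Invented toy data: labels and transcripts are NOT from the paper's corpus.
toy_samples = [
    ("group_a", "i am not going there", "i am not going there"),
    ("group_a", "see you tomorrow", "see you tomorrow"),
    ("group_b", "i ain't going there", "i am going there"),
    ("group_b", "see you tomorrow", "see you tomorrow"),
]

print(mean_wer_by_group(toy_samples))
```

A gap between the per-group means is only a starting point; whether it can be attributed to social class rather than to chance or confounds would need a significance test on the real annotated data.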
- Jonathan Anderson. 1983. Lix and rix: Variations on a little-known readability index. Journal of Reading, 26(6):490–496.
- XLS-R: Self-supervised cross-lingual speech representation learning at scale. arXiv preprint arXiv:2111.09296.
- wav2vec 2.0: A framework for self-supervised learning of speech representations. Advances in neural information processing systems, 33:12449–12460.
- Emily M. Bender and Batya Friedman. 2018. Data statements for natural language processing: Toward mitigating system bias and enabling better science. Transactions of the Association for Computational Linguistics, 6:587–604.
- Basil Bernstein. 1960. Language and social class. The British journal of sociology, 11(3):271–276.
- Mary Bucholtz and Kira Hall. 2005. Identity and interaction: A sociocultural linguistic approach. Discourse studies, 7(4-5):585–614.
- Eve V Clark and Marisa Casillas. 2015. First language acquisition. In The Routledge handbook of linguistics, pages 311–328. Routledge.
- Meri Coleman and Ta Lin Liau. 1975. A computer readability formula designed for machine scoring. Journal of Applied Psychology, 60(2):283.
- Are AI systems biased against the poor? A machine learning analysis using Word2Vec and GloVe embeddings. AI & society, pages 1–16.
- Penelope Eckert. 2012. Three waves of variation study: The emergence of meaning in the study of sociolinguistic variation. Annual review of Anthropology, 41(1):87–100.
- A survey of race, racism, and anti-racism in NLP. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 1905–1925, Online. Association for Computational Linguistics.
- Exploring stylistic variation with age and income on Twitter. In Proceedings of the 54th Annual Meeting of the Association for Computational Linguistics (Volume 2: Short Papers), pages 313–319, Berlin, Germany. Association for Computational Linguistics.
- Rudolph Flesch. 1948. A new readability yardstick. Journal of applied psychology, 32(3):221.
- Demystifying prompts in language models via perplexity estimation. In Findings of the Association for Computational Linguistics: EMNLP 2023, pages 10136–10148, Singapore. Association for Computational Linguistics.
- Robert Gunning. 1968. The Technique of Clear Writing. McGraw-Hill Book Company, New York.
- Mistral 7B. arXiv preprint arXiv:2310.06825.
- Cross-lingual syntactic variation over age and gender. In Proceedings of the Nineteenth Conference on Computational Natural Language Learning, pages 103–112, Beijing, China. Association for Computational Linguistics.
- Derivation of new readability formulas (Automated Readability Index, Fog Count and Flesch Reading Ease formula) for navy enlisted personnel.
- William Labov. 1964. The social stratification of English in New York city. Ph.D. thesis, Columbia University.
- Qiuana Lopez and Mary Bucholtz. 2017. “How my hair look?” Linguistic authenticity and racialized gender and sexuality on The Wire. Journal of Language and Sexuality, 6(1):1–29.
- Alec W McHoul. 1987. An initial investigation of the usability of fictional conversation for doing conversation analysis. Semiotica, 67(1-2):83–104.
- From WER and RIL to MER and WIL: Improved evaluation measures for connected speech recognition. In Interspeech 2004, pages 2765–2768. ISCA.
- JFLEG: A fluency corpus and benchmark for grammatical error correction. In Proceedings of the 15th Conference of the European Chapter of the Association for Computational Linguistics: Volume 2, Short Papers, pages 229–234, Valencia, Spain. Association for Computational Linguistics.
- Stanza: A python natural language processing toolkit for many human languages. In Proceedings of the 58th Annual Meeting of the Association for Computational Linguistics: System Demonstrations, pages 101–108, Online. Association for Computational Linguistics.
- Paulo Quaglio. 2008. Television dialogue and natural conversation. Corpora and discourse, pages 189–210.
- Robust speech recognition via large-scale weak supervision. In International Conference on Machine Learning, pages 28492–28518. PMLR.
- Nils Reimers and Iryna Gurevych. 2019. Sentence-BERT: Sentence embeddings using Siamese BERT-networks. In Proceedings of the 2019 Conference on Empirical Methods in Natural Language Processing and the 9th International Joint Conference on Natural Language Processing (EMNLP-IJCNLP), pages 3982–3992, Hong Kong, China. Association for Computational Linguistics.
- John R Rickford. 1986. The need for new approaches to social class analysis in sociolinguistics. Language and communication, 6(3):215–221.
- A new model of social class? Findings from the BBC’s Great British Class Survey experiment. Sociology, 47(2):219–250.
- Socioeconomic status and mortality. Diabetes Care, 36(1):49–55.
- Automated readability index. AMRL-TR. Aerospace Medical Research Laboratories, pages 1–14.
- Anastasia G Stamou. 2014. A literature review on the mediation of sociolinguistic style in television and cinematic fiction: Sustaining the ideology of authenticity. Language and Literature, 23(2):118–140.
- Llama 2: Open foundation and fine-tuned chat models. arXiv preprint arXiv:2307.09288.
- Zephyr: Direct distillation of LM alignment. arXiv preprint arXiv:2310.16944.
- Elisa Usategui Basozábal et al. 1992. La sociolingüística de Basil Bernstein y sus implicaciones en el ámbito escolar. Revista de educación.
- Melanie Weirich and Adrian P Simpson. 2018. Gender identity is indexed and perceived in speech. PLoS One, 13(12):e0209226.
- Transformers: State-of-the-art natural language processing. In Proceedings of the 2020 Conference on Empirical Methods in Natural Language Processing: System Demonstrations, pages 38–45, Online. Association for Computational Linguistics.
Authors: Amanda Cercas Curry, Giuseppe Attanasio, Zeerak Talat, Dirk Hovy