Systemic Biases in Sign Language AI Research: A Deaf-Led Call to Reevaluate Research Agendas (2403.02563v1)
Abstract: Growing research in sign language recognition, generation, and translation AI has been accompanied by calls for ethical development of such technologies. While these works are crucial to helping individual researchers do better, there is a notable lack of discussion of systemic biases or analysis of rhetoric that shape the research questions and methods in the field, especially as it remains dominated by hearing non-signing researchers. Therefore, we conduct a systematic review of 101 papers in sign language AI. Our analysis identifies significant biases in the current state of sign language AI research, including an overfocus on addressing perceived communication barriers, a lack of use of representative datasets, use of annotations lacking linguistic foundations, and development of methods that build on flawed models. We take the position that the field lacks meaningful input from Deaf stakeholders, and is instead driven by what decisions are the most convenient or perceived as important to hearing researchers. We end with a call to action: the field must make space for Deaf researchers to lead the conversation in sign language AI.
- Bbc-oxford british sign language dataset. arXiv preprint arXiv:2111.03635.
- Dhoest Alexander and Jorn Rijckaert. 2022. News ‘with’ or ‘in’ sign language? case study on the comprehensibility of sign language in news broadcasts. Perspectives, 30(4):627–642.
- Bridging the gap: Understanding the intersection of deaf and technical perspectives on signing avatars. A. Way, D. Shterionov, C. Rathmann & L. Leeson (eds.), Sign Language Machine Translation.
- Emily M Bender and Batya Friedman. 2018. Data statements for natural language processing: Toward mitigating system bias and enabling better science. Transactions of the Association for Computational Linguistics, 6:587–604.
- On the dangers of stochastic parrots: Can language models be too big? In Proceedings of the 2021 ACM conference on fairness, accountability, and transparency, pages 610–623.
- Ten principles of disability justice. WSQ: Women’s Studies Quarterly, 46(1):227–230.
- Carl Börstell. 2023. Ableist language teching over sign language research. In Proceedings of the Second Workshop on Resources and Representations for Under-Resourced Languages and Domains (RESOURCEFUL-2023), pages 1–10.
- The fate landscape of sign language ai datasets: An interdisciplinary perspective. ACM Transactions on Accessible Computing (TACCESS), 14(2):1–45.
- Realtime multi-person 2d pose estimation using part affinity fields. In Proceedings of the IEEE conference on computer vision and pattern recognition, pages 7291–7299.
- Joao Carreira and Andrew Zisserman. 2017. Quo vadis, action recognition? a new model and the kinetics dataset. In proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 6299–6308.
- 1001 small victories: Deaf academics and imposter syndrome. In The Palgrave handbook of imposter syndrome in higher education, pages 481–496. Springer.
- Leipzig glossing rules. conventions for interlinear morpheme-by-morpheme glosses. max planck institute for evolutionary anthropology, leizpig.
- Maartje De Meulder. 2021. Is “good enough” good enough? ethical and responsible development of sign language technologies. In Proceedings of the 1st International Workshop on Automatic Translation for Signed and Spoken Languages (AT4SSL), pages 12–22, Virtual. Association for Machine Translation in the Americas.
- Maartje De Meulder and Annalies Kusters. 2021. Twitter thread.
- The legal recognition of sign languages: Advocacy and outcomes around the world. Multilingual Matters.
- Challenges with sign language datasets for sign language recognition and translation. In Calzolari N, Béchet F, Blache P, Choukri K, Cieri C, Declerck T, Goggi S, Isahara H, Maegaard B, Mariani J, Mazo H, Odijk J, Piperidis S, editors. LREC 2022, 13th International Conference on Language Resources and Evaluation; 2022 June 20-25; Marseille, France. Paris: European Language Resources; 2022. 10 p. European Language Resources Association.
- Imagenet: A large-scale hierarchical image database. In 2009 IEEE conference on computer vision and pattern recognition, pages 248–255. Ieee.
- Asl citizen: A community-sourced dataset for advancing isolated sign language recognition. Advances in Neural Information Processing Systems, 36.
- Alphapose: Whole-body regional multi-person pose estimation and tracking in real-time. IEEE Transactions on Pattern Analysis and Machine Intelligence.
- Extensions of the sign language recognition and translation corpus rwth-phoenix-weather. In LREC, pages 1911–1916.
- Best practices for sign language technology research. Universal Access in the Information Society, pages 1–9.
- The design of eco-feedback technology. In Proceedings of the SIGCHI conference on human factors in computing systems, pages 1999–2008.
- Moa Gärdenfors. 2021. The writing process and the written product in bimodal bilingual deaf and hard of hearing children. Languages, 6(2).
- Jon Henner and Octavian Robinson. 2023. Crip linguistics goes to school. Languages, 8(1):48.
- Joseph C. Hill. 2023. Overrepresentation of whiteness is in sign language as well: A commentary on “undoing competence: Coloniality, homogeneity, and the overrepresentation of whiteness in applied linguistics”. Language Learning, 73(S2):312–316.
- Julie Hochgesang. 2022a. Documenting signed language use while considering our spaces as a deaf* linguist.
- Julie Hochgesang. 2022b. Managing sign language acquisition video data: A personal journey in the organization and representation of signed data. The Open Handbook of Linguistic Data Management, pages 367–383.
- Julie A Hochgesang. 2014. Using design principles to consider representation of the hand in some notation systems. Sign Language Studies, 14(4):488–542.
- Julie A Hochgesang. 2019. Tyranny of glossing revisited: reconsidering representational practices of signed languages via best practices of data citation. In TISLR13, the 13th Conference of Theoretical Issues in Sign Language Research, Hamburg, Germany (September 26–28, 2019).
- W (h) ither the asl corpus?: Considering trends in signed corpus development. In Advances in Sign Language Corpus Linguistics, pages 287–308. John Benjamins.
- Gabrielle Hodge and Onno Crasborn. 2022. Good practices in annotation. In Signed language corpora, pages 46–89. Gallaudet University Press.
- Edgcon: Auto-assigner of iconicity ratings grounded by lexical properties to aid in generation of technical gestures. In Proceedings of the 38th ACM/SIGAPP Symposium on Applied Computing, pages 3–10.
- Lynn Hou. 2017. Negotiating language practices and language ideologies in fieldwork: A reflexive meta-documentation. Innovations in deaf studies: The role of deaf scholars, pages 339–360.
- Signbert+: Hand-model-aware self-supervised pre-training for sign language understanding. IEEE Transactions on Pattern Analysis and Machine Intelligence.
- Self-emphasizing network for continuous sign language recognition. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 37, pages 854–862.
- Signing outside the studio: Benchmarking background robustness for continuous sign language recognition. arXiv preprint arXiv:2211.00448.
- The sem-lex benchmark: Modeling asl signs and their phonemes. In Proceedings of the 25th International ACM SIGACCESS Conference on Computers and Accessibility, pages 1–10.
- Innovations in deaf studies: Critically mapping the field. Innovations in deaf studies: The role of deaf scholars, pages 1–53.
- Annelies Kusters and Ceil Lucas. 2022. Emergence and evolutions: Introducing sign language sociolinguistics. Journal of Sociolinguistics, 26(1):84–98.
- Beyond languages, beyond modalities: Transforming the study of semiotic repertoires. International Journal of multilingualism, 14(3):219–232.
- Word-level deep sign language recognition from video: A new large-scale dataset and methods comparison. In Proceedings of the IEEE/CVF winter conference on applications of computer vision, pages 1459–1469.
- Mediapipe: A framework for building perception pipelines. arXiv preprint arXiv:1906.08172.
- What do we mean by “accessibility research”? a literature survey of accessibility papers in chi and assets from 1994 to 2019. In Proceedings of the 2021 CHI Conference on Human Factors in Computing Systems, pages 1–18.
- Christopher D Mellinger. 2020. Positionality in public service interpreting research. FITISPos International Journal, 7(1):92–109.
- E. Morozov. 2013. To Save Everything, Click Here: Technology, Solutionism, and the Urge to Fix Problems that Don’t Exist. Penguin Books Limited.
- Evaluating the immediate applicability of pose estimation for sign language recognition. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, pages 3434–3440.
- Education and health of children with hearing loss: the necessity of signed languages. Bulletin of the World Health Organization, 97(10):711.
- Deaf professionals’ perceptions of trust in relationships with signed/spoken language interpreters. Translation & Interpreting, 15(2):25–42.
- The asl-lex 2.0 project: A database of lexical and phonological properties for 2,723 signs in american sign language. The Journal of Deaf Studies and Deaf Education, 26(2):263–277.
- Openhands: Making sign language recognition accessible with pose-based pretrained models across languages. arXiv preprint arXiv:2110.05877.
- Open-domain sign language translation learned from online video. arXiv preprint arXiv:2205.12870.
- SignOn. 2022. Sign language technology: Do’s and don’ts.
- Adhd and technology research–investigated by neurodivergent readers. In Proceedings of the 2022 CHI Conference on Human Factors in Computing Systems, pages 1–21.
- Phonology recognition in american sign language. In ICASSP 2022-2022 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), pages 8452–8456. IEEE.
- Read and attend: Temporal localisation in sign language videos. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 16857–16866.
- Gloss alignment using word embeddings. In 2023 IEEE International Conference on Acoustics, Speech, and Signal Processing Workshops (ICASSPW), pages 1–5. IEEE.
- Angelina Wang and Olga Russakovsky. 2023. Overwriting pretrained bias with finetuning data. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 3957–3968.
- Shuai Wang and Eric Nalisnick. 2023. Active learning for multilingual fingerspelling corpora. arXiv preprint arXiv:2309.12443.
- Including signed languages in natural language processing. In Proceedings of the 59th Annual Meeting of the Association for Computational Linguistics and the 11th International Joint Conference on Natural Language Processing (Volume 1: Long Papers), pages 7347–7360.
- Natural language-assisted sign language recognition. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pages 14890–14900.