
Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English (2503.12858v1)

Published 17 Mar 2025 in cs.CL and cs.LG

Abstract: Test-time adaptation (TTA) is a method that helps generalize models across domains, tasks, and distributions without the use of labeled datasets. TTA is therefore well suited to dialectal NLP, where models are often trained on Standard American English (SAE) but evaluated on dialects such as Indian English or Nigerian English, whose distributions differ significantly from SAE; this is especially valuable because dialectal datasets are scarce. In this paper, we explore one of the best-known TTA techniques, SHOT, in dialectal NLP. We finetune and evaluate SHOT on different combinations of dialectal GLUE. Our findings show that SHOT is a viable technique when labeled datasets are unavailable. We also propose the concept of the dialectal gap and show that it correlates positively with the effectiveness of SHOT. Finally, we find that in many cases, finetuning on SAE yields higher performance than finetuning on dialectal data. Our code is available at https://github.com/dukenguyenxyz/dialect-adaptation
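
For context, SHOT (Source Hypothesis Transfer) freezes the source-trained classifier head and adapts only the feature encoder on unlabeled target data, combining information maximization (confident yet diverse predictions) with a pseudo-label self-training term. The PyTorch sketch below is illustrative only and is not drawn from the linked repository; the `encoder`/`classifier` split and the argmax pseudo-labeling shortcut are assumptions (SHOT proper derives pseudo-labels from per-class feature centroids).

```python
import torch
import torch.nn.functional as F

def shot_adapt_step(encoder, classifier, x, optimizer, beta=0.3):
    """One SHOT-style adaptation step on an unlabeled target batch.

    Hypothetical sketch: the classifier (source hypothesis) is frozen,
    and `optimizer` is assumed to hold only encoder parameters, e.g.
    torch.optim.SGD(encoder.parameters(), lr=1e-3).
    """
    for p in classifier.parameters():
        p.requires_grad_(False)

    logits = classifier(encoder(x))
    probs = F.softmax(logits, dim=-1)

    # Information maximization, part 1: low per-example entropy
    # pushes each prediction toward a confident class.
    ent = -(probs * torch.log(probs + 1e-6)).sum(dim=-1).mean()

    # Part 2: high entropy of the batch-mean prediction keeps the
    # class assignments diverse (minimizing `div` maximizes it).
    mean_probs = probs.mean(dim=0)
    div = (mean_probs * torch.log(mean_probs + 1e-6)).sum()

    # Self-training term: cross-entropy against the model's own
    # hard pseudo-labels (a simplification of SHOT's centroid step).
    pseudo = probs.argmax(dim=-1).detach()
    ce = F.cross_entropy(logits, pseudo)

    loss = ent + div + beta * ce
    optimizer.zero_grad()
    loss.backward()
    optimizer.step()
    return loss.item()
```

In the dialectal setting studied here, `x` would be a batch of, say, Indian English GLUE inputs encoded by a transformer that was finetuned on SAE, with adaptation run over the unlabeled dialectal test set.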

Authors (3)
  1. Duke Nguyen (4 papers)
  2. Aditya Joshi (43 papers)
  3. Flora Salim (37 papers)
