
Reducing Geographic Disparities in Automatic Speech Recognition via Elastic Weight Consolidation (2207.07850v1)

Published 16 Jul 2022 in eess.AS and cs.SD

Abstract: We present an approach to reduce the performance disparity of automatic speech recognition (ASR) between geographic regions without degrading performance on the overall user population. A popular approach is to fine-tune the model with data from regions where the ASR model has a higher word error rate (WER). However, when the ASR model is adapted to perform better on these high-WER regions, its parameters drift from their previous optimal values, which can lead to worse performance in other regions. In our proposed method, we use the elastic weight consolidation (EWC) regularization loss to identify directions in parameter space along which the ASR weights can vary to improve performance for high-error regions while still maintaining performance on the overall speaker population. Our results demonstrate that EWC can reduce the WER in the region with the highest WER by 3.2% relative while reducing the overall WER by 1.3% relative. We also evaluate the role of language and acoustic models in ASR fairness and propose a clustering algorithm to identify WER disparities based on geographic region.
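
For readers unfamiliar with EWC, the regularizer is commonly written as a quadratic penalty that anchors each parameter to its pre-adaptation value, weighted by an importance estimate. The sketch below shows that commonly used form, (lam / 2) * sum_i F_i * (theta_i - theta_star_i)^2, in plain Python. The abstract does not give the exact formulation used in the paper, so the diagonal importance weights `fisher_diag`, the strength `lam`, and the function names are illustrative assumptions, not the authors' implementation.

```python
def ewc_penalty(theta, theta_star, fisher_diag, lam):
    """Quadratic EWC-style penalty (assumed standard diagonal form).

    theta       -- current parameters during fine-tuning
    theta_star  -- parameters of the model before fine-tuning
    fisher_diag -- per-parameter importance weights (hypothetical inputs here)
    lam         -- regularization strength (hypothetical hyperparameter)
    """
    if not (len(theta) == len(theta_star) == len(fisher_diag)):
        raise ValueError("all parameter vectors must have the same length")
    total = 0.0
    for t, t0, f in zip(theta, theta_star, fisher_diag):
        # Parameters with large importance f are held close to t0;
        # parameters with small f are free to move.
        total += f * (t - t0) ** 2
    return 0.5 * lam * total


def regularized_loss(task_loss, theta, theta_star, fisher_diag, lam):
    """Fine-tuning loss on the high-WER region plus the EWC penalty."""
    return task_loss + ewc_penalty(theta, theta_star, fisher_diag, lam)


# Toy example: only the first parameter moved, and it has importance 2.0.
penalty = ewc_penalty([1.0, 2.0], [0.0, 2.0], [2.0, 5.0], lam=1.0)
print(penalty)
```

In this toy case the second parameter has high importance but has not moved, so it contributes nothing; the penalty comes entirely from the first parameter. This is the sense in which the regularizer leaves low-importance directions available for adaptation to high-error regions.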

Authors (6)
  1. Viet Anh Trinh (12 papers)
  2. Pegah Ghahremani (3 papers)
  3. Brian King (16 papers)
  4. Jasha Droppo (24 papers)
  5. Andreas Stolcke (57 papers)
  6. Roland Maas (24 papers)
Citations (4)
