Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

MADI: Inter-domain Matching and Intra-domain Discrimination for Cross-domain Speech Recognition (2302.11224v1)

Published 22 Feb 2023 in cs.CL, cs.SD, and eess.AS

Abstract: End-to-end automatic speech recognition (ASR) usually suffers from performance degradation when applied to a new domain due to domain shift. Unsupervised domain adaptation (UDA) aims to improve the performance on the unlabeled target domain by transferring knowledge from the source to the target domain. To improve transferability, existing UDA approaches mainly focus on matching the distributions of the source and target domains globally and/or locally, while ignoring the model discriminability. In this paper, we propose a novel UDA approach for ASR via inter-domain MAtching and intra-domain DIscrimination (MADI), which improves the model transferability by fine-grained inter-domain matching and discriminability by intra-domain contrastive discrimination simultaneously. Evaluations on the Libri-Adapt dataset demonstrate the effectiveness of our approach. MADI reduces the relative word error rate (WER) on cross-device and cross-environment ASR by 17.7% and 22.8%, respectively.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Jiaming Zhou (41 papers)
  2. Shiwan Zhao (47 papers)
  3. Ning Jiang (177 papers)
  4. Guoqing Zhao (20 papers)
  5. Yong Qin (35 papers)
Citations (7)

Summary

We haven't generated a summary for this paper yet.