
Incremental Learning for End-to-End Automatic Speech Recognition (2005.04288v3)

Published 11 May 2020 in eess.AS, cs.CL, cs.LG, cs.SD, and stat.ML

Abstract: In this paper, we propose an incremental learning method for end-to-end Automatic Speech Recognition (ASR) which enables an ASR system to perform well on new tasks while maintaining the performance on its originally learned ones. To mitigate catastrophic forgetting during incremental learning, we design a novel explainability-based knowledge distillation for ASR models, which is combined with a response-based knowledge distillation to maintain the original model's predictions and the "reason" for the predictions. Our method works without access to the training data of original tasks, which addresses the cases where the previous data is no longer available or joint training is costly. Results on a multi-stage sequential training task show that our method outperforms existing ones in mitigating forgetting. Furthermore, in two practical scenarios, compared to the target-reference joint training method, the performance drop of our method is 0.02% Character Error Rate (CER), which is 97% smaller than the drops of the baseline methods.
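To illustrate the kind of combined distillation objective the abstract describes, below is a minimal sketch in a PyTorch-style setup. It is not the paper's exact formulation: the response-based term is a standard temperature-softened KL divergence between the old (teacher) and new (student) model outputs, and the explainability-based term is approximated here with a vanilla input-gradient saliency match. The function names, the saliency method, and the weighting hyperparameters `alpha`, `beta`, and `T` are all illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def incremental_kd_loss(student_logits, teacher_logits,
                        student_saliency, teacher_saliency,
                        task_loss, alpha=0.5, beta=0.5, T=2.0):
    """Combine the new-task loss with two distillation terms:
    a response-based term (match the original model's output
    distribution) and an explainability-based term (match the
    original model's input attributions). Weights and the
    attribution method are assumptions, not the paper's spec."""
    # Response-based KD: KL divergence between softened outputs,
    # rescaled by T^2 as is conventional for distillation.
    response_kd = F.kl_div(
        F.log_softmax(student_logits / T, dim=-1),
        F.softmax(teacher_logits / T, dim=-1),
        reduction="batchmean",
    ) * (T * T)
    # Explainability-based KD: penalize divergence between the two
    # models' saliency maps (mean squared error as a stand-in).
    explain_kd = F.mse_loss(student_saliency, teacher_saliency)
    return task_loss + alpha * response_kd + beta * explain_kd

def input_saliency(model, features, target_index):
    """Illustrative attribution: absolute gradient of the target
    logit with respect to the input features (vanilla saliency)."""
    features = features.clone().requires_grad_(True)
    logits = model(features)
    logits[..., target_index].sum().backward()
    return features.grad.abs()
```

Note that both distillation terms need only the frozen original model and the new-task inputs, which is consistent with the abstract's claim that the method works without access to the original training data.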

Authors (7)
  1. Li Fu (24 papers)
  2. Xiaoxiao Li (144 papers)
  3. Libo Zi (2 papers)
  4. Zhengchen Zhang (9 papers)
  5. Youzheng Wu (32 papers)
  6. Xiaodong He (162 papers)
  7. Bowen Zhou (141 papers)
Citations (22)
