
Breaking the Data Barrier: Towards Robust Speech Translation via Adversarial Stability Training (1909.11430v3)

Published 25 Sep 2019 in cs.CL and eess.AS

Abstract: In a pipeline speech translation system, the automatic speech recognition (ASR) system passes its recognition errors to the downstream machine translation (MT) system. A standard machine translation system is usually trained on a parallel corpus composed of clean text and performs poorly on text with recognition noise, a gap well known in the speech translation community. In this paper, we propose a training architecture that aims to make a neural machine translation model more robust against speech recognition errors. Our approach addresses the encoder and the decoder simultaneously, using adversarial learning and data augmentation, respectively. Experimental results on the IWSLT2018 speech translation task show that our approach can bridge the gap between the ASR output and the MT input, outperforming the baseline by up to 2.83 BLEU on noisy ASR output while maintaining close performance on clean text.

Authors (5)
  1. Qiao Cheng (6 papers)
  2. Meiyuan Fang (1 paper)
  3. Yaqian Han (4 papers)
  4. Jin Huang (80 papers)
  5. Yitao Duan (10 papers)
Citations (15)
