Multilingual Adaptation of RNN Based ASR Systems (1711.04569v2)

Published 13 Nov 2017 in eess.AS, cs.AI, and cs.CL

Abstract: In this work, we focus on multilingual systems based on recurrent neural networks (RNNs) trained with the Connectionist Temporal Classification (CTC) loss function. Using a shared multilingual set of acoustic units poses difficulties; to address them, we previously proposed Language Feature Vectors (LFVs) for training language-adaptive multilingual systems. Language adaptation, in contrast to speaker adaptation, needs to be applied not only at the feature level but also to deeper layers of the network. We therefore extend our previous approach with a novel technique we call "modulation", in which the hidden layers of the RNNs are modulated using LFVs. We evaluate this approach in both full and low resource conditions, and for both grapheme and phone based systems. Modulation yields lower error rates across all of these conditions.
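The abstract does not spell out the exact form of the modulation, only that the hidden layers of a CTC-trained RNN are modulated by the Language Feature Vector. The sketch below is a minimal illustration under that assumption: each bidirectional LSTM layer's outputs are scaled elementwise by a sigmoid gate computed from the LFV. All class and parameter names (e.g. `ModulatedBiLSTMLayer`, `lfv_dim`) are hypothetical and chosen here for illustration, not taken from the paper.

```python
import torch
import torch.nn as nn


class ModulatedBiLSTMLayer(nn.Module):
    """One bidirectional LSTM layer whose outputs are modulated by a
    language feature vector (LFV). Sketch only: elementwise multiplicative
    gating is an assumption, since the abstract does not give the formula."""

    def __init__(self, input_dim, hidden_dim, lfv_dim):
        super().__init__()
        self.lstm = nn.LSTM(input_dim, hidden_dim,
                            batch_first=True, bidirectional=True)
        # Map the LFV to one gate value per output unit
        # (2 * hidden_dim because of the bidirectional concatenation).
        self.gate = nn.Linear(lfv_dim, 2 * hidden_dim)

    def forward(self, feats, lfv):
        # feats: (batch, time, input_dim) acoustic features
        # lfv:   (batch, lfv_dim) language feature vector per utterance
        out, _ = self.lstm(feats)              # (batch, time, 2 * hidden_dim)
        gates = torch.sigmoid(self.gate(lfv))  # (batch, 2 * hidden_dim)
        return out * gates.unsqueeze(1)        # broadcast the gate over time


class ModulatedCTCAcousticModel(nn.Module):
    """Stack of modulated BiLSTM layers with a CTC output layer on top."""

    def __init__(self, input_dim, hidden_dim, lfv_dim, num_layers, num_units):
        super().__init__()
        dims = [input_dim] + [2 * hidden_dim] * (num_layers - 1)
        self.layers = nn.ModuleList(
            [ModulatedBiLSTMLayer(d, hidden_dim, lfv_dim) for d in dims])
        # num_units = graphemes or phones plus the CTC blank symbol
        self.output = nn.Linear(2 * hidden_dim, num_units)

    def forward(self, feats, lfv):
        x = feats
        for layer in self.layers:
            x = layer(x, lfv)
        return self.output(x).log_softmax(dim=-1)  # CTC expects log-probs
```

For training, the log-probabilities would be transposed to (time, batch, classes) and passed to `torch.nn.CTCLoss` together with the grapheme or phone label sequences; the same network can then be fed different LFVs to adapt it to different languages.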

Authors (3)
  1. Markus Müller (114 papers)
  2. Sebastian Stüker (11 papers)
  3. Alex Waibel (48 papers)
Citations (17)
