Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS (2011.04845v2)

Published 10 Nov 2020 in cs.CL

Abstract: This paper presents a newly developed simultaneous neural speech-to-speech translation system and its evaluation. The system consists of three fully incremental neural processing modules for automatic speech recognition (ASR), machine translation (MT), and text-to-speech synthesis (TTS). We investigated the system's overall latency in terms of Ear-Voice Span and speaking latency, along with module-level performance.

Authors (6)

Katsuhito Sudoh
Takatomo Kano
Sashi Novitasari
Tomoya Yanagita
Sakriani Sakti
Satoshi Nakamura

Citations (13)
