2000 character limit reached
Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS (2011.04845v2)
Published 10 Nov 2020 in cs.CL
Abstract: This paper presents a newly developed, simultaneous neural speech-to-speech translation system and its evaluation. The system consists of three fully-incremental neural processing modules for automatic speech recognition (ASR), machine translation (MT), and text-to-speech synthesis (TTS). We investigated its overall latency in the system's Ear-Voice Span and speaking latency along with module-level performance.
- Katsuhito Sudoh (35 papers)
- Takatomo Kano (9 papers)
- Sashi Novitasari (7 papers)
- Tomoya Yanagita (3 papers)
- Sakriani Sakti (41 papers)
- Satoshi Nakamura (94 papers)