Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Simultaneous Speech-to-Speech Translation System with Neural Incremental ASR, MT, and TTS (2011.04845v2)

Published 10 Nov 2020 in cs.CL

Abstract: This paper presents a newly developed, simultaneous neural speech-to-speech translation system and its evaluation. The system consists of three fully-incremental neural processing modules for automatic speech recognition (ASR), machine translation (MT), and text-to-speech synthesis (TTS). We investigated its overall latency in the system's Ear-Voice Span and speaking latency along with module-level performance.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Katsuhito Sudoh (35 papers)
  2. Takatomo Kano (9 papers)
  3. Sashi Novitasari (7 papers)
  4. Tomoya Yanagita (3 papers)
  5. Sakriani Sakti (41 papers)
  6. Satoshi Nakamura (94 papers)
Citations (13)

Summary

We haven't generated a summary for this paper yet.