Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Wait-info Policy: Balancing Source and Target at Information Level for Simultaneous Machine Translation (2210.11220v1)

Published 20 Oct 2022 in cs.CL and cs.AI

Abstract: Simultaneous machine translation (SiMT) outputs the translation while receiving the source inputs, and hence needs to balance the received source information and translated target information to make a reasonable decision between waiting for inputs or outputting translation. Previous methods always balance source and target information at the token level, either directly waiting for a fixed number of tokens or adjusting the waiting based on the current token. In this paper, we propose a Wait-info Policy to balance source and target at the information level. We first quantify the amount of information contained in each token, named info. Then during simultaneous translation, the decision of waiting or outputting is made based on the comparison results between the total info of previous target outputs and received source inputs. Experiments show that our method outperforms strong baselines under and achieves better balance via the proposed info.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Shaolei Zhang (36 papers)
  2. Shoutao Guo (17 papers)
  3. Yang Feng (230 papers)
Citations (23)

Summary

We haven't generated a summary for this paper yet.