2000 character limit reached
Correcting Length Bias in Neural Machine Translation (1808.10006v2)
Published 29 Aug 2018 in cs.CL
Abstract: We study two problems in neural machine translation (NMT). First, in beam search, whereas a wider beam should in principle help translation, it often hurts NMT. Second, NMT has a tendency to produce translations that are too short. Here, we argue that these problems are closely related and both rooted in label bias. We show that correcting the brevity problem almost eliminates the beam problem; we compare some commonly-used methods for doing this, finding that a simple per-word reward works well; and we introduce a simple and quick way to tune this reward using the perceptron algorithm.
- Kenton Murray (37 papers)
- David Chiang (59 papers)