Rephrasing the Reference for Non-Autoregressive Machine Translation (2211.16863v1)

Published 30 Nov 2022 in cs.CL

Abstract: Non-autoregressive neural machine translation (NAT) models suffer from the multi-modality problem: a source sentence may have multiple valid translations, so the reference sentence may be an inappropriate training target when the NAT output is closer to a different translation. In response to this problem, we introduce a rephraser that provides a better training target for NAT by rephrasing the reference sentence according to the NAT output. Because we train NAT on the rephraser output rather than the reference sentence, the rephraser output should fit the NAT output well without deviating too far from the reference. Both requirements can be quantified as reward functions and optimized by reinforcement learning. Experiments on major WMT benchmarks and NAT baselines show that our approach consistently improves the translation quality of NAT. Specifically, our best variant achieves performance comparable to the autoregressive Transformer while being 14.7 times more efficient in inference.
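
The abstract names two requirements for the rephraser output: it should fit the NAT output, and it should stay close to the reference. As a rough illustration only, the sketch below combines the two into a single scalar reward. The abstract does not give the paper's actual reward definitions, so everything here is an assumption: the unigram-overlap F1 similarity, the function names `unigram_f1` and `rephraser_reward`, and the mixing weight `alpha` are all hypothetical stand-ins.

```python
from collections import Counter


def unigram_f1(candidate, target):
    """Token-overlap F1 between two token lists.

    Hypothetical stand-in similarity; not the measure used in the paper.
    """
    if not candidate or not target:
        return 0.0
    # Multiset intersection counts each shared token at most min(count) times.
    overlap = sum((Counter(candidate) & Counter(target)).values())
    if overlap == 0:
        return 0.0
    precision = overlap / len(candidate)
    recall = overlap / len(target)
    return 2 * precision * recall / (precision + recall)


def rephraser_reward(rephrased, nat_output, reference, alpha=0.5):
    """Weighted mix of the two requirements (alpha is an assumed hyperparameter).

    fit:      how well the rephrased target matches the NAT output
    fidelity: how close the rephrased target stays to the reference
    """
    fit = unigram_f1(rephrased, nat_output)
    fidelity = unigram_f1(rephrased, reference)
    return alpha * fit + (1 - alpha) * fidelity
```

Under these assumptions, a rephrased target identical to both the NAT output and the reference scores 1.0, and one sharing no tokens with either scores 0.0. In a reinforcement learning setup, a scalar of this kind would be the quantity the rephraser is trained to maximize.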

Authors (4)

Chenze Shao (22 papers)
Jinchao Zhang (49 papers)
Jie Zhou (687 papers)
Yang Feng (230 papers)

Citations (5)
