Towards Neural Phrase-based Machine Translation (1706.05565v8)

Published 17 Jun 2017 in cs.CL and stat.ML

Abstract: In this paper, we present Neural Phrase-based Machine Translation (NPMT). Our method explicitly models the phrase structures in output sequences using Sleep-WAke Networks (SWAN), a recently proposed segmentation-based sequence modeling method. To mitigate the monotonic alignment requirement of SWAN, we introduce a new layer to perform (soft) local reordering of input sequences. Different from existing neural machine translation (NMT) approaches, NPMT does not use attention-based decoding mechanisms. Instead, it directly outputs phrases in a sequential order and can decode in linear time. Our experiments show that NPMT achieves superior performances on IWSLT 2014 German-English/English-German and IWSLT 2015 English-Vietnamese machine translation tasks compared with strong NMT baselines. We also observe that our method produces meaningful phrases in output languages.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (5)

Po-Sen Huang (30 papers)
Chong Wang (308 papers)
Sitao Huang (22 papers)
Dengyong Zhou (20 papers)
Li Deng (76 papers)

Citations (3)

View on Semantic Scholar

Towards Neural Phrase-based Machine Translation (1706.05565v8)

Related Papers