Improved Neural Machine Translation with a Syntax-Aware Encoder and Decoder (1707.05436v1)
Published 18 Jul 2017 in cs.CL
Abstract: Most neural machine translation (NMT) models are based on the sequential encoder-decoder framework, which makes no use of syntactic information. In this paper, we improve this model by explicitly incorporating source-side syntactic trees. More specifically, we propose (1) a bidirectional tree encoder which learns both sequential and tree-structured representations; (2) a tree-coverage model that lets the attention depend on the source-side syntax. Experiments on Chinese-English translation demonstrate that our proposed models outperform the sequential attentional model as well as a stronger baseline with a bottom-up tree encoder and word coverage.
- Huadong Chen (26 papers)
- Shujian Huang (106 papers)
- David Chiang (59 papers)
- Jiajun Chen (125 papers)