Rethinking Document-level Neural Machine Translation (2010.08961v2)

Published 18 Oct 2020 in cs.CL

Abstract: This paper does not aim to introduce a novel model for document-level neural machine translation. Instead, we return to the original Transformer model and seek to answer the following question: is the capacity of current models strong enough for document-level translation? Interestingly, we observe that the original Transformer with appropriate training techniques can achieve strong results for document translation, even for documents as long as 2,000 words. We evaluate this model and several recent approaches on nine document-level datasets and two sentence-level datasets across six languages. Experiments show that document-level Transformer models outperform sentence-level ones and many previous methods on a comprehensive set of metrics, including BLEU, four lexical indices, three newly proposed assistant linguistic indicators, and human evaluation.
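The abstract's core idea is that a standard sentence-level Transformer can translate whole documents if the training data is prepared at the document level. The sketch below is an illustration of one common way to do this, not the authors' released code: aligned sentence pairs are packed into document-level source/target pairs separated by a boundary token, within a word budget matching the 2,000-word length mentioned in the abstract. The `<sep>` token name and the greedy packing strategy are assumptions made for illustration.

```python
# Illustrative sketch (assumed, not the paper's implementation): pack aligned
# sentence pairs into document-level examples so a vanilla Transformer can be
# trained on full-document inputs.

from typing import List, Tuple

SEP = "<sep>"      # hypothetical sentence-boundary marker
MAX_WORDS = 2000   # document length budget, taken from the abstract


def make_doc_examples(src_sents: List[str],
                      tgt_sents: List[str]) -> List[Tuple[str, str]]:
    """Greedily group consecutive aligned sentence pairs into document-level
    pairs, keeping each source document under MAX_WORDS words."""
    examples: List[Tuple[str, str]] = []
    src_buf: List[str] = []
    tgt_buf: List[str] = []
    n_words = 0
    for src, tgt in zip(src_sents, tgt_sents):
        words = len(src.split())
        # Start a new document-level example once the word budget is exceeded.
        if src_buf and n_words + words > MAX_WORDS:
            examples.append((f" {SEP} ".join(src_buf), f" {SEP} ".join(tgt_buf)))
            src_buf, tgt_buf, n_words = [], [], 0
        src_buf.append(src)
        tgt_buf.append(tgt)
        n_words += words
    if src_buf:
        examples.append((f" {SEP} ".join(src_buf), f" {SEP} ".join(tgt_buf)))
    return examples


if __name__ == "__main__":
    src = ["Er öffnete die Tür .", "Dann ging er hinein ."]
    tgt = ["He opened the door .", "Then he went inside ."]
    for s, t in make_doc_examples(src, tgt):
        print(s, "=>", t)
```

Each resulting pair can then be fed to an unmodified encoder-decoder Transformer; the point of the paper is that, with appropriate training techniques, no architectural change is needed to benefit from this document-level context.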

Authors (7)
  1. Zewei Sun (15 papers)
  2. Mingxuan Wang (83 papers)
  3. Hao Zhou (351 papers)
  4. Chengqi Zhao (15 papers)
  5. Shujian Huang (106 papers)
  6. Jiajun Chen (125 papers)
  7. Lei Li (1293 papers)
Citations (42)