Diving Deep into Context-Aware Neural Machine Translation (2010.09482v1)

Published 19 Oct 2020 in cs.CL and cs.AI

Abstract: Context-aware neural machine translation (NMT) is a promising direction for improving translation quality by making use of additional context, e.g., document-level context or meta-information. Although various architectures and analyses exist, the effectiveness of different context-aware NMT models is not yet well explored. This paper analyzes the performance of document-level NMT models on four diverse domains with varying amounts of parallel document-level bilingual data. We conduct a comprehensive set of experiments to investigate the impact of document-level NMT. We find that there is no single best approach to document-level NMT; rather, different architectures come out on top on different tasks. Looking at task-specific problems, such as pronoun resolution or headline translation, we find improvements in the context-aware systems, even in cases where corpus-level metrics like BLEU show no significant improvement. We also show that document-level back-translation significantly helps to compensate for the lack of document-level bi-texts.
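To make the setting concrete: one common baseline among the document-level architectures the paper compares is simple sentence concatenation, where previous source sentences are prepended to the current one behind a special break token before being passed to an otherwise standard sentence-level model. The sketch below illustrates only that input-construction step; the `BREAK` token and the `translate` stub are hypothetical stand-ins, not the paper's specific implementation.

```python
# Minimal sketch of the concatenation approach to document-level NMT:
# prepend up to `context_size` preceding source sentences, separated by a
# special break token, and feed the result to a sentence-level model.

BREAK = "<brk>"  # hypothetical sentence-boundary token


def build_context_input(doc_sentences, idx, context_size=1):
    """Build the model input for sentence `idx` with preceding context."""
    start = max(0, idx - context_size)
    context = doc_sentences[start:idx]
    return f" {BREAK} ".join(context + [doc_sentences[idx]])


def translate(source: str) -> str:
    """Placeholder for a trained NMT system (e.g., a Transformer)."""
    return f"<translation of: {source}>"


doc = [
    "The committee met on Friday.",
    "It approved the new budget.",  # "It" can only be resolved with context
]
for i in range(len(doc)):
    print(translate(build_context_input(doc, i, context_size=1)))
```

The second sentence shows why context matters for phenomena like pronoun resolution: without the preceding sentence, the model cannot tell what "It" refers to. The same input construction can be applied on the target side of synthetic data to perform the document-level back-translation the abstract mentions.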

Authors (6)
  1. Jingjing Huo (3 papers)
  2. Christian Herold (20 papers)
  3. Yingbo Gao (15 papers)
  4. Leonard Dahlmann (5 papers)
  5. Shahram Khadivi (29 papers)
  6. Hermann Ney (104 papers)
Citations (22)