Pointer-based Fusion of Bilingual Lexicons into Neural Machine Translation (1909.07907v1)

Published 17 Sep 2019 in cs.CL

Abstract: Neural machine translation (NMT) systems require large amounts of high quality in-domain parallel corpora for training. State-of-the-art NMT systems still face challenges related to out-of-vocabulary words and dealing with low-resource language pairs. In this paper, we propose and compare several models for fusion of bilingual lexicons with an end-to-end trained sequence-to-sequence model for machine translation. The result is a fusion model with two information sources for the decoder: a neural conditional LLM and a bilingual lexicon. This fusion model learns how to combine both sources of information in order to produce higher quality translation output. Our experiments show that our proposed models work well in relatively low-resource scenarios, and also effectively reduce the parameter size and training cost for NMT without sacrificing performance.

Authors (3)

Jetic Gū (3 papers)
Hassan S. Shavarani (6 papers)
Anoop Sarkar (11 papers)

Citations (4)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Pointer-based Fusion of Bilingual Lexicons into Neural Machine Translation (1909.07907v1)

Summary

Related Papers