
Word-based Domain Adaptation for Neural Machine Translation (1906.03129v1)

Published 7 Jun 2019 in cs.CL and cs.AI

Abstract: In this paper, we empirically investigate applying word-level weights to adapt neural machine translation to e-commerce domains, where small e-commerce datasets and large out-of-domain datasets are available. In order to mine in-domain-like words in the out-of-domain datasets, we compute word weights by using a domain-specific and a non-domain-specific language model followed by smoothing and binary quantization. The baseline model is trained on mixed in-domain and out-of-domain datasets. Experimental results on English to Chinese e-commerce domain translation show that compared to continuing training without word weights, it improves MT quality by up to 2.11% BLEU absolute and 1.59% TER. We have also trained models using fine-tuning on the in-domain data. Pre-training a model with word weights improves fine-tuning by up to 1.24% BLEU absolute and 1.64% TER, respectively.
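The weighting scheme described in the abstract — scoring each word with a domain-specific versus a non-domain-specific language model, smoothing the scores, then binary quantization — can be sketched as follows. This is a minimal illustration, not the authors' implementation: it assumes add-alpha smoothed unigram language models, a simple neighbor-averaging smoother, and made-up toy corpora and threshold. The resulting {0, 1} weights would then scale the per-token training loss of the NMT model.

```python
# Minimal sketch (not the paper's code): word weights from two unigram
# language models, smoothed and binarized. All corpora, names, and the
# threshold below are illustrative assumptions.
import math
from collections import Counter


def unigram_lm(sentences, alpha=1.0):
    """Build an add-alpha smoothed unigram language model."""
    counts = Counter(tok for sent in sentences for tok in sent.split())
    total = sum(counts.values())
    vocab = len(counts) + 1  # +1 slot for unseen words
    return counts, total, vocab, alpha


def logprob(word, model):
    counts, total, vocab, alpha = model
    return math.log((counts.get(word, 0) + alpha) / (total + alpha * vocab))


def word_weights(sentence, in_lm, out_lm, threshold=0.0):
    """Score each word by log p_in(w) - log p_out(w), smooth over
    neighboring words, then binarize: in-domain-like words get weight 1.0."""
    scores = [logprob(w, in_lm) - logprob(w, out_lm) for w in sentence.split()]
    smoothed = [
        sum(scores[max(0, i - 1): i + 2]) / len(scores[max(0, i - 1): i + 2])
        for i in range(len(scores))
    ]
    return [1.0 if s > threshold else 0.0 for s in smoothed]


# Example usage with toy in-domain (e-commerce) and out-of-domain corpora.
in_domain = ["buy red dress online", "free shipping on shoes"]
out_domain = ["the parliament passed the bill", "the weather is sunny today"]
in_lm = unigram_lm(in_domain)
out_lm = unigram_lm(out_domain)
print(word_weights("free shipping on the bill", in_lm, out_lm))
```

In training, these per-token weights would multiply the cross-entropy loss of the corresponding target tokens, so that in-domain-like words in the out-of-domain data contribute more to the adapted model.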

Authors (5)
  1. Shen Yan (47 papers)
  2. Leonard Dahlmann (5 papers)
  3. Pavel Petrushkov (9 papers)
  4. Sanjika Hewavitharana (5 papers)
  5. Shahram Khadivi (29 papers)
Citations (7)
