Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation (2103.02262v1)

Published 3 Mar 2021 in cs.CL and cs.LG

Abstract: Meta-learning has been sufficiently validated to be beneficial for low-resource neural machine translation (NMT). However, we find that meta-trained NMT fails to improve the translation performance of the domain unseen at the meta-training stage. In this paper, we aim to alleviate this issue by proposing a novel meta-curriculum learning for domain adaptation in NMT. During meta-training, the NMT first learns the similar curricula from each domain to avoid falling into a bad local optimum early, and finally learns the curricula of individualities to improve the model robustness for learning domain-specific knowledge. Experimental results on 10 different low-resource domains show that meta-curriculum learning can improve the translation performance of both familiar and unfamiliar domains. All the codes and data are freely available at https://github.com/NLP2CT/Meta-Curriculum.

PDF Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

Authors (4)

Runzhe Zhan (12 papers)
Xuebo Liu (54 papers)
Derek F. Wong (69 papers)
Lidia S. Chao (41 papers)

Citations (43)

View on Semantic Scholar

Tweets

https://twitter.com/cneuralnetwork/status/1918962490248933635

Meta-Curriculum Learning for Domain Adaptation in Neural Machine Translation (2103.02262v1)

Related Papers

Tweets