
Learning Policies for Multilingual Training of Neural Machine Translation Systems (2103.06964v1)

Published 11 Mar 2021 in cs.CL

Abstract: Low-resource Multilingual Neural Machine Translation (MNMT) is typically tasked with improving translation performance on one or more language pairs with the aid of high-resource language pairs. In this paper, we propose two simple search-based curricula -- orderings of the multilingual training data -- which help improve translation performance in conjunction with existing techniques such as fine-tuning. Additionally, we attempt to learn a curriculum for MNMT from scratch, jointly with the training of the translation system, with the aid of contextual multi-armed bandits. We show on the FLORES low-resource translation dataset that these learned curricula can provide better starting points for fine-tuning and improve the overall performance of the translation system.
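
The learned-curriculum idea can be pictured as a bandit that, at each training step, picks which language pair to draw the next batch from and is rewarded by how much that batch improves a development-set metric. The sketch below is a minimal illustration using a plain (non-contextual) EXP3 sampler as a simplification of the contextual bandits the abstract describes; the reward definition, the hyperparameters, and the language-pair names are assumptions for illustration, not the paper's exact formulation.

```python
# Hypothetical sketch: an EXP3 bandit that schedules multilingual NMT batches.
# Each arm is a language pair; the reward is assumed to be a clipped [0, 1]
# improvement in a dev-set score after training on a batch from that pair.
import math
import random

class Exp3Curriculum:
    def __init__(self, language_pairs, gamma=0.1):
        self.pairs = list(language_pairs)
        self.gamma = gamma                      # exploration rate
        self.weights = [1.0] * len(self.pairs)  # one weight per arm

    def _probs(self):
        total = sum(self.weights)
        k = len(self.pairs)
        # Mix the weight-proportional distribution with uniform exploration.
        return [(1 - self.gamma) * w / total + self.gamma / k
                for w in self.weights]

    def choose(self):
        """Sample the language pair to draw the next training batch from."""
        probs = self._probs()
        arm = random.choices(range(len(self.pairs)), weights=probs)[0]
        return arm, self.pairs[arm]

    def update(self, arm, reward):
        """Standard EXP3 update with an importance-weighted reward estimate."""
        probs = self._probs()
        estimated = reward / probs[arm]
        self.weights[arm] *= math.exp(self.gamma * estimated / len(self.pairs))

# Toy usage with illustrative FLORES-style pairs: the sampler shifts its
# probability mass toward whichever pair yields higher simulated rewards.
if __name__ == "__main__":
    curriculum = Exp3Curriculum(["si-en", "ne-en", "hi-en"])
    true_reward = {"si-en": 0.2, "ne-en": 0.5, "hi-en": 0.8}
    for step in range(2000):
        arm, pair = curriculum.choose()
        reward = random.random() < true_reward[pair]  # simulated feedback
        curriculum.update(arm, float(reward))
    probs = curriculum._probs()
    print({p: round(q, 2) for p, q in zip(curriculum.pairs, probs)})
```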

Authors (3)
  1. Gaurav Kumar (46 papers)
  2. Philipp Koehn (60 papers)
  3. Sanjeev Khudanpur (74 papers)
Citations (1)