Efficient Machine Translation Domain Adaptation (2204.12608v1)

Published 26 Apr 2022 in cs.CL

Abstract: Machine translation models struggle when translating out-of-domain text, which makes domain adaptation a topic of critical importance. However, most domain adaptation methods focus on fine-tuning or training all or part of the model on every new domain, which can be costly. On the other hand, semi-parametric models have been shown to successfully perform domain adaptation by retrieving examples from an in-domain datastore (Khandelwal et al., 2021). A drawback of these retrieval-augmented models, however, is that they tend to be substantially slower. In this paper, we explore several approaches to speed up nearest neighbor machine translation. We adapt the methods recently proposed by He et al. (2021) for language modeling, and introduce a simple but effective caching strategy that avoids performing retrieval when similar contexts have been seen before. Translation quality and runtimes for several domains show the effectiveness of the proposed solutions.
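The retrieval and caching ideas in the abstract can be sketched concretely. In kNN-MT (Khandelwal et al., 2021), the next-token distribution interpolates the base model with a retrieved distribution, p(y_t) = λ p_kNN(y_t) + (1 − λ) p_MT(y_t), where p_kNN is built from the nearest neighbors of the current decoder context in the in-domain datastore. The sketch below is a minimal, hypothetical Python illustration of that interpolation plus a cache that skips retrieval when a similar context has been seen before; all names (Datastore, quantize, LAMBDA) and the quantization-based similarity test are illustrative assumptions, not the authors' actual implementation.

```python
import numpy as np

LAMBDA = 0.5  # interpolation weight between kNN and MT distributions (assumed value)

class Datastore:
    """In-domain datastore mapping decoder context vectors to target tokens."""
    def __init__(self, keys, values):
        self.keys = keys      # (N, d) array of context representations
        self.values = values  # (N,) array of target-token ids

    def knn_distribution(self, query, k, vocab_size, temperature=10.0):
        # Retrieve the k nearest neighbors by L2 distance and turn their
        # target tokens into a distribution over the vocabulary.
        dists = np.linalg.norm(self.keys - query, axis=1)
        nn = np.argsort(dists)[:k]
        weights = np.exp(-dists[nn] / temperature)
        p = np.zeros(vocab_size)
        for idx, w in zip(nn, weights):
            p[self.values[idx]] += w
        return p / p.sum()

cache = {}  # maps a coarse context key to a previously computed kNN distribution

def quantize(query, decimals=2):
    # Coarse key so that similar contexts collide in the cache; one simple
    # (assumed) way to detect "similar contexts have been seen before".
    return tuple(np.round(query, decimals))

def next_token_distribution(query, p_mt, datastore, k=8):
    """p = LAMBDA * p_knn + (1 - LAMBDA) * p_mt, reusing cached retrievals."""
    key = quantize(query)
    if key in cache:
        p_knn = cache[key]  # similar context seen before: skip retrieval
    else:
        p_knn = datastore.knn_distribution(query, k, len(p_mt))
        cache[key] = p_knn
    return LAMBDA * p_knn + (1.0 - LAMBDA) * p_mt
```

The cache trades a little memory for avoided nearest-neighbor searches, which is where retrieval-augmented decoding spends most of its extra time; the paper's actual strategy and similarity criterion may differ from this sketch.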

Authors (3)
  1. Pedro Henrique Martins
  2. Zita Marinho
  3. André F. T. Martins