Specializing Multi-domain NMT via Penalizing Low Mutual Information (2210.12910v1)

Published 24 Oct 2022 in cs.CL and cs.AI

Abstract: Multi-domain Neural Machine Translation (NMT) trains a single model on multiple domains, which is appealing because one model can serve all of them. Ideally, a multi-domain NMT model should learn the distinctive characteristics of each domain simultaneously; however, capturing these domain peculiarities is a non-trivial task. In this paper, we investigate domain-specific information through the lens of mutual information (MI) and propose a new objective that penalizes low MI, pushing it higher. Our method achieves state-of-the-art performance among current competitive multi-domain NMT models. We also show empirically that our objective raises low MI, resulting in a domain-specialized multi-domain NMT model.
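Only the abstract is available here, so the exact form of the objective is not specified. The sketch below is a minimal, hypothetical illustration of one way to penalize low pointwise mutual information between a domain tag and the generated target tokens, added alongside the usual translation loss. The estimator, the threshold value, and the function name low_mi_penalty are assumptions for illustration, not the authors' formulation.

```python
import torch
import torch.nn.functional as F

def low_mi_penalty(token_logprob_domain, token_logprob_marginal, threshold=0.0):
    """Hypothetical sketch of a penalty that pushes low mutual information higher.

    Pointwise MI for a target token y is approximated as
        log p(y | x, d) - log p(y | x),
    i.e. how much conditioning on the domain tag d raises the token's probability.
    Tokens whose pointwise MI falls below `threshold` incur a hinge penalty;
    tokens with high MI contribute zero loss.
    """
    pointwise_mi = token_logprob_domain - token_logprob_marginal
    return F.relu(threshold - pointwise_mi).mean()

# Usage: combine with the standard NMT cross-entropy loss.
# logp_d: log p(y_t | x, domain) from a domain-conditioned decoder pass
# logp_m: log p(y_t | x) from a pass without the domain tag (both shape [batch, T])
logp_d = torch.randn(8, 32)  # placeholder log-probabilities for demonstration
logp_m = torch.randn(8, 32)
mi_loss = low_mi_penalty(logp_d, logp_m, threshold=0.5)
total_loss = 1.0 * mi_loss  # in practice: cross_entropy_loss + lambda * mi_loss
```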

Authors (5)
  1. Jiyoung Lee (42 papers)
  2. Hantae Kim (3 papers)
  3. Hyunchang Cho (4 papers)
  4. Edward Choi (90 papers)
  5. Cheonbok Park (20 papers)
Citations (4)