
UM4: Unified Multilingual Multiple Teacher-Student Model for Zero-Resource Neural Machine Translation (2207.04900v2)

Published 11 Jul 2022 in cs.CL, cs.AI, and cs.LG

Abstract: Most translation tasks between languages are zero-resource translation problems, in which parallel corpora are unavailable. Multilingual neural machine translation (MNMT) enables one-pass translation using a shared semantic space for all languages, in contrast to two-pass pivot translation, but it often underperforms the pivot-based method. In this paper, we propose a novel method named Unified Multilingual Multiple teacher-student Model for NMT (UM4). Our method unifies source-teacher, target-teacher, and pivot-teacher models to guide the student model in zero-resource translation. The source teacher and target teacher force the student to learn direct source-to-target translation through knowledge distilled on both the source and target sides. The pivot-teacher model further leverages the monolingual corpus to enhance the student model. Experimental results demonstrate that our model, covering 72 translation directions, significantly outperforms previous methods on the WMT benchmark.

Authors (8)
  1. Jian Yang (505 papers)
  2. Yuwei Yin (21 papers)
  3. Shuming Ma (83 papers)
  4. Dongdong Zhang (79 papers)
  5. Shuangzhi Wu (29 papers)
  6. Hongcheng Guo (39 papers)
  7. Zhoujun Li (122 papers)
  8. Furu Wei (291 papers)
Citations (10)
