It's not Greek to mBERT: Inducing Word-Level Translations from Multilingual BERT (2010.08275v1)
Published 16 Oct 2020 in cs.CL
Abstract: Recent works have demonstrated that multilingual BERT (mBERT) learns rich cross-lingual representations that allow for transfer across languages. We study the word-level translation information embedded in mBERT and present two simple methods that expose remarkable translation capabilities with no fine-tuning. The results suggest that most of this information is encoded in a non-linear way, while some of it can also be recovered with purely linear tools. As part of our analysis, we test the hypothesis that mBERT learns representations which contain both a language-encoding component and an abstract, cross-lingual component, and explicitly identify an empirical language-identity subspace within mBERT representations.
- Hila Gonen (30 papers)
- Shauli Ravfogel (38 papers)
- Yanai Elazar (44 papers)
- Yoav Goldberg (142 papers)
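
The abstract's hypothesis of a language-encoding component plus a cross-lingual component can be pictured with a small linear sketch. The snippet below is illustrative only and not the paper's implementation: it assumes that the language-identity component can be approximated by a per-language mean vector, and that word-level translation can then be attempted by subtracting the source-language mean, adding the target-language mean, and taking a cosine nearest neighbor. The vocabularies and vectors are synthetic stand-ins for representations one might extract from mBERT.

```python
import numpy as np

rng = np.random.default_rng(0)
dim = 16

# Hypothetical toy data: in practice these would be mBERT representations
# of vocabulary items in each language, not random vectors.
en_vocab = ["dog", "cat", "house"]
es_vocab = ["perro", "gato", "casa"]

shared = rng.normal(size=(3, dim))        # abstract, cross-lingual component
en_offset = 0.5 * rng.normal(size=dim)    # English-specific component
es_offset = 0.5 * rng.normal(size=dim)    # Spanish-specific component
en_vecs = shared + en_offset
es_vecs = shared + es_offset

# Approximate each language's identity component by its mean representation.
en_mean = en_vecs.mean(axis=0)
es_mean = es_vecs.mean(axis=0)

def translate(word, src_vocab, src_vecs, src_mean, tgt_vocab, tgt_vecs, tgt_mean):
    """Shift a source word's vector toward the target language's region,
    then return the nearest target-vocabulary word by cosine similarity."""
    v = src_vecs[src_vocab.index(word)] - src_mean + tgt_mean
    sims = tgt_vecs @ v / (np.linalg.norm(tgt_vecs, axis=1) * np.linalg.norm(v))
    return tgt_vocab[int(np.argmax(sims))]

print(translate("dog", en_vocab, en_vecs, en_mean, es_vocab, es_vecs, es_mean))
# Expected output on this toy data: "perro"
```

On the synthetic data the shift recovers the correct pairing exactly, since the only difference between languages is the additive offset; the paper's finding that much of the translation signal is non-linear suggests real mBERT representations are not this cleanly separable.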