Soft Layer Selection with Meta-Learning for Zero-Shot Cross-Lingual Transfer (2107.09840v1)

Published 21 Jul 2021 in cs.CL

Abstract: Multilingual pre-trained contextual embedding models (Devlin et al., 2019) have achieved impressive performance on zero-shot cross-lingual transfer tasks. Finding the most effective strategy for fine-tuning these models on high-resource languages so that they transfer well to the zero-shot languages is a non-trivial task. In this paper, we propose a novel meta-optimizer to soft-select which layers of the pre-trained model to freeze during fine-tuning. We train the meta-optimizer by simulating the zero-shot transfer scenario. Results on cross-lingual natural language inference show that our approach improves over the simple fine-tuning baseline and X-MAML (Nooralahzadeh et al., 2020).
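
To make the idea of "soft-selecting" layers to freeze more concrete, here is a minimal sketch of one possible reading, not the authors' implementation. It assumes each layer has a gate in (0, 1) that scales that layer's gradient step, so a gate near 0 behaves like freezing the layer and a gate near 1 behaves like ordinary fine-tuning. The names `soft_frozen_update` and `gate_logits` are hypothetical, and the gates are passed in as fixed values; in the paper they would be produced by the meta-optimizer, which is trained by simulating zero-shot transfer.

```python
import math


def sigmoid(x):
    """Map a real-valued logit to a gate value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-x))


def soft_frozen_update(params, grads, gate_logits, lr=0.1):
    """Apply one SGD step with each layer's update scaled by a soft gate.

    params, grads: lists of per-layer weight lists (toy stand-in for a model).
    gate_logits: one float per layer (hypothetical meta-parameters; assumed
        here to be given, not learned).
    """
    new_params = []
    for layer, layer_grads, logit in zip(params, grads, gate_logits):
        gate = sigmoid(logit)  # near 0: layer is effectively frozen; near 1: plain SGD
        new_params.append([w - lr * gate * g for w, g in zip(layer, layer_grads)])
    return new_params


# Toy usage: two one-weight "layers"; the first is almost frozen, the second is not.
updated = soft_frozen_update(
    params=[[1.0], [1.0]],
    grads=[[1.0], [1.0]],
    gate_logits=[-50.0, 50.0],
)
```

With these toy inputs the first layer's weight barely moves while the second takes an essentially full gradient step. A hard freeze/unfreeze decision is the limiting case where every gate is exactly 0 or 1.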



Authors (4)

Weijia Xu (23 papers)
Batool Haider (4 papers)
Jason Krone (9 papers)
Saab Mansour (32 papers)

Citations (6)


