Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER (1909.00153v3)

Published 31 Aug 2019 in cs.CL and cs.LG

Abstract: Contextual word embeddings (e.g. GPT, BERT, ELMo, etc.) have demonstrated state-of-the-art performance on various NLP tasks. Recent work with the multilingual version of BERT has shown that the model performs very well in zero-shot and zero-resource cross-lingual settings, where only labeled English data is used to finetune the model. We improve upon multilingual BERT's zero-resource cross-lingual performance via adversarial learning. We report the magnitude of the improvement on the multilingual MLDoc text classification and CoNLL 2002/2003 named entity recognition tasks. Furthermore, we show that language-adversarial training encourages BERT to align the embeddings of English documents and their translations, which may be the cause of the observed performance gains.

PDF Abstract

Summarize PDF Markdown Bookmark Chat (Pro)

Authors (3)

Phillip Keung (11 papers)
Yichao Lu (22 papers)
Vikas Bhardwaj (9 papers)

Citations (80)

View on Semantic Scholar

Adversarial Learning with Contextual Embeddings for Zero-resource Cross-lingual Classification and NER (1909.00153v3)

Related Papers