DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition (2211.08104v2)

Published 15 Nov 2022 in cs.CL

Abstract: We present DualNER, a simple and effective framework to make full use of both annotated source language corpus and unlabeled target language text for zero-shot cross-lingual named entity recognition (NER). In particular, we combine two complementary learning paradigms of NER, i.e., sequence labeling and span prediction, into a unified multi-task framework. After obtaining a sufficient NER model trained on the source data, we further train it on the target data in a {\it dual-teaching} manner, in which the pseudo-labels for one task are constructed from the prediction of the other task. Moreover, based on the span prediction, an entity-aware regularization is proposed to enhance the intrinsic cross-lingual alignment between the same entities in different languages. Experiments and analysis demonstrate the effectiveness of our DualNER. Code is available at https://github.com/lemon0830/dualNER.

PDF Abstract

Summarize Bookmark Chat (Pro)

Authors (6)

Jiali Zeng (24 papers)
Yufan Jiang (17 papers)
Yongjing Yin (19 papers)
Xu Wang (319 papers)
Binghuai Lin (20 papers)
Yunbo Cao (43 papers)

Citations (1)

View on Semantic Scholar

GitHub

GitHub - lemon0830/dualNER: code for dualNER, emnlp 2022 (3 stars)

DualNER: A Dual-Teaching framework for Zero-shot Cross-lingual Named Entity Recognition (2211.08104v2)

Related Papers

GitHub