NER-BERT: A Pre-trained Model for Low-Resource Entity Tagging (2112.00405v1)

Published 1 Dec 2021 in cs.CL and cs.AI

Abstract: Named entity recognition (NER) models generally perform poorly when large training datasets are unavailable for low-resource domains. Recently, pre-training a large-scale language model has become a promising direction for coping with the data scarcity issue. However, the underlying discrepancies between the language modeling and NER tasks could limit the models' performance, and pre-training for the NER task has rarely been studied since the collected NER datasets are generally either small or large but of low quality. In this paper, we construct a massive NER corpus of relatively high quality, and we pre-train a NER-BERT model based on the created dataset. Experimental results show that our pre-trained model can significantly outperform BERT as well as other strong baselines in low-resource scenarios across nine diverse domains. Moreover, a visualization of entity representations further indicates the effectiveness of NER-BERT for categorizing a variety of entities.
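
The downstream setup the abstract describes is standard token classification: a BERT-style encoder produces a label (e.g., B-PER, I-ORG, O) for each token, and the pre-trained checkpoint is fine-tuned on a small in-domain NER set. Below is a minimal sketch using the Hugging Face transformers library; the checkpoint name, tag set, and example sentence are placeholders (a released NER-BERT checkpoint, if available, would be substituted for the generic BERT model), not the authors' code.

```python
# Minimal sketch: BERT-style token classification for NER.
# "bert-base-cased" and the label set below are illustrative stand-ins.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

labels = ["O", "B-PER", "I-PER", "B-ORG", "I-ORG"]  # example tag set

tokenizer = AutoTokenizer.from_pretrained("bert-base-cased")
model = AutoModelForTokenClassification.from_pretrained(
    "bert-base-cased", num_labels=len(labels)
)

# Pre-tokenized input sentence; subword tokenization is handled by the tokenizer.
tokens = ["Pascale", "Fung", "works", "at", "HKUST"]
enc = tokenizer(tokens, is_split_into_words=True, return_tensors="pt")

with torch.no_grad():
    logits = model(**enc).logits          # shape: (1, seq_len, num_labels)

pred_ids = logits.argmax(dim=-1)[0].tolist()
print([labels[i] for i in pred_ids])      # per-subword predicted tags
```

In the low-resource scenarios studied in the paper, the same classification head is trained on only a few hundred to a few thousand labeled sentences per domain; the gain attributed to NER-BERT comes from the entity-aware pre-training rather than from this fine-tuning recipe.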

Authors (5)
  1. Zihan Liu (102 papers)
  2. Feijun Jiang (13 papers)
  3. Yuxiang Hu (25 papers)
  4. Chen Shi (55 papers)
  5. Pascale Fung (151 papers)
Citations (34)
