
Application of Pre-training Models in Named Entity Recognition (2002.08902v1)

Published 9 Feb 2020 in cs.CL, cs.LG, and stat.ML

Abstract: Named Entity Recognition (NER) is a fundamental NLP task for extracting entities from unstructured data. Previous methods for NER were based on machine learning or deep learning. Recently, pre-training models have significantly improved performance on multiple NLP tasks. In this paper, we first introduce the architecture and pre-training tasks of four common pre-training models: BERT, ERNIE, ERNIE2.0-tiny, and RoBERTa. We then apply these pre-training models to an NER task by fine-tuning and compare the effects of the different model architectures and pre-training tasks on the NER task. The experimental results show that RoBERTa achieved state-of-the-art results on the MSRA-2006 dataset.

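For readers unfamiliar with the fine-tuning setup the abstract describes, the sketch below shows the general pattern of fine-tuning a pre-trained encoder for NER as a token-classification task. The model name, label set, and example sentence are illustrative assumptions, not the paper's exact configuration or hyperparameters.

```python
# Minimal sketch of fine-tuning a pre-trained model for NER.
# Assumptions: bert-base-chinese as the encoder and a small BIO tag set;
# the paper itself compares BERT, ERNIE, ERNIE2.0-tiny, and RoBERTa on MSRA-2006.
import torch
from transformers import AutoTokenizer, AutoModelForTokenClassification

labels = ["O", "B-PER", "I-PER", "B-LOC", "I-LOC", "B-ORG", "I-ORG"]  # assumed BIO labels
model_name = "bert-base-chinese"  # assumption, stands in for any of the compared encoders

tokenizer = AutoTokenizer.from_pretrained(model_name)
model = AutoModelForTokenClassification.from_pretrained(model_name, num_labels=len(labels))

# One toy training step: tokenize a sentence, attach per-token label ids, backpropagate.
sentence = "小明在北京工作"  # toy example sentence
inputs = tokenizer(sentence, return_tensors="pt")
token_labels = torch.zeros(inputs["input_ids"].shape, dtype=torch.long)  # all "O" for illustration

outputs = model(**inputs, labels=token_labels)
outputs.loss.backward()  # fine-tuning updates the encoder weights and the token-classification head
```

In practice, this step runs inside a standard training loop over an annotated NER corpus, and only the choice of pre-trained checkpoint changes when comparing the different models.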
Authors (6)
  1. Yu Wang (939 papers)
  2. Yining Sun (8 papers)
  3. Zuchang Ma (1 paper)
  4. Lisheng Gao (2 papers)
  5. Yang Xu (277 papers)
  6. Ting Sun (26 papers)
Citations (20)