Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Few-Shot Named Entity Recognition: A Comprehensive Study (2012.14978v1)

Published 29 Dec 2020 in cs.CL, cs.IR, and cs.LG

Abstract: This paper presents a comprehensive study to efficiently build named entity recognition (NER) systems when a small number of in-domain labeled data is available. Based upon recent Transformer-based self-supervised pre-trained LLMs (PLMs), we investigate three orthogonal schemes to improve the model generalization ability for few-shot settings: (1) meta-learning to construct prototypes for different entity types, (2) supervised pre-training on noisy web data to extract entity-related generic representations and (3) self-training to leverage unlabeled in-domain data. Different combinations of these schemes are also considered. We perform extensive empirical comparisons on 10 public NER datasets with various proportions of labeled data, suggesting useful insights for future research. Our experiments show that (i) in the few-shot learning setting, the proposed NER schemes significantly improve or outperform the commonly used baseline, a PLM-based linear classifier fine-tuned on domain labels; (ii) We create new state-of-the-art results on both few-shot and training-free settings compared with existing methods. We will release our code and pre-trained models for reproducible research.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Jiaxin Huang (48 papers)
  2. Chunyuan Li (122 papers)
  3. Krishan Subudhi (2 papers)
  4. Damien Jose (7 papers)
  5. Shobana Balakrishnan (2 papers)
  6. Weizhu Chen (128 papers)
  7. Baolin Peng (72 papers)
  8. Jianfeng Gao (344 papers)
  9. Jiawei Han (263 papers)
Citations (76)