Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Adversarial Learning for Chinese NER from Crowd Annotations (1801.05147v1)

Published 16 Jan 2018 in cs.CL

Abstract: To quickly obtain new labeled data, we can choose crowdsourcing as an alternative way at lower cost in a short time. But as an exchange, crowd annotations from non-experts may be of lower quality than those from experts. In this paper, we propose an approach to performing crowd annotation learning for Chinese Named Entity Recognition (NER) to make full use of the noisy sequence labels from multiple annotators. Inspired by adversarial learning, our approach uses a common Bi-LSTM and a private Bi-LSTM for representing annotator-generic and -specific information. The annotator-generic information is the common knowledge for entities easily mastered by the crowd. Finally, we build our Chinese NE tagger based on the LSTM-CRF model. In our experiments, we create two data sets for Chinese NER tasks from two domains. The experimental results show that our system achieves better scores than strong baseline systems.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. YaoSheng Yang (1 paper)
  2. Meishan Zhang (70 papers)
  3. Wenliang Chen (33 papers)
  4. Wei Zhang (1489 papers)
  5. Haofen Wang (32 papers)
  6. Min Zhang (630 papers)
Citations (32)

Summary

We haven't generated a summary for this paper yet.