Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

PAI at SemEval-2023 Task 2: A Universal System for Named Entity Recognition with External Entity Information (2305.06099v1)

Published 10 May 2023 in cs.CL and cs.AI

Abstract: The MultiCoNER II task aims to detect complex, ambiguous, and fine-grained named entities in low-context situations and noisy scenarios like the presence of spelling mistakes and typos for multiple languages. The task poses significant challenges due to the scarcity of contextual information, the high granularity of the entities(up to 33 classes), and the interference of noisy data. To address these issues, our team {\bf PAI} proposes a universal Named Entity Recognition (NER) system that integrates external entity information to improve performance. Specifically, our system retrieves entities with properties from the knowledge base (i.e. Wikipedia) for a given text, then concatenates entity information with the input sentence and feeds it into Transformer-based models. Finally, our system wins 2 first places, 4 second places, and 1 third place out of 13 tracks. The code is publicly available at \url{https://github.com/diqiuzhuanzhuan/semeval-2023}.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Long Ma (116 papers)
  2. Kai Lu (35 papers)
  3. Tianbo Che (1 paper)
  4. Hailong Huang (13 papers)
  5. Weiguo Gao (27 papers)
  6. Xuan Li (129 papers)
Citations (1)