Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

DAMO-NLP at SemEval-2022 Task 11: A Knowledge-based System for Multilingual Named Entity Recognition (2203.00545v3)

Published 1 Mar 2022 in cs.CL and cs.LG

Abstract: The MultiCoNER shared task aims at detecting semantically ambiguous and complex named entities in short and low-context settings for multiple languages. The lack of contexts makes the recognition of ambiguous named entities challenging. To alleviate this issue, our team DAMO-NLP proposes a knowledge-based system, where we build a multilingual knowledge base based on Wikipedia to provide related context information to the named entity recognition (NER) model. Given an input sentence, our system effectively retrieves related contexts from the knowledge base. The original input sentences are then augmented with such context information, allowing significantly better contextualized token representations to be captured. Our system wins 10 out of 13 tracks in the MultiCoNER shared task.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (12)
  1. Xinyu Wang (186 papers)
  2. Yongliang Shen (47 papers)
  3. Jiong Cai (6 papers)
  4. Tao Wang (700 papers)
  5. Xiaobin Wang (39 papers)
  6. Pengjun Xie (85 papers)
  7. Fei Huang (409 papers)
  8. Weiming Lu (54 papers)
  9. Yueting Zhuang (164 papers)
  10. Kewei Tu (74 papers)
  11. Wei Lu (325 papers)
  12. Yong Jiang (194 papers)
Citations (41)

Summary

We haven't generated a summary for this paper yet.