Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Survey on Arabic Named Entity Recognition: Past, Recent Advances, and Future Trends (2302.03512v3)

Published 7 Feb 2023 in cs.CL

Abstract: As more and more Arabic texts emerged on the Internet, extracting important information from these Arabic texts is especially useful. As a fundamental technology, Named entity recognition (NER) serves as the core component in information extraction technology, while also playing a critical role in many other NLP systems, such as question answering and knowledge graph building. In this paper, we provide a comprehensive review of the development of Arabic NER, especially the recent advances in deep learning and pre-trained LLM. Specifically, we first introduce the background of Arabic NER, including the characteristics of Arabic and existing resources for Arabic NER. Then, we systematically review the development of Arabic NER methods. Traditional Arabic NER systems focus on feature engineering and designing domain-specific rules. In recent years, deep learning methods achieve significant progress by representing texts via continuous vector representations. With the growth of pre-trained LLM, Arabic NER yields better performance. Finally, we conclude the method gap between Arabic NER and NER methods from other languages, which helps outline future directions for Arabic NER.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Xiaoye Qu (62 papers)
  2. Yingjie Gu (5 papers)
  3. Qingrong Xia (13 papers)
  4. Zechang Li (4 papers)
  5. Zhefeng Wang (39 papers)
  6. Baoxing Huai (28 papers)
Citations (13)
Youtube Logo Streamline Icon: https://streamlinehq.com