Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Named Entity Recognition System for Sindhi Language (1910.03475v1)

Published 28 Sep 2019 in cs.CL

Abstract: Named Entity Recognition (NER) System aims to extract the existing information into the following categories such as: Persons Name, Organization, Location, Date and Time, Term, Designation and Short forms. Now, it is considered to be important aspect for many natural languages processing (NLP) tasks such as: information retrieval system, machine translation system, information extraction system and question answering. Even at a surface level, the understanding of the named entities involved in a document gives richer analytical framework and cross referencing. It has been used for different Arabic Script-Based languages like, Arabic, Persian and Urdu but, Sindhi could not come into being yet. This paper explains the problem of NER in the framework of Sindhi Language and provides relevant solution. The system is developed to tag ten different Named Entities. We have used Ruled based approach for NER system of Sindhi Language. For the training and testing, 936 words were used and calculated performance accuracy of 98.71%.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Awais Khan Jumani (3 papers)
  2. Mashooque Ahmed Memon (3 papers)
  3. Fida Hussain Khoso (2 papers)
  4. Anwar Ali Sanjrani (2 papers)
  5. Safeeullah Soomro (28 papers)
Citations (9)

Summary

We haven't generated a summary for this paper yet.