Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Efficient Indexing and Searching Technique for Information Retrieval for Urdu Language (2103.00532v1)

Published 28 Feb 2021 in cs.IR

Abstract: Indexing techniques are used to improve retrieval of data in response to certain search condition. Inverted files are mostly used for creating indexes. This paper proposes indexing technique for Urdu language. Language processing step in Index creation is different for a particular language. We discuss index creation steps specifically for Urdu language. We explore morphological rules for Urdu language and implement these rules to create Urdu stemmer. We implement our proposed technique with different implementations and compare results. We suggest that indexes should be created without stop words and also index file should be an order index file.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Muhammad Mudassar Qureshi (2 papers)
  2. Muhammad Shoaib (16 papers)
  3. Kalsoom (1 paper)

Summary

We haven't generated a summary for this paper yet.