Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Machine Learning Approaches for Amharic Parts-of-speech Tagging (2001.03324v1)

Published 10 Jan 2020 in cs.CL, cs.IR, and cs.LG

Abstract: Part-of-speech (POS) tagging is considered as one of the basic but necessary tools which are required for many NLP applications such as word sense disambiguation, information retrieval, information processing, parsing, question answering, and machine translation. Performance of the current POS taggers in Amharic is not as good as that of the contemporary POS taggers available for English and other European languages. The aim of this work is to improve POS tagging performance for the Amharic language, which was never above 91%. Usage of morphological knowledge, an extension of the existing annotated data, feature extraction, parameter tuning by applying grid search and the tagging algorithms have been examined and obtained significant performance difference from the previous works. We have used three different datasets for POS experiments.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Ibrahim Gashaw (3 papers)
  2. H L. Shashirekha (4 papers)
Citations (16)