Natural Language Processing in Ethiopian Languages: Current State, Challenges, and Opportunities (2303.14406v1)
Published 25 Mar 2023 in cs.CL
Abstract: This survey delves into the current state of NLP for four Ethiopian languages: Amharic, Afaan Oromo, Tigrinya, and Wolaytta. Through this paper, we identify key challenges and opportunities for NLP research in Ethiopia. Furthermore, we provide a centralized repository on GitHub that contains publicly available resources for various NLP tasks in these languages. This repository can be updated periodically with contributions from other researchers. Our objective is to identify research gaps and disseminate the information to NLP researchers interested in Ethiopian languages and encourage future research in this domain.