Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
80 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Lanfrica: A Participatory Approach to Documenting Machine Translation Research on African Languages (2008.07302v1)

Published 3 Aug 2020 in cs.CY and cs.CL

Abstract: Over the years, there have been campaigns to include the African languages in the growing research on machine translation (MT) in particular, and NLP in general. Africa has the highest language diversity, with 1500-2000 documented languages and many more undocumented or extinct languages(Lewis, 2009; Bendor-Samuel, 2017). This makes it hard to keep track of the MT research, models and dataset that have been developed for some of them. As the internet and social media make up the daily lives of more than half of the world(Lin, 2020), as well as over 40% of Africans(Campbell, 2019), online platforms can be useful in creating accessibility to researches, benchmarks and datasets in these African languages, thereby improving reproducibility and sharing of existing research and their results. In this paper, we introduce Lanfrica, a novel, on-going framework that employs a participatory approach to documenting researches, projects, benchmarks and dataset on African languages.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (2)
  1. Chris C. Emezue (6 papers)
  2. Bonaventure F. P. Dossou (30 papers)
Citations (5)