Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Masakhane -- Machine Translation For Africa (2003.11529v1)

Published 13 Mar 2020 in cs.CL

Abstract: Africa has over 2000 languages. Despite this, African languages account for a small portion of available resources and publications in NLP. This is due to multiple factors, including: a lack of focus from government and funding, discoverability, a lack of community, sheer language complexity, difficulty in reproducing papers and no benchmarks to compare techniques. To begin to address the identified problems, MASAKHANE, an open-source, continent-wide, distributed, online research effort for machine translation for African languages, was founded. In this paper, we discuss our methodology for building the community and spurring research from the African continent, as well as outline the success of the community in terms of addressing the identified problems affecting African NLP.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (25)
  1. Iroro Orife (20 papers)
  2. Julia Kreutzer (44 papers)
  3. Blessing Sibanda (8 papers)
  4. Daniel Whitenack (5 papers)
  5. Kathleen Siminyu (8 papers)
  6. Laura Martinus (6 papers)
  7. Jamiil Toure Ali (3 papers)
  8. Jade Abbott (8 papers)
  9. Vukosi Marivate (47 papers)
  10. Salomon Kabongo (10 papers)
  11. Musie Meressa (2 papers)
  12. Espoir Murhabazi (2 papers)
  13. Orevaoghene Ahia (23 papers)
  14. Elan van Biljon (7 papers)
  15. Arshath Ramkilowan (2 papers)
  16. Adewale Akinfaderin (7 papers)
  17. Alp Öktem (8 papers)
  18. Wole Akin (1 paper)
  19. Ghollah Kioko (2 papers)
  20. Kevin Degila (3 papers)
Citations (65)