
Enhancing Translation for Indigenous Languages: Experiments with Multilingual Models (2305.17406v1)

Published 27 May 2023 in cs.CL

Abstract: This paper describes CIC NLP's submission to the AmericasNLP 2023 Shared Task on machine translation systems for indigenous languages of the Americas. We present system descriptions for three methods. We used two multilingual models, M2M-100 and mBART50, and one bilingual (one-to-one) model, the Helsinki NLP Spanish-English translation model, and experimented with different transfer learning setups. We experimented with 11 languages of the Americas and report the setups we used as well as the results we achieved. Overall, the mBART setup improved upon the baseline for three of the eleven languages.

Authors (6)
  1. Atnafu Lambebo Tonja
  2. Hellina Hailu Nigatu
  3. Olga Kolesnikova
  4. Grigori Sidorov
  5. Alexander Gelbukh
  6. Jugal Kalita
Citations (3)
