Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Exploration of Data Augmentation Techniques for Improving English to Tigrinya Translation (2103.16789v2)

Published 31 Mar 2021 in cs.CL

Abstract: It has been shown that the performance of neural machine translation (NMT) drops starkly in low-resource conditions, often requiring large amounts of auxiliary data to achieve competitive results. An effective method of generating auxiliary data is back-translation of target language sentences. In this work, we present a case study of Tigrinya where we investigate several back-translation methods to generate synthetic source sentences. We find that in low-resource conditions, back-translation by pivoting through a higher-resource language related to the target language proves most effective resulting in substantial improvements over baselines.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Lidia Kidane (1 paper)
  2. Sachin Kumar (68 papers)
  3. Yulia Tsvetkov (142 papers)
Citations (5)