2000 character limit reached
CUNI Systems for the WMT22 Czech-Ukrainian Translation Task (2212.00486v1)
Published 1 Dec 2022 in cs.CL
Abstract: We present Charles University submissions to the WMT22 General Translation Shared Task on Czech-Ukrainian and Ukrainian-Czech machine translation. We present two constrained submissions based on block back-translation and tagged back-translation and experiment with rule-based romanization of Ukrainian. Our results show that the romanization only has a minor effect on the translation quality. Further, we describe Charles Translator, a system that was developed in March 2022 as a response to the migration from Ukraine to the Czech Republic. Compared to our constrained systems, it did not use the romanization and used some proprietary data sources.
- Martin Popel (14 papers)
- Jindřich Libovický (36 papers)
- Jindřich Helcl (21 papers)