2000 character limit reached
CUNI Submission in WMT22 General Task (2211.16174v1)
Published 29 Nov 2022 in cs.CL
Abstract: We present the CUNI-Bergamot submission for the WMT22 General translation task. We compete in English$\rightarrow$Czech direction. Our submission further explores block backtranslation techniques. Compared to the previous work, we measure performance in terms of COMET score and named entities translation accuracy. We evaluate performance of MBR decoding compared to traditional mixed backtranslation training and we show a possible synergy when using both of the techniques simultaneously. The results show that both approaches are effective means of improving translation quality and they yield even better results when combined.
- Josef Jon (12 papers)
- Martin Popel (14 papers)
- Ondřej Bojar (91 papers)