Polish to English Statistical Machine Translation

Published 30 Sep 2015 in cs.CL and stat.ML | (1510.00001v1)

Abstract: This research explores the effects of various training settings on a Polish to English Statistical Machine Translation system for spoken language. Various elements of the TED, Europarl, and OPUS parallel text corpora were used as the basis for training of LLMs, for development, tuning and testing of the translation system. The BLEU, NIST, METEOR and TER metrics were used to evaluate the effects of the data preparations on the translation results.