Low-resource neural machine translation with morphological modeling (2404.02392v1)
Abstract: Morphological modeling in neural machine translation (NMT) is a promising approach to achieving open-vocabulary machine translation for morphologically-rich languages. However, existing methods such as sub-word tokenization and character-based models are limited to the surface forms of words. In this work, we propose a framework solution for modeling complex morphology in low-resource settings. A two-tier transformer architecture is chosen to encode morphological information at the input. At the target-side output, a multi-task multi-label training scheme coupled with a beam search-based decoder is found to improve machine translation performance. An attention augmentation scheme for the transformer model is proposed in a generic form that allows integration of pre-trained language models and also facilitates modeling of word-order relationships between the source and target languages. Several data augmentation techniques are evaluated and shown to increase translation performance in low-resource settings. We evaluate our proposed solution on Kinyarwanda-English translation using public-domain parallel text. Our final models achieve competitive performance relative to large multilingual models. We hope that our results will motivate broader use of explicit morphological information and of the proposed model and data augmentation techniques in low-resource NMT.
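The paper itself details the architecture; as a rough illustration of the generic attention-augmentation idea the abstract describes (an extra cross-attention block over a pre-trained model's hidden states, alongside the usual encoder cross-attention), here is a minimal PyTorch sketch. All module names, dimensions, and the placement of the extra block are illustrative assumptions, not the authors' implementation.

```python
import torch
import torch.nn as nn

class AugmentedDecoderLayer(nn.Module):
    """Hypothetical transformer decoder layer with one extra cross-attention
    block that attends over hidden states of a (typically frozen) pre-trained
    language model -- a sketch of the generic attention-augmentation scheme."""

    def __init__(self, d_model: int = 512, n_heads: int = 8, d_ff: int = 2048):
        super().__init__()
        self.self_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.src_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        # The augmentation: attention over pre-trained LM states.
        self.lm_attn = nn.MultiheadAttention(d_model, n_heads, batch_first=True)
        self.ffn = nn.Sequential(
            nn.Linear(d_model, d_ff), nn.ReLU(), nn.Linear(d_ff, d_model)
        )
        self.norms = nn.ModuleList([nn.LayerNorm(d_model) for _ in range(4)])

    def forward(self, y, enc_out, lm_out, tgt_mask=None):
        # Masked self-attention over the target prefix.
        h, _ = self.self_attn(y, y, y, attn_mask=tgt_mask)
        y = self.norms[0](y + h)
        # Standard cross-attention over the NMT encoder output.
        h, _ = self.src_attn(y, enc_out, enc_out)
        y = self.norms[1](y + h)
        # Augmented cross-attention over pre-trained LM states.
        h, _ = self.lm_attn(y, lm_out, lm_out)
        y = self.norms[2](y + h)
        # Position-wise feed-forward block.
        return self.norms[3](y + self.ffn(y))

if __name__ == "__main__":
    layer = AugmentedDecoderLayer()
    y = torch.randn(2, 7, 512)     # target prefix states
    enc = torch.randn(2, 11, 512)  # NMT encoder output
    lm = torch.randn(2, 11, 512)   # pre-trained LM states
    print(layer(y, enc, lm).shape)  # torch.Size([2, 7, 512])
```

Because the scheme is stated in generic form, the same extra attention block could in principle attend to any auxiliary representation, which is how it can also serve the word-order modeling mentioned in the abstract.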
Author: Antoine Nzeyimana