Character-Aware Decoder for Translation into Morphologically Rich Languages (1809.02223v5)
Abstract: Neural machine translation (NMT) systems operate primarily on words (or sub-words), ignoring lower-level patterns of morphology. We present a character-aware decoder designed to capture such patterns when translating into morphologically rich languages. We achieve character-awareness by augmenting both the softmax and embedding layers of an attention-based encoder-decoder model with convolutional neural networks that operate on the spelling of a word. To investigate performance on a wide variety of morphological phenomena, we translate English into 14 typologically diverse target languages using the TED multi-target dataset. In this low-resource setting, the character-aware decoder provides consistent improvements with BLEU score gains of up to $+3.05$. In addition, we analyze the relationship between the gains obtained and properties of the target language and find evidence that our model does indeed exploit morphological patterns.
- Adithya Renduchintala
- Pamela Shapiro
- Kevin Duh
- Philipp Koehn
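The abstract describes composing word representations from spellings with convolutional networks, but gives no implementation details. Below is a minimal PyTorch sketch of one way a character-level CNN could build a word vector from its character sequence; such a module could in principle feed both the embedding and softmax layers of a decoder. All names and hyperparameters here are illustrative assumptions, not the authors' code.

```python
# Minimal sketch (not the authors' implementation): a character-CNN word
# embedder. Character embeddings are convolved with several filter widths,
# max-pooled over positions, and concatenated into a word representation.
# Hyperparameters (char_emb_dim, kernel widths, etc.) are illustrative only.
import torch
import torch.nn as nn


class CharCNNWordEmbedder(nn.Module):
    def __init__(self, num_chars, char_emb_dim=32, word_emb_dim=256,
                 kernel_widths=(3, 5), pad_idx=0):
        super().__init__()
        self.char_emb = nn.Embedding(num_chars, char_emb_dim, padding_idx=pad_idx)
        # One 1-D convolution per filter width; each contributes an equal
        # share of the final word-embedding dimension.
        filters_per_width = word_emb_dim // len(kernel_widths)
        self.convs = nn.ModuleList(
            nn.Conv1d(char_emb_dim, filters_per_width, w, padding=w - 1)
            for w in kernel_widths
        )

    def forward(self, char_ids):
        # char_ids: (batch_of_words, max_word_length) integer character indices
        x = self.char_emb(char_ids)        # (B, L, char_emb_dim)
        x = x.transpose(1, 2)              # (B, char_emb_dim, L) for Conv1d
        pooled = [conv(x).max(dim=2).values for conv in self.convs]
        return torch.tanh(torch.cat(pooled, dim=1))  # (B, word_emb_dim)


if __name__ == "__main__":
    # Usage with made-up character indices for two 12-character "words".
    embedder = CharCNNWordEmbedder(num_chars=100)
    fake_spellings = torch.randint(1, 100, (2, 12))
    vecs = embedder(fake_spellings)
    print(vecs.shape)  # torch.Size([2, 256])
```

In a character-aware decoder of the kind the abstract describes, vectors produced this way could stand in for (or be combined with) the rows of the target-side embedding matrix and the softmax output matrix, letting words with shared morphology share parameters through their spellings.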