The MeMAD Submission to the WMT18 Multimodal Translation Task (1808.10802v2)

Published 31 Aug 2018 in cs.CL

Abstract: This paper describes the MeMAD project entry to the WMT Multimodal Machine Translation Shared Task. We propose adapting the Transformer neural machine translation (NMT) architecture to a multi-modal setting. In this paper, we also describe the preliminary experiments with text-only translation systems leading us up to this choice. We have the top scoring system for both English-to-German and English-to-French, according to the automatic metrics for flickr18. Our experiments show that the effect of the visual features in our system is small. Our largest gains come from the quality of the underlying text-only NMT system. We find that appropriate use of additional data is effective.

Authors (11)
  1. Stig-Arne Grönroos (11 papers)
  2. Benoit Huet (5 papers)
  3. Mikko Kurimo (27 papers)
  4. Jorma Laaksonen (37 papers)
  5. Bernard Merialdo (1 paper)
  6. Phu Pham (13 papers)
  7. Mats Sjöberg (2 papers)
  8. Umut Sulubacak (4 papers)
  9. Jörg Tiedemann (41 papers)
  10. Raúl Vázquez (12 papers)
  11. Raphael Troncy (4 papers)
Citations (62)