
Machine Translation of Restaurant Reviews: New Corpus for Domain Adaptation and Robustness (1910.14589v1)

Published 31 Oct 2019 in cs.CL

Abstract: We share a French-English parallel corpus of Foursquare restaurant reviews (https://europe.naverlabs.com/research/natural-language-processing/machine-translation-of-restaurant-reviews), and define a new task to encourage research on Neural Machine Translation robustness and domain adaptation, in a real-world scenario where better-quality MT would be greatly beneficial. We discuss the challenges of such user-generated content, and train good baseline models that build upon the latest techniques for MT robustness. We also perform an extensive evaluation (automatic and human) that shows significant improvements over existing online systems. Finally, we propose task-specific metrics based on sentiment analysis or translation accuracy of domain-specific polysemous words.

Authors (6)
  1. Ioan Calapodescu
  2. Marc Dymetman
  3. Claude Roux
  4. Jean-Luc Meunier
  5. Vassilina Nikoulina
  6. Alexandre Bérard
Citations (27)
