Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
38 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

On the Evaluation of Machine Translation for Terminology Consistency (2106.11891v2)

Published 22 Jun 2021 in cs.CL

Abstract: As neural machine translation (NMT) systems become an important part of professional translator pipelines, a growing body of work focuses on combining NMT with terminologies. In many scenarios and particularly in cases of domain adaptation, one expects the MT output to adhere to the constraints provided by a terminology. In this work, we propose metrics to measure the consistency of MT output with regards to a domain terminology. We perform studies on the COVID-19 domain over 5 languages, also performing terminology-targeted human evaluation. We open-source the code for computing all proposed metrics: https://github.com/mahfuzibnalam/terminology_evaluation

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Md Mahfuz ibn Alam (9 papers)
  2. Antonios Anastasopoulos (111 papers)
  3. Laurent Besacier (76 papers)
  4. James Cross (22 papers)
  5. Matthias Gallé (31 papers)
  6. Philipp Koehn (60 papers)
  7. Vassilina Nikoulina (28 papers)
Citations (32)
Github Logo Streamline Icon: https://streamlinehq.com