Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Centroid-Based Efficient Minimum Bayes Risk Decoding (2402.11197v2)

Published 17 Feb 2024 in cs.CL

Abstract: Minimum Bayes risk (MBR) decoding achieved state-of-the-art translation performance by using COMET, a neural metric that has a high correlation with human evaluation. However, MBR decoding requires quadratic time since it computes the expected score between a translation hypothesis and all reference translations. We propose centroid-based MBR (CBMBR) decoding to improve the speed of MBR decoding. Our method clusters the reference translations in the feature space, and then calculates the score using the centroids of each cluster. The experimental results show that our CBMBR not only improved the decoding speed of the expected score calculation 5.7 times, but also outperformed vanilla MBR decoding in translation quality by up to 0.5 COMET in the WMT'22 En$\leftrightarrow$Ja, En$\leftrightarrow$De, En$\leftrightarrow$Zh, and WMT'23 En$\leftrightarrow$Ja translation tasks.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Hiroyuki Deguchi (8 papers)
  2. Yusuke Sakai (36 papers)
  3. Hidetaka Kamigaito (62 papers)
  4. Taro Watanabe (76 papers)
  5. Hideki Tanaka (6 papers)
  6. Masao Utiyama (39 papers)
Citations (7)
X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets