Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Evaluating the Evaluation Metrics for Style Transfer: A Case Study in Multilingual Formality Transfer (2110.10668v1)

Published 20 Oct 2021 in cs.CL

Abstract: While the field of style transfer (ST) has been growing rapidly, it has been hampered by a lack of standardized practices for automatic evaluation. In this paper, we evaluate leading ST automatic metrics on the oft-researched task of formality style transfer. Unlike previous evaluations, which focus solely on English, we expand our focus to Brazilian-Portuguese, French, and Italian, making this work the first multilingual evaluation of metrics in ST. We outline best practices for automatic evaluation in (formality) style transfer and identify several models that correlate well with human judgments and are robust across languages. We hope that this work will help accelerate development in ST, where human evaluation is often challenging to collect.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Eleftheria Briakou (21 papers)
  2. Sweta Agrawal (35 papers)
  3. Joel Tetreault (37 papers)
  4. Marine Carpuat (56 papers)
Citations (28)

Summary

We haven't generated a summary for this paper yet.