Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Call for Standardization and Validation of Text Style Transfer Evaluation (2306.00539v1)

Published 1 Jun 2023 in cs.LG and cs.CL

Abstract: Text Style Transfer (TST) evaluation is, in practice, inconsistent. Therefore, we conduct a meta-analysis on human and automated TST evaluation and experimentation that thoroughly examines existing literature in the field. The meta-analysis reveals a substantial standardization gap in human and automated evaluation. In addition, we also find a validation gap: only few automated metrics have been validated using human experiments. To this end, we thoroughly scrutinize both the standardization and validation gap and reveal the resulting pitfalls. This work also paves the way to close the standardization and validation gap in TST evaluation by calling out requirements to be met by future research.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Phil Ostheimer (7 papers)
  2. Mayank Nagda (7 papers)
  3. Marius Kloft (65 papers)
  4. Sophie Fellenz (21 papers)
Citations (10)

Summary

We haven't generated a summary for this paper yet.