Multidimensional Evaluation for Text Style Transfer Using ChatGPT (2304.13462v1)

Published 26 Apr 2023 in cs.CL

Abstract: We investigate the potential of ChatGPT as a multidimensional evaluator for the task of Text Style Transfer, alongside, and in comparison to, existing automatic metrics as well as human judgments. We focus on a zero-shot setting, i.e. prompting ChatGPT with specific task instructions, and test its performance on three commonly used dimensions of text style transfer evaluation: style strength, content preservation, and fluency. We perform a comprehensive correlation analysis for two transfer directions (and overall) at different levels. Compared to existing automatic metrics, ChatGPT achieves competitive correlations with human judgments. These preliminary results are expected to provide a first glimpse into the role of LLMs in the multidimensional evaluation of stylized text generation.
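
The correlation analysis mentioned in the abstract can be illustrated with a minimal sketch. The abstract does not say which correlation coefficient is used, so Spearman rank correlation is an assumption here, and the function names and the scores in the usage note are hypothetical and not taken from the paper. The sketch compares one evaluator's scores with human ratings for the same outputs along a single dimension (for example, fluency).

```python
from statistics import mean


def average_ranks(values):
    """Return 1-based ranks; tied values share the average of their positions."""
    order = sorted(range(len(values)), key=lambda i: values[i])
    ranks = [0.0] * len(values)
    i = 0
    while i < len(order):
        j = i
        # Extend j over the run of items tied with the item at position i.
        while j + 1 < len(order) and values[order[j + 1]] == values[order[i]]:
            j += 1
        avg = (i + j) / 2 + 1
        for k in range(i, j + 1):
            ranks[order[k]] = avg
        i = j + 1
    return ranks


def pearson(x, y):
    """Pearson correlation of two equal-length, non-constant sequences."""
    mx, my = mean(x), mean(y)
    cov = sum((a - mx) * (b - my) for a, b in zip(x, y))
    var_x = sum((a - mx) ** 2 for a in x)
    var_y = sum((b - my) ** 2 for b in y)
    return cov / (var_x * var_y) ** 0.5


def spearman(x, y):
    """Spearman correlation: Pearson correlation computed on the ranks."""
    return pearson(average_ranks(x), average_ranks(y))
```

As a usage example with made-up numbers, `spearman([1, 2, 3, 4, 5], [2, 1, 4, 3, 5])` gives 0.8: the evaluator swaps two adjacent pairs relative to the human ranking but otherwise agrees with it. Running the same comparison once per dimension and per transfer direction would give a table of correlations of the kind the abstract describes.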

Authors (3)
  1. Huiyuan Lai (17 papers)
  2. Antonio Toral (35 papers)
  3. Malvina Nissim (52 papers)
Citations (16)
