PromptOptMe: Error-Aware Prompt Compression for LLM-based MT Evaluation Metrics (2412.16120v1)
Abstract: Evaluating the quality of machine-generated natural language content is a challenging task in NLP. Recently, LLMs like GPT-4 have been employed for this purpose, but they are computationally expensive due to the extensive token usage required by complex evaluation prompts. In this paper, we propose a prompt optimization approach that uses a smaller, fine-tuned LLM to compress the input data of the evaluation prompt, thus reducing token usage and computational cost when using larger LLMs for downstream evaluation. Our method involves a two-stage fine-tuning process: supervised fine-tuning followed by preference optimization to refine the model's outputs based on human preferences. We focus on Machine Translation (MT) evaluation and use the GEMBA-MQM metric as a starting point. Our results show a $2.37\times$ reduction in token usage without any loss in evaluation quality. This work makes state-of-the-art LLM-based metrics like GEMBA-MQM more cost-effective and efficient, enhancing their accessibility for broader use.
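To make the compress-then-evaluate pipeline concrete, below is a minimal Python sketch under stated assumptions: the compressor checkpoint name, its prompt format, and the GEMBA-MQM-style template are illustrative placeholders, not the paper's released artifacts. It shows the small fine-tuned model shortening the source and translation segments before they are inserted into the evaluation prompt sent to the larger LLM.

```python
# Sketch of error-aware prompt compression for LLM-based MT evaluation.
# Assumptions: "my-org/mt-segment-compressor" is a hypothetical fine-tuned
# small LLM, and the prompt strings below are illustrative, not the paper's.
from transformers import AutoModelForCausalLM, AutoTokenizer

COMPRESSOR = "my-org/mt-segment-compressor"  # hypothetical checkpoint name
tok = AutoTokenizer.from_pretrained(COMPRESSOR)
model = AutoModelForCausalLM.from_pretrained(COMPRESSOR)

def compress(segment: str, max_new_tokens: int = 64) -> str:
    """Ask the small model for a shorter rewrite of the segment that keeps
    the spans an error-aware MQM evaluation is likely to need."""
    prompt = f"Compress, keeping likely error spans:\n{segment}\nCompressed:"
    ids = tok(prompt, return_tensors="pt").input_ids
    out = model.generate(ids, max_new_tokens=max_new_tokens)
    # Strip the echoed prompt tokens and return only the continuation.
    return tok.decode(out[0][ids.shape[1]:], skip_special_tokens=True).strip()

def build_eval_prompt(src: str, hyp: str) -> str:
    """GEMBA-MQM-style evaluation prompt built from compressed inputs;
    the larger LLM (e.g. GPT-4) then annotates MQM error spans."""
    return (
        "Identify MQM errors (critical/major/minor) in the translation.\n"
        f"Source: {compress(src)}\n"
        f"Translation: {compress(hyp)}\n"
    )
```

Because only the shortened segments reach the large evaluator LLM, the per-example token count of the evaluation prompt drops, which is where the reported cost savings would come from.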