Uncertainty Quantification for LLMs through Minimum Bayes Risk: Bridging Confidence and Consistency (2502.04964v4)

Published 7 Feb 2025 in cs.CL

Abstract: Uncertainty quantification (UQ) methods for LLMs encompass a variety of approaches, with two major types being particularly prominent: information-based methods, which focus on model confidence expressed as token probabilities, and consistency-based methods, which assess the semantic relationship between multiple outputs generated using repeated sampling. Several recent methods have combined these two approaches to boost UQ performance. However, they sometimes fail to outperform much simpler baseline methods. Our investigation reveals distinctive characteristics of LLMs as probabilistic models, which help to explain why these UQ methods underperform in certain tasks. Our work discusses the fundamental approach to constructing uncertainty measures that directly links uncertainty with the minimum Bayes risks achieved by LLM decoding. Building on these findings, we propose a novel approach to integrating model confidence with output consistency, resulting in a family of efficient and robust UQ methods. We evaluate our approach across various tasks such as question answering, abstractive summarization, and machine translation, demonstrating sizable improvements over state-of-the-art UQ approaches.

Authors (7)
