
Conformal Prediction with Large Language Models for Multi-Choice Question Answering (2305.18404v3)

Published 28 May 2023 in cs.CL, cs.LG, and stat.ML

Abstract: As LLMs continue to be widely developed, robust uncertainty quantification techniques will become crucial for their safe deployment in high-stakes scenarios. In this work, we explore how conformal prediction can be used to provide uncertainty quantification in LLMs for the specific task of multiple-choice question-answering. We find that the uncertainty estimates from conformal prediction are tightly correlated with prediction accuracy. This observation can be useful for downstream applications such as selective classification and filtering out low-quality predictions. We also investigate the robustness of the exchangeability assumption required by conformal prediction when calibrating on one subject and evaluating on out-of-subject questions, which may be a more realistic scenario for many practical applications. Our work contributes towards more trustworthy and reliable usage of LLMs in safety-critical situations, where robust guarantees of error rate are required.
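The abstract describes applying conformal prediction to multiple-choice QA: a held-out calibration set is used to turn the model's softmax scores over answer choices into prediction sets with a coverage guarantee. The following is a minimal sketch of standard split conformal prediction for this setting, not the paper's exact procedure; the function name and the choice of nonconformity score (one minus the probability of the true answer) are illustrative assumptions.

```python
import numpy as np

def conformal_prediction_sets(cal_probs, cal_labels, test_probs, alpha=0.1):
    """Split conformal prediction for multiple-choice QA (illustrative sketch).

    cal_probs: (n, k) softmax probabilities over k answer choices (calibration set)
    cal_labels: (n,) indices of the correct choice for each calibration question
    test_probs: (m, k) softmax probabilities for m test questions
    alpha: target miscoverage rate (sets contain the true answer ~ (1 - alpha) of the time)
    """
    n = len(cal_labels)
    # Nonconformity score: 1 - probability assigned to the true answer.
    scores = 1.0 - cal_probs[np.arange(n), cal_labels]
    # Conformal quantile with the finite-sample correction (n + 1),
    # clipped to 1.0 so small calibration sets do not break np.quantile.
    q_level = min(np.ceil((n + 1) * (1 - alpha)) / n, 1.0)
    q = np.quantile(scores, q_level, method="higher")
    # Prediction set: every choice whose nonconformity score is within the threshold.
    return [np.where(1.0 - p <= q)[0].tolist() for p in test_probs]
```

Under exchangeability of calibration and test questions, the returned sets contain the correct answer with probability at least 1 - alpha; the abstract's out-of-subject experiments probe what happens when that assumption is strained.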

Authors (7)
  1. Bhawesh Kumar
  2. Charlie Lu
  3. Gauri Gupta
  4. Anil Palepu
  5. David Bellamy
  6. Ramesh Raskar
  7. Andrew Beam
Citations (49)