
Uncertainty quantification in fine-tuned LLMs using LoRA ensembles (2402.12264v2)

Published 19 Feb 2024 in cs.LG, cs.AI, cs.CL, and stat.ML

Abstract: Fine-tuning LLMs can improve task-specific performance, although a general understanding of what the fine-tuned model has learned and forgotten, and how far its predictions can be trusted, is still missing. We derive principled uncertainty quantification for fine-tuned LLMs with posterior approximations using computationally efficient low-rank adaptation (LoRA) ensembles. We analyze three common multiple-choice datasets using LoRA ensembles based on Mistral-7b, and draw quantitative and qualitative conclusions on their perceived complexity and on the balance between retained prior knowledge and domain-specific adaptation during and after fine-tuning. We identify unexpected retention of acquired knowledge during fine-tuning in the overfitting regime.
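The abstract does not spell out how the LoRA ensemble is turned into uncertainty estimates, so the following minimal sketch illustrates the standard ensemble-based decomposition such an approach typically relies on: average the members' predictive distributions over the answer options, then split the predictive entropy into an expected (aleatoric) part and a mutual-information (epistemic) part. The helper name `ensemble_uncertainty`, the toy probabilities, and the use of plain NumPy are illustrative assumptions, not the authors' implementation.

```python
import numpy as np

def ensemble_uncertainty(member_probs):
    """Decompose predictive uncertainty for one multiple-choice question.

    member_probs: array of shape (n_members, n_options); each row is one
    ensemble member's softmax distribution over the answer options.
    Returns total (predictive) entropy, expected (aleatoric) entropy, and
    their difference, the mutual information (epistemic part), in nats.
    """
    p = np.asarray(member_probs, dtype=float)
    eps = 1e-12                                    # guard against log(0)
    mean_p = p.mean(axis=0)                        # ensemble-averaged predictive distribution
    total = -np.sum(mean_p * np.log(mean_p + eps))             # H[E_m p_m]
    expected = float(np.mean(-np.sum(p * np.log(p + eps), axis=1)))  # E_m H[p_m]
    mutual_info = total - expected                              # epistemic component
    return total, expected, mutual_info

# Toy example: four hypothetical ensemble members answering one 4-way question.
# In practice each row would come from a separately fine-tuned LoRA adapter
# applied to the same frozen base model (e.g. Mistral-7b).
probs = [
    [0.70, 0.10, 0.10, 0.10],
    [0.65, 0.15, 0.10, 0.10],
    [0.30, 0.50, 0.10, 0.10],   # one member disagrees -> epistemic uncertainty
    [0.60, 0.20, 0.10, 0.10],
]
total, aleatoric, epistemic = ensemble_uncertainty(probs)
print(f"total={total:.3f}  aleatoric={aleatoric:.3f}  epistemic={epistemic:.3f}")
```

In a real pipeline each row of `member_probs` would be obtained by applying one independently trained LoRA adapter to the shared base model and taking the softmax over the logits of the answer-option tokens; disagreement between members then shows up as mutual information, while per-member indecision shows up as expected entropy.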
