
Minimal Ranks, Maximum Confidence: Parameter-efficient Uncertainty Quantification for LoRA (2502.12122v1)

Published 17 Feb 2025 in cs.LG

Abstract: Low-Rank Adaptation (LoRA) enables parameter-efficient fine-tuning of LLMs by decomposing weight updates into low-rank matrices, significantly reducing storage and computational overhead. While effective, standard LoRA lacks mechanisms for uncertainty quantification, leading to overconfident and poorly calibrated models. Bayesian variants of LoRA address this limitation, but at the cost of a significantly increased number of trainable parameters, partially offsetting the original efficiency gains. Additionally, these models are harder to train and may suffer from unstable convergence. In this work, we propose a novel parameter-efficient Bayesian LoRA, demonstrating that effective uncertainty quantification can be achieved in very low-dimensional parameter spaces. The proposed method achieves strong performance with improved calibration and generalization while maintaining computational efficiency. Our empirical findings show that, with the appropriate projection of the weight space: (1) uncertainty can be effectively modeled in a low-dimensional space, and (2) weight covariances exhibit low ranks.
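To make the decomposition the abstract refers to concrete, below is a minimal sketch of a standard LoRA linear layer in PyTorch: the pretrained weight is frozen and the update is factored into two low-rank matrices. This illustrates plain LoRA only, not the paper's Bayesian variant; the class name `LoRALinear` and the `rank`/`alpha` values are illustrative choices, not taken from the paper.

```python
# Minimal LoRA sketch (standard LoRA, not the paper's Bayesian method).
# The effective weight is W' = W + (alpha / r) * B @ A, with W frozen.
import torch
import torch.nn as nn

class LoRALinear(nn.Module):
    def __init__(self, in_features: int, out_features: int, rank: int = 4, alpha: float = 8.0):
        super().__init__()
        # Frozen pretrained weight: only the low-rank factors are trained.
        self.weight = nn.Parameter(torch.empty(out_features, in_features), requires_grad=False)
        nn.init.normal_(self.weight, std=0.02)
        # Low-rank factors A (r x in) and B (out x r); B starts at zero so the
        # adapted model initially matches the pretrained one.
        self.lora_A = nn.Parameter(torch.randn(rank, in_features) * 0.01)
        self.lora_B = nn.Parameter(torch.zeros(out_features, rank))
        self.scaling = alpha / rank

    def forward(self, x: torch.Tensor) -> torch.Tensor:
        base = x @ self.weight.T
        update = (x @ self.lora_A.T) @ self.lora_B.T
        return base + self.scaling * update

# Example usage: the trainable parameter count scales with the rank r, which is
# why a posterior over these factors can live in a very low-dimensional space.
layer = LoRALinear(in_features=768, out_features=768, rank=4)
x = torch.randn(2, 768)
print(layer(x).shape)  # torch.Size([2, 768])
```

The point of the example is the parameter count: with rank r, each adapted layer trains only r(in + out) parameters, so placing a distribution over (a projection of) these factors, as the paper proposes, keeps uncertainty quantification low-dimensional.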


