
Bayesian Low-rank Adaptation for Large Language Models (2308.13111v5)

Published 24 Aug 2023 in cs.LG

Abstract: Low-rank adaptation (LoRA) has emerged as a new paradigm for cost-efficient fine-tuning of LLMs. However, fine-tuned LLMs often become overconfident, especially when fine-tuned on small datasets. Bayesian methods, with their inherent ability to estimate uncertainty, serve as potent tools to mitigate overconfidence and enhance calibration. In this work, we introduce Laplace-LoRA, which applies a Bayesian approach to the LoRA parameters. Specifically, Laplace-LoRA applies a Laplace approximation to the posterior over the LoRA parameters, considerably improving the calibration of fine-tuned LLMs.

Authors (4)
  1. Adam X. Yang
  2. Maxime Robeyns
  3. Xi Wang
  4. Laurence Aitchison
Citations (30)