Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo (2401.11665v3)

Published 22 Jan 2024 in stat.ML, cs.AI, and cs.LG

Abstract: Approximate Thompson sampling with Langevin Monte Carlo broadens its reach from Gaussian posterior sampling to encompass more general smooth posteriors. However, it still encounters scalability issues in high-dimensional problems when demanding high accuracy. To address this, we propose an approximate Thompson sampling strategy, utilizing underdamped Langevin Monte Carlo, where the latter is the go-to workhorse for simulations of high-dimensional posteriors. Based on the standard smoothness and log-concavity conditions, we study the accelerated posterior concentration and sampling using a specific potential function. This design improves the sample complexity for realizing logarithmic regrets from $\mathcal{\tilde O}(d)$ to $\mathcal{\tilde O}(\sqrt{d})$. The scalability and robustness of our algorithm are also empirically validated through synthetic experiments in high-dimensional bandit problems.

Authors (4)

Haoyang Zheng (8 papers)
Wei Deng (65 papers)
Christian Moya (19 papers)
Guang Lin (128 papers)

Citations (4)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Tweets

https://twitter.com/StatMLPapers/status/1749640355975205268

Accelerating Approximate Thompson Sampling with Underdamped Langevin Monte Carlo (2401.11665v3)

Summary

Related Papers

Tweets