Batch, match, and patch: low-rank approximations for score-based variational inference (2410.22292v2)
Abstract: Black-box variational inference (BBVI) scales poorly to high-dimensional problems when it is used to estimate a multivariate Gaussian approximation with a full covariance matrix. In this paper, we extend the batch-and-match (BaM) framework for score-based BBVI to problems where it is prohibitively expensive to store such covariance matrices, let alone to estimate them. Unlike classical algorithms for BBVI, which use stochastic gradient descent to minimize the reverse Kullback-Leibler divergence, BaM uses more specialized updates to match the scores of the target density and its Gaussian approximation. We extend the updates for BaM by integrating them with a more compact parameterization of full covariance matrices. In particular, borrowing ideas from factor analysis, we add an extra step to each iteration of BaM--a patch--that projects each newly updated covariance matrix into a more efficiently parameterized family of diagonal plus low rank matrices. We evaluate this approach on a variety of synthetic target distributions and real-world problems in high-dimensional inference.
- Statistical and computational trade-offs in variational inference: A case study in inferential model selection. arXiv eprint 2207.11208, 2022.
- Variational inference: A review for statisticians. Journal of the American Statistical Association, 112(518):859–877, 2017.
- JAX: composable transformations of Python+NumPy programs, 2018.
- EigenVI: score-based variational inference with orthogonal function expansions. In Advances in Neural Information Processing Systems, to appear, 2024a.
- Batch and match: black-box variational inference with a score-based divergence. In International Conference on Machine Learning, 2024b.
- Hierarchical Bayesian analysis of changepoint problems. Journal of the Royal Statistical Society: Series C (Applied Statistics), 41(2):389–405, 1992.
- A. Gelman and J. Hill. Data analysis using regression and multilevel/hierarchical models. Cambridge University Press, 2006.
- Z. Ghahramani and G. E. Hinton. The EM algorithm for mixtures of factor analyzers. Technical report, Technical Report CRG-TR-96-1, University of Toronto, 1996.
- Structured stochastic variational inference. In Artificial Intelligence and Statistics, pages 361–369, 2015.
- An introduction to variational methods for graphical models. Machine Learning, 37:183–233, 1999.
- D. P. Kingma and J. Ba. Adam: A method for stochastic optimization. In International Conference on Learning Representations, 2015.
- D. P. Kingma and M. Welling. Auto-encoding variational Bayes. In International Conference on Learning Representations, 2014.
- Provably scalable black-box variational inference with structured variational families. In International Conference on Machine Learning, 2024.
- Automatic differentiation variational inference. Journal of Machine Learning Research, 2017.
- Variational boosting: Iteratively refining posterior approximations. In International Conference on Machine Learning, pages 2420–2429. PMLR, 2017.
- Variational inference with Gaussian score matching. Advances in Neural Information Processing Systems, 36, 2023.
- Log Gaussian Cox processes. Scandinavian Journal of Statistics, 25(3):451–482, 1998.
- Elliptical slice sampling. In Artificial Intelligence and Statistics, volume 9, pages 541–548, 2010.
- Automated variational inference for Gaussian process models. Advances in Neural Information Processing Systems, 27, 2014.
- Gaussian variational approximation with a factor covariance structure. Journal of Computational and Graphical Statistics, 27(3):465–478, 2018.
- The matrix cookbook. Technical University of Denmark, 7(15):510, 2008.
- Black box variational inference. In Artificial Intelligence and Statistics, pages 814–822. PMLR, 2014.
- EM algorithms for ML factor analysis. Psychometrika, 47:69–76, 1982.
- L. Saul and M. Jordan. Exploiting tractable substructures in intractable networks. Advances in Neural Information processing Systems, 8, 1995.
- Maximum likelihood and minimum classification error factor analysis for automatic speech recognition. IEEE Transactions on Speech and Audio Processing, 8(2):115–125, 2000.
- Gaussian variational approximation with sparse precision matrices. Statistics and Computing, 28:259–275, 2018.
- M. Titsias and M. Lázaro-Gredilla. Doubly stochastic variational Bayes for non-conjugate inference. In International Conference on Machine Learning. PMLR, 2014.
- Efficient low rank Gaussian variational inference for neural networks. Advances in Neural Information Processing Systems, 33:4610–4622, 2020.
- Graphical models, exponential families, and variational inference. Foundations and Trends® in Machine Learning, 1(1–2):1–305, 2008.
- Variational approximations using Fisher divergence. arXiv preprint arXiv:1905.05284, 2019.
- Variational Hamiltonian Monte Carlo via score matching. Bayesian Analysis, 13(2):485, 2018.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.