Papers
Topics
Authors
Recent
AI Research Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 80 tok/s
Gemini 2.5 Pro 49 tok/s Pro
GPT-5 Medium 26 tok/s Pro
GPT-5 High 32 tok/s Pro
GPT-4o 92 tok/s Pro
Kimi K2 182 tok/s Pro
GPT OSS 120B 438 tok/s Pro
Claude Sonnet 4 38 tok/s Pro
2000 character limit reached

Structured Partial Stochasticity in Bayesian Neural Networks (2405.17666v2)

Published 27 May 2024 in stat.ML and cs.LG

Abstract: Bayesian neural network posterior distributions have a great number of modes that correspond to the same network function. The abundance of such modes can make it difficult for approximate inference methods to do their job. Recent work has demonstrated the benefits of partial stochasticity for approximate inference in Bayesian neural networks; inference can be less costly and performance can sometimes be improved. I propose a structured way to select the deterministic subset of weights that removes neuron permutation symmetries, and therefore the corresponding redundant posterior modes. With a drastically simplified posterior distribution, the performance of existing approximate inference schemes is found to be greatly improved.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (22)
  1. Git re-basin: Merging models modulo permutation symmetries, 2023.
  2. Weight uncertainty in neural networks, 2015.
  3. On the geometry of feedforward neural network error surfaces. Neural Computation, 5(6):910–927, 1993. 10.1162/neco.1993.5.6.910.
  4. Wide mean-field bayesian neural networks ignore the data, 2022.
  5. Bayesian deep learning via subnetwork inference. In Marina Meila and Tong Zhang, editors, Proceedings of the 38th International Conference on Machine Learning, volume 139 of Proceedings of Machine Learning Research, pages 2510–2521. PMLR, 18–24 Jul 2021. URL https://proceedings.mlr.press/v139/daxberger21a.html.
  6. Efficient and scalable bayesian neural nets with rank-1 factors, 2020.
  7. The role of permutation invariance in linear mode connectivity of neural networks, 2022.
  8. Auto-Encoding Variational Bayes. In 2nd International Conference on Learning Representations, ICLR 2014, Banff, AB, Canada, April 14-16, 2014, Conference Track Proceedings, 2014.
  9. Being bayesian, even just a bit, fixes overconfidence in relu networks, 2020.
  10. On symmetries in variational bayesian neural nets. In NeurIPS 2021 Workshop on Bayesian Deep Learning, 2021. URL https://www.amazon.science/publications/on-symmetries-in-variational-bayesian-neural-nets.
  11. On the detrimental effect of invariances in the likelihood for variational inference, 2022.
  12. Simple and scalable predictive uncertainty estimation using deep ensembles, 2017.
  13. Radford Neal. Bayesian learning via stochastic dynamics. In S. Hanson, J. Cowan, and C. Giles, editors, Advances in Neural Information Processing Systems, volume 5. Morgan-Kaufmann, 1992. URL https://proceedings.neurips.cc/paper_files/paper/1992/file/f29c21d4897f78948b91f03172341b7b-Paper.pdf.
  14. Benchmarking the neural linear model for regression, 2019.
  15. Position paper: Bayesian deep learning in the age of large-scale ai, 2024.
  16. Improving the identifiability of neural networks for bayesian inference. 2017. URL https://api.semanticscholar.org/CorpusID:46932278.
  17. On permutation symmetries in bayesian neural network posteriors: a variational perspective, 2023.
  18. Do bayesian neural networks need to be fully stochastic? In Francisco Ruiz, Jennifer Dy, and Jan-Willem van de Meent, editors, Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, volume 206 of Proceedings of Machine Learning Research, pages 7694–7722. PMLR, 25–27 Apr 2023. URL https://proceedings.mlr.press/v206/sharma23a.html.
  19. Connecting the dots: Is mode-connectedness the key to feasible sample-based inference in bayesian neural networks?, 2024.
  20. Energy Efficiency. UCI Machine Learning Repository, 2012. DOI: https://doi.org/10.24432/C51307.
  21. Towards efficient mcmc sampling in bayesian neural networks by exploiting symmetry, 2023.
  22. A compact representation for bayesian neural networks by removing permutation symmetry, 2023.

Summary

We haven't generated a summary for this paper yet.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (1)

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

This paper has been mentioned in 3 posts and received 15 likes.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube