Sensitivity-Aware Amortized Bayesian Inference (2310.11122v6)
Abstract: Sensitivity analyses reveal the influence of various modeling choices on the outcomes of statistical analyses. While theoretically appealing, they are overwhelmingly inefficient for complex Bayesian models. In this work, we propose sensitivity-aware amortized Bayesian inference (SA-ABI), a multifaceted approach to efficiently integrate sensitivity analyses into simulation-based inference with neural networks. First, we utilize weight sharing to encode the structural similarities between alternative likelihood and prior specifications in the training process with minimal computational overhead. Second, we leverage the rapid inference of neural networks to assess sensitivity to data perturbations and preprocessing steps. In contrast to most other Bayesian approaches, both steps circumvent the costly bottleneck of refitting the model for each choice of likelihood, prior, or data set. Finally, we propose to use deep ensembles to detect sensitivity arising from unreliable approximation (e.g., due to model misspecification). We demonstrate the effectiveness of our method in applied modeling problems, ranging from disease outbreak dynamics and global warming thresholds to human decision-making. Our results support sensitivity-aware inference as a default choice for amortized Bayesian workflows, automatically providing modelers with insights into otherwise hidden dimensions.
- Syed Mumtaz Ali and Samuel D Silvey “A general class of coefficients of divergence of one distribution from another” In Journal of the Royal Statistical Society: Series B (Methodological) 28.1 Wiley Online Library, 1966, pp. 131–142
- “Neural networks enable efficient and accurate simulation-based inference of evolutionary parameters from adaptation dynamics” In PLoS Biology 20.5 Public Library of Science San Francisco, CA USA, 2022, pp. e3001633
- Oleksandr Balabanov, Bernhard Mehlig and Hampus Linander “Bayesian Posterior Approximation With Stochastic Ensembles” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE, 2023 DOI: 10.1109/cvpr52729.2023.01317
- Gilad Baruch, Moran Baruch and Yoav Goldberg “A Little Is Enough: Circumventing Defenses For Distributed Learning” In Advances in Neural Information Processing Systems 32 Curran Associates, Inc., 2019 URL: https://proceedings.neurips.cc/paper_files/paper/2019/file/ec1c59141046cd1866bbbcdfb6ae31d4-Paper.pdf
- Samuel J Bell and Onno P Kampman “Perspectives on Machine Learning from Psychology’s Reproducibility Crisis” In arXiv preprint arXiv:2104.08878, 2021
- “Modeling the Machine Learning Multiverse” In Advances in Neural Information Processing Systems 35, 2022, pp. 18416–18429
- “Curriculum learning” In Proceedings of the 26th annual international conference on machine learning, 2009, pp. 41–48
- José M Bernardo and Adrian FM Smith “Bayesian Theory” John Wiley & Sons, 2000
- “Julia: A fresh approach to numerical computing” In SIAM review 59.1 SIAM, 2017, pp. 65–98
- Battista Biggio, Blaine Nelson and Pavel Laskov “Poisoning Attacks against Support Vector Machines” In Proceedings of the 29th International Coference on International Conference on Machine Learning, ICML’12 Edinburgh, Scotland: Omnipress, 2012, pp. 1467–1474
- Benjamin Bloem-Reddy and Yee Whye Teh “Probabilistic Symmetries and Invariant Neural Networks.” In J. Mach. Learn. Res. 21, 2020, pp. 90–1
- “Estimating across-trial variability parameters of the Diffusion Decision Model: Expert advice and recommendations” In Journal of Mathematical Psychology 87, 2018, pp. 46–75
- Paul-Christian Bürkner, Maximilian Scholz and Stefan T Radev “Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy” In arXiv preprint arXiv:2209.02439, 2022
- Patrick Cannon, Daniel Ward and Sebastian M Schmon “Investigating the impact of model misspecification in neural simulation-based inference” In arXiv preprint arXiv:2209.01845, 2022
- “A likelihood-free inference framework for population genetic data using exchangeable neural networks” In Advances in neural information processing systems 31, 2018
- Kyle Cranmer, Johann Brehmer and Gilles Louppe “The frontier of simulation-based inference” In Proceedings of the National Academy of Sciences 117.48 National Acad Sciences, 2020, pp. 30055–30062
- Imre Csiszár “Eine informationstheoretische ungleichung und ihre anwendung auf beweis der ergodizitaet von markoffschen ketten” In Magyer Tud. Akad. Mat. Kutato Int. Koezl. 8, 1964, pp. 85–108
- “Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions” In Science 369.6500 American Association for the Advancement of Science, 2020, pp. eabb9789
- “Robust Bayesian inference for simulator-based models via the MMD posterior bootstrap” In International Conference on Artificial Intelligence and Statistics, 2022, pp. 943–970 PMLR
- Sarah Depaoli, Sonja D. Winter and Marieke Visser “The Importance of Prior Sensitivity Analysis in Bayesian Statistics: Demonstrations Using an Interactive Shiny App” In Frontiers in Psychology 11 Frontiers Media SA, 2020 DOI: 10.3389/fpsyg.2020.608045
- Noah S. Diffenbaugh and Elizabeth A. Barnes “Data-Driven Predictions of the Time Remaining until Critical Global Warming Thresholds Are Reached” In Proceedings of the National Academy of Sciences 120.6 Proceedings of the National Academy of Sciences, 2023, pp. e2207183120 DOI: 10.1073/pnas.2207183120
- “A deep learning method for comparing bayesian hierarchical models” In arXiv preprint arXiv:2301.11873, 2023
- “Bayesian Data Analysis (3rd Edition)” ChapmanHall/CRC, 2013
- Andrew Gelman, Jessica Hwang and Aki Vehtari “Understanding predictive information criteria for Bayesian models” In Statistics and computing 24.6 Springer, 2014, pp. 997–1016
- “Bayesian workflow” In arXiv preprint arXiv:2011.01808, 2020
- “A Swiss Army Infinitesimal Jackknife” In Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics 89, Proceedings of Machine Learning Research PMLR, 2019, pp. 1139–1147
- Manuel Glöckler, Michael Deistler and Jakob H Macke “Adversarial robustness of amortized Bayesian inference” In arXiv preprint arXiv:2305.14984, 2023
- Tilmann Gneiting and Adrian E Raftery “Strictly proper scoring rules, prediction, and estimation” In Journal of the American statistical Association 102.477 Taylor & Francis, 2007, pp. 359–378
- “Training deep neural density estimators to identify mechanistic models of neural dynamics” In Elife 9 eLife Sciences Publications Limited, 2020, pp. e56261
- Ian Goodfellow, Jonathon Shlens and Christian Szegedy “Explaining and Harnessing Adversarial Examples” In International Conference on Learning Representations, 2015 URL: http://arxiv.org/abs/1412.6572
- David Greenberg, Marcel Nonnenmacher and Jakob Macke “Automatic posterior transformation for likelihood-free inference” In International Conference on Machine Learning, 2019, pp. 2404–2414 PMLR
- “A Kernel Two-Sample Test” In The Journal of Machine Learning Research 13, 2012, pp. 723–773
- Ken Gu, Eunice Jun and Tim Althoff “Understanding and Supporting Debugging Workflows in Multiverse Analysis” In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems ACM, 2023 DOI: 10.1145/3544548.3581099
- “On calibration of modern neural networks” In International Conference on Machine Learning, 2017, pp. 1321–1330 PMLR
- “A Survey of Tasks and Visualizations in Multiverse Analysis Reports” In Computer Graphics Forum 41.1 Wiley, 2022, pp. 402–426 DOI: 10.1111/cgf.14443
- “The Potential to Narrow Uncertainty in Regional Climate Predictions” In Bulletin of the American Meteorological Society 90.8 American Meteorological Society, 2009, pp. 1095–1108 DOI: 10.1175/2009BAMS2607.1
- “Learning Robust Statistics for Simulation-based Inference under Model Misspecification” In arXiv preprint arXiv:2305.15871, 2023
- David Rios Insua and Fabrizio Ruggeri “Robust Bayesian Analysis” Berlin Heidelberg: Springer Science & Business Media, 2012
- “What are Bayesian neural network posteriors really like?” In International conference on machine learning, 2021, pp. 4629–4640 PMLR
- Louis A Jaeckel “The Infinitesimal Jackknife”, 1972
- “Projections of When Temperature Change Will Exceed 2 °C above Pre-Industrial Levels” In Nature Climate Change 1.8, 2011, pp. 407–412 DOI: 10.1038/nclimate1261
- “Detecting and diagnosing prior and likelihood sensitivity with power-scaling” In arXiv preprint arXiv:2107.14054, 2021
- Diederik P. Kingma and Jimmy Lei Ba “Adam: A method for stochastic optimization” In 3rd International Conference on Learning Representations, 2015, pp. 1–15
- Balaji Lakshminarayanan, Alexander Pritzel and Charles Blundell “Simple and scalable predictive uncertainty estimation using deep ensembles” In Advances in neural information processing systems 30, 2017
- “Future global climate: scenario-based projections and near-term information” In Climate change 2021: The physical science basis. Contribution of working group I to the sixth assessment report of the intergovernmental panel on climate change Cambridge University Press, 2021, pp. 553–672
- “Model complexity in diffusion modeling: Benefits of making the model more parsimonious” In Frontiers in Psychology 7.1324, 2016, pp. 1–14
- “On divergences and informations in statistics and information theory” In IEEE Transactions on Information Theory 52.10 IEEE, 2006, pp. 4394–4412
- Yang Liu, Tim Althoff and Jeffrey Heer “Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis” In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems ACM, 2020 DOI: 10.1145/3313831.3376533
- David MacKay “Information theory, inference and learning algorithms” Cambridge University Press, 2003
- “Tuning into the real effect of smartphone use on parenting: a multiverse analysis” In Journal of Child Psychology and Psychiatry 61.8 Wiley, 2020, pp. 855–865 DOI: 10.1111/jcpp.13282
- “Sensitivity Analysis of Simulation-Based Inference for Galaxy Clustering” In arXiv preprint arXiv:2309.15071, 2023
- “Density of states estimation for out of distribution detection” In International Conference on Artificial Intelligence and Statistics, 2021, pp. 3232–3240 PMLR
- “Detecting out-of-distribution inputs to deep generative models using typicality” In arXiv preprint arXiv:1906.02994, 2019
- Radford M Neal “MCMC using Hamiltonian dynamics” In Handbook of markov chain monte carlo ChapmanHall/CRC, 2011
- OSC “Estimating the reproducibility of psychological science” Published by the Open Science Collaboration (OSC) In Science 349.6251 American Association for the Advancement of Science (AAAS), 2015 DOI: 10.1126/science.aac4716
- “Generalized Bayesian likelihood-free inference using scoring rules estimators” In arXiv preprint arXiv:2104.03889, 2021
- “Deep Learning for Anomaly Detection” In ACM Computing Surveys 54.2 Association for Computing Machinery (ACM), 2021, pp. 1–38 DOI: 10.1145/3439950
- “Reliable ABC model choice via random forests” In Bioinformatics 32.6 Oxford University Press, 2016, pp. 859–866
- “BayesFlow: Learning complex stochastic models with invertible neural networks” In IEEE transactions on neural networks and learning systems IEEE, 2020
- “Amortized bayesian model comparison with evidential deep learning” In IEEE Transactions on Neural Networks and Learning Systems IEEE, 2021
- “OutbreakFlow: Model-based Bayesian inference of disease outbreak dynamics with invertible neural networks and its application to the COVID-19 pandemics in Germany” In PLoS computational biology Public Library of Science San Francisco, CA USA, 2021
- “BayesFlow: Amortized Bayesian Workflows With Neural Networks” In arXiv preprint arXiv:2306.16015, 2023
- “JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models” In Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence 216, Proceedings of Machine Learning Research PMLR, 2023, pp. 1695–1706
- Roger Ratcliff “A theory of memory retrieval.” In Psychological review 85.2 American Psychological Association, 1978, pp. 59
- “Diffusion decision model: Current issues and history” In Trends in cognitive sciences 20.4 Elsevier, 2016, pp. 260–281
- “The Shared Socioeconomic Pathways and Their Energy, Land Use, and Greenhouse Gas Emissions Implications: An Overview” In Global Environmental Change 42, 2017, pp. 153–168 DOI: 10.1016/j.gloenvcha.2016.05.009
- “Sensitivity Analysis for Bayesian Hierarchical Models” In Bayesian Analysis 10.2 International Society for Bayesian Analysis, 2015, pp. 321–349
- Teemu Säilynoja, Paul-Christian Bürkner and Aki Vehtari “Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison” In Statistics and Computing 32.2 Springer, 2022, pp. 1–21
- Marvin Schmitt, Paul-Christian Bürkner and Köthe “Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks” In Proceedings of the German Conference on Pattern Recognition (GCPR), 2023
- Cornelius Schröder and Jakob H Macke “Simultaneous identification of models and parameters of scientific simulators” In arXiv preprint arXiv:2305.15174, 2023
- Joseph P. Simmons, Leif D. Nelson and Uri Simonsohn “False-Positive Psychology” In Psychological Science 22.11 SAGE Publications, 2011, pp. 1359–1366 DOI: 10.1177/0956797611417632
- “Increasing Transparency Through a Multiverse Analysis” In Perspectives on Psychological Science 11.5 SAGE Publications, 2016, pp. 702–712 DOI: 10.1177/1745691616658637
- “Validating Bayesian inference algorithms with simulation-based calibration” In arXiv preprint arXiv:1804.06788, 2018
- Aleksei Tiulpin and Matthew B. Blaschko “Greedy Bayesian Posterior Approximation with Deep Ensembles” In Transactions on Machine Learning Research, 2022 URL: https://openreview.net/forum?id=P1DuPJzVTN
- Aki Vehtari, Andrew Gelman and Jonah Gabry “Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC” In Statistics and computing 27 Springer, 2017, pp. 1413–1432
- “Pareto smoothed importance sampling” In arXiv preprint arXiv:1507.02646, 2022
- “Sequential sampling models with variable boundaries and non-normal noise: A comparison of six models” In Psychonomic bulletin & review 26.3 Springer, 2019, pp. 813–832
- Eric-Jan Wagenmakers, Alexandra Sarafoglou and Balazs Aczel “One statistical analysis must not rule them all” In Nature 605.7910 Nature Publishing Group UK London, 2022, pp. 423–425
- “Robust neural posterior estimation and statistical model criticism” In Advances in Neural Information Processing Systems 35, 2022, pp. 33845–33859
- “Hyperparameter ensembles for robustness and uncertainty quantification” In Advances in Neural Information Processing Systems 33, 2020, pp. 6514–6527
- “Degrees of Freedom in Planning, Running, Analyzing, and Reporting Psychological Studies: A Checklist to Avoid p-Hacking” In Frontiers in Psychology 7 Frontiers Media SA, 2016 DOI: 10.3389/fpsyg.2016.01832
- Eva Marie Wieschen, Andreas Voss and Stefan Radev “Jumping to conclusion? a lévy flight model of decision making” In The Quantitative Methods for Psychology 16.2, 2020, pp. 120–132
- Andrew G Wilson and Pavel Izmailov “Bayesian deep learning and a probabilistic perspective of generalization” In Advances in neural information processing systems 33, 2020, pp. 4697–4708
- “Partially exchangeable networks and architectures for learning summary statistics in approximate Bayesian computation” In International Conference on Machine Learning, 2019, pp. 6798–6807 PMLR
- “Meta-Amortized Variational Inference and Learning” In Proceedings of the AAAI Conference on Artificial Intelligence 34.04 Association for the Advancement of Artificial Intelligence (AAAI), 2020, pp. 6404–6412 DOI: 10.1609/aaai.v34i04.6111
- “Deep sets” In Advances in neural information processing systems 30, 2017
- “Causes of Higher Climate Sensitivity in CMIP6 Models” In Geophysical Research Letters 47.1, 2020, pp. e2019GL085782 DOI: 10.1029/2019GL085782
- “A Comprehensive Survey on Transfer Learning” In Proceedings of the IEEE 109.1 Institute of ElectricalElectronics Engineers (IEEE), 2021, pp. 43–76 DOI: 10.1109/jproc.2020.3004555
- Janis H. Zickfeld and Thomas W. Schubert “How to Identify and How to Conduct Research that Is Informative and Reproducible” In Social Philosophy of Science for the Social Sciences Springer International Publishing, 2019, pp. 147–168 DOI: 10.1007/978-3-030-33099-6˙9
Collections
Sign up for free to add this paper to one or more collections.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.