Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash 87 tok/s
Gemini 2.5 Pro 60 tok/s Pro
GPT-5 Medium 28 tok/s
GPT-5 High 28 tok/s Pro
GPT-4o 81 tok/s
GPT OSS 120B 453 tok/s Pro
Kimi K2 229 tok/s Pro
2000 character limit reached

Sensitivity-Aware Amortized Bayesian Inference (2310.11122v6)

Published 17 Oct 2023 in stat.ML, cs.LG, and stat.ME

Abstract: Sensitivity analyses reveal the influence of various modeling choices on the outcomes of statistical analyses. While theoretically appealing, they are overwhelmingly inefficient for complex Bayesian models. In this work, we propose sensitivity-aware amortized Bayesian inference (SA-ABI), a multifaceted approach to efficiently integrate sensitivity analyses into simulation-based inference with neural networks. First, we utilize weight sharing to encode the structural similarities between alternative likelihood and prior specifications in the training process with minimal computational overhead. Second, we leverage the rapid inference of neural networks to assess sensitivity to data perturbations and preprocessing steps. In contrast to most other Bayesian approaches, both steps circumvent the costly bottleneck of refitting the model for each choice of likelihood, prior, or data set. Finally, we propose to use deep ensembles to detect sensitivity arising from unreliable approximation (e.g., due to model misspecification). We demonstrate the effectiveness of our method in applied modeling problems, ranging from disease outbreak dynamics and global warming thresholds to human decision-making. Our results support sensitivity-aware inference as a default choice for amortized Bayesian workflows, automatically providing modelers with insights into otherwise hidden dimensions.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (89)
  1. Syed Mumtaz Ali and Samuel D Silvey “A general class of coefficients of divergence of one distribution from another” In Journal of the Royal Statistical Society: Series B (Methodological) 28.1 Wiley Online Library, 1966, pp. 131–142
  2. “Neural networks enable efficient and accurate simulation-based inference of evolutionary parameters from adaptation dynamics” In PLoS Biology 20.5 Public Library of Science San Francisco, CA USA, 2022, pp. e3001633
  3. Oleksandr Balabanov, Bernhard Mehlig and Hampus Linander “Bayesian Posterior Approximation With Stochastic Ensembles” In 2023 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) IEEE, 2023 DOI: 10.1109/cvpr52729.2023.01317
  4. Gilad Baruch, Moran Baruch and Yoav Goldberg “A Little Is Enough: Circumventing Defenses For Distributed Learning” In Advances in Neural Information Processing Systems 32 Curran Associates, Inc., 2019 URL: https://proceedings.neurips.cc/paper_files/paper/2019/file/ec1c59141046cd1866bbbcdfb6ae31d4-Paper.pdf
  5. Samuel J Bell and Onno P Kampman “Perspectives on Machine Learning from Psychology’s Reproducibility Crisis” In arXiv preprint arXiv:2104.08878, 2021
  6. “Modeling the Machine Learning Multiverse” In Advances in Neural Information Processing Systems 35, 2022, pp. 18416–18429
  7. “Curriculum learning” In Proceedings of the 26th annual international conference on machine learning, 2009, pp. 41–48
  8. José M Bernardo and Adrian FM Smith “Bayesian Theory” John Wiley & Sons, 2000
  9. “Julia: A fresh approach to numerical computing” In SIAM review 59.1 SIAM, 2017, pp. 65–98
  10. Battista Biggio, Blaine Nelson and Pavel Laskov “Poisoning Attacks against Support Vector Machines” In Proceedings of the 29th International Coference on International Conference on Machine Learning, ICML’12 Edinburgh, Scotland: Omnipress, 2012, pp. 1467–1474
  11. Benjamin Bloem-Reddy and Yee Whye Teh “Probabilistic Symmetries and Invariant Neural Networks.” In J. Mach. Learn. Res. 21, 2020, pp. 90–1
  12. “Estimating across-trial variability parameters of the Diffusion Decision Model: Expert advice and recommendations” In Journal of Mathematical Psychology 87, 2018, pp. 46–75
  13. Paul-Christian Bürkner, Maximilian Scholz and Stefan T Radev “Some models are useful, but how do we know which ones? Towards a unified Bayesian model taxonomy” In arXiv preprint arXiv:2209.02439, 2022
  14. Patrick Cannon, Daniel Ward and Sebastian M Schmon “Investigating the impact of model misspecification in neural simulation-based inference” In arXiv preprint arXiv:2209.01845, 2022
  15. “A likelihood-free inference framework for population genetic data using exchangeable neural networks” In Advances in neural information processing systems 31, 2018
  16. Kyle Cranmer, Johann Brehmer and Gilles Louppe “The frontier of simulation-based inference” In Proceedings of the National Academy of Sciences 117.48 National Acad Sciences, 2020, pp. 30055–30062
  17. Imre Csiszár “Eine informationstheoretische ungleichung und ihre anwendung auf beweis der ergodizitaet von markoffschen ketten” In Magyer Tud. Akad. Mat. Kutato Int. Koezl. 8, 1964, pp. 85–108
  18. “Inferring change points in the spread of COVID-19 reveals the effectiveness of interventions” In Science 369.6500 American Association for the Advancement of Science, 2020, pp. eabb9789
  19. “Robust Bayesian inference for simulator-based models via the MMD posterior bootstrap” In International Conference on Artificial Intelligence and Statistics, 2022, pp. 943–970 PMLR
  20. Sarah Depaoli, Sonja D. Winter and Marieke Visser “The Importance of Prior Sensitivity Analysis in Bayesian Statistics: Demonstrations Using an Interactive Shiny App” In Frontiers in Psychology 11 Frontiers Media SA, 2020 DOI: 10.3389/fpsyg.2020.608045
  21. Noah S. Diffenbaugh and Elizabeth A. Barnes “Data-Driven Predictions of the Time Remaining until Critical Global Warming Thresholds Are Reached” In Proceedings of the National Academy of Sciences 120.6 Proceedings of the National Academy of Sciences, 2023, pp. e2207183120 DOI: 10.1073/pnas.2207183120
  22. “A deep learning method for comparing bayesian hierarchical models” In arXiv preprint arXiv:2301.11873, 2023
  23. “Bayesian Data Analysis (3rd Edition)” ChapmanHall/CRC, 2013
  24. Andrew Gelman, Jessica Hwang and Aki Vehtari “Understanding predictive information criteria for Bayesian models” In Statistics and computing 24.6 Springer, 2014, pp. 997–1016
  25. “Bayesian workflow” In arXiv preprint arXiv:2011.01808, 2020
  26. “A Swiss Army Infinitesimal Jackknife” In Proceedings of the Twenty-Second International Conference on Artificial Intelligence and Statistics 89, Proceedings of Machine Learning Research PMLR, 2019, pp. 1139–1147
  27. Manuel Glöckler, Michael Deistler and Jakob H Macke “Adversarial robustness of amortized Bayesian inference” In arXiv preprint arXiv:2305.14984, 2023
  28. Tilmann Gneiting and Adrian E Raftery “Strictly proper scoring rules, prediction, and estimation” In Journal of the American statistical Association 102.477 Taylor & Francis, 2007, pp. 359–378
  29. “Training deep neural density estimators to identify mechanistic models of neural dynamics” In Elife 9 eLife Sciences Publications Limited, 2020, pp. e56261
  30. Ian Goodfellow, Jonathon Shlens and Christian Szegedy “Explaining and Harnessing Adversarial Examples” In International Conference on Learning Representations, 2015 URL: http://arxiv.org/abs/1412.6572
  31. David Greenberg, Marcel Nonnenmacher and Jakob Macke “Automatic posterior transformation for likelihood-free inference” In International Conference on Machine Learning, 2019, pp. 2404–2414 PMLR
  32. “A Kernel Two-Sample Test” In The Journal of Machine Learning Research 13, 2012, pp. 723–773
  33. Ken Gu, Eunice Jun and Tim Althoff “Understanding and Supporting Debugging Workflows in Multiverse Analysis” In Proceedings of the 2023 CHI Conference on Human Factors in Computing Systems ACM, 2023 DOI: 10.1145/3544548.3581099
  34. “On calibration of modern neural networks” In International Conference on Machine Learning, 2017, pp. 1321–1330 PMLR
  35. “A Survey of Tasks and Visualizations in Multiverse Analysis Reports” In Computer Graphics Forum 41.1 Wiley, 2022, pp. 402–426 DOI: 10.1111/cgf.14443
  36. “The Potential to Narrow Uncertainty in Regional Climate Predictions” In Bulletin of the American Meteorological Society 90.8 American Meteorological Society, 2009, pp. 1095–1108 DOI: 10.1175/2009BAMS2607.1
  37. “Learning Robust Statistics for Simulation-based Inference under Model Misspecification” In arXiv preprint arXiv:2305.15871, 2023
  38. David Rios Insua and Fabrizio Ruggeri “Robust Bayesian Analysis” Berlin Heidelberg: Springer Science & Business Media, 2012
  39. “What are Bayesian neural network posteriors really like?” In International conference on machine learning, 2021, pp. 4629–4640 PMLR
  40. Louis A Jaeckel “The Infinitesimal Jackknife”, 1972
  41. “Projections of When Temperature Change Will Exceed 2 °C above Pre-Industrial Levels” In Nature Climate Change 1.8, 2011, pp. 407–412 DOI: 10.1038/nclimate1261
  42. “Detecting and diagnosing prior and likelihood sensitivity with power-scaling” In arXiv preprint arXiv:2107.14054, 2021
  43. Diederik P. Kingma and Jimmy Lei Ba “Adam: A method for stochastic optimization” In 3rd International Conference on Learning Representations, 2015, pp. 1–15
  44. Balaji Lakshminarayanan, Alexander Pritzel and Charles Blundell “Simple and scalable predictive uncertainty estimation using deep ensembles” In Advances in neural information processing systems 30, 2017
  45. “Future global climate: scenario-based projections and near-term information” In Climate change 2021: The physical science basis. Contribution of working group I to the sixth assessment report of the intergovernmental panel on climate change Cambridge University Press, 2021, pp. 553–672
  46. “Model complexity in diffusion modeling: Benefits of making the model more parsimonious” In Frontiers in Psychology 7.1324, 2016, pp. 1–14
  47. “On divergences and informations in statistics and information theory” In IEEE Transactions on Information Theory 52.10 IEEE, 2006, pp. 4394–4412
  48. Yang Liu, Tim Althoff and Jeffrey Heer “Paths Explored, Paths Omitted, Paths Obscured: Decision Points & Selective Reporting in End-to-End Data Analysis” In Proceedings of the 2020 CHI Conference on Human Factors in Computing Systems ACM, 2020 DOI: 10.1145/3313831.3376533
  49. David MacKay “Information theory, inference and learning algorithms” Cambridge University Press, 2003
  50. “Tuning into the real effect of smartphone use on parenting: a multiverse analysis” In Journal of Child Psychology and Psychiatry 61.8 Wiley, 2020, pp. 855–865 DOI: 10.1111/jcpp.13282
  51. “Sensitivity Analysis of Simulation-Based Inference for Galaxy Clustering” In arXiv preprint arXiv:2309.15071, 2023
  52. “Density of states estimation for out of distribution detection” In International Conference on Artificial Intelligence and Statistics, 2021, pp. 3232–3240 PMLR
  53. “Detecting out-of-distribution inputs to deep generative models using typicality” In arXiv preprint arXiv:1906.02994, 2019
  54. Radford M Neal “MCMC using Hamiltonian dynamics” In Handbook of markov chain monte carlo ChapmanHall/CRC, 2011
  55. OSC “Estimating the reproducibility of psychological science” Published by the Open Science Collaboration (OSC) In Science 349.6251 American Association for the Advancement of Science (AAAS), 2015 DOI: 10.1126/science.aac4716
  56. “Generalized Bayesian likelihood-free inference using scoring rules estimators” In arXiv preprint arXiv:2104.03889, 2021
  57. “Deep Learning for Anomaly Detection” In ACM Computing Surveys 54.2 Association for Computing Machinery (ACM), 2021, pp. 1–38 DOI: 10.1145/3439950
  58. “Reliable ABC model choice via random forests” In Bioinformatics 32.6 Oxford University Press, 2016, pp. 859–866
  59. “BayesFlow: Learning complex stochastic models with invertible neural networks” In IEEE transactions on neural networks and learning systems IEEE, 2020
  60. “Amortized bayesian model comparison with evidential deep learning” In IEEE Transactions on Neural Networks and Learning Systems IEEE, 2021
  61. “OutbreakFlow: Model-based Bayesian inference of disease outbreak dynamics with invertible neural networks and its application to the COVID-19 pandemics in Germany” In PLoS computational biology Public Library of Science San Francisco, CA USA, 2021
  62. “BayesFlow: Amortized Bayesian Workflows With Neural Networks” In arXiv preprint arXiv:2306.16015, 2023
  63. “JANA: Jointly Amortized Neural Approximation of Complex Bayesian Models” In Proceedings of the 39th Conference on Uncertainty in Artificial Intelligence 216, Proceedings of Machine Learning Research PMLR, 2023, pp. 1695–1706
  64. Roger Ratcliff “A theory of memory retrieval.” In Psychological review 85.2 American Psychological Association, 1978, pp. 59
  65. “Diffusion decision model: Current issues and history” In Trends in cognitive sciences 20.4 Elsevier, 2016, pp. 260–281
  66. “The Shared Socioeconomic Pathways and Their Energy, Land Use, and Greenhouse Gas Emissions Implications: An Overview” In Global Environmental Change 42, 2017, pp. 153–168 DOI: 10.1016/j.gloenvcha.2016.05.009
  67. “Sensitivity Analysis for Bayesian Hierarchical Models” In Bayesian Analysis 10.2 International Society for Bayesian Analysis, 2015, pp. 321–349
  68. Teemu Säilynoja, Paul-Christian Bürkner and Aki Vehtari “Graphical test for discrete uniformity and its applications in goodness-of-fit evaluation and multiple sample comparison” In Statistics and Computing 32.2 Springer, 2022, pp. 1–21
  69. Marvin Schmitt, Paul-Christian Bürkner and Köthe “Detecting Model Misspecification in Amortized Bayesian Inference with Neural Networks” In Proceedings of the German Conference on Pattern Recognition (GCPR), 2023
  70. Cornelius Schröder and Jakob H Macke “Simultaneous identification of models and parameters of scientific simulators” In arXiv preprint arXiv:2305.15174, 2023
  71. Joseph P. Simmons, Leif D. Nelson and Uri Simonsohn “False-Positive Psychology” In Psychological Science 22.11 SAGE Publications, 2011, pp. 1359–1366 DOI: 10.1177/0956797611417632
  72. “Increasing Transparency Through a Multiverse Analysis” In Perspectives on Psychological Science 11.5 SAGE Publications, 2016, pp. 702–712 DOI: 10.1177/1745691616658637
  73. “Validating Bayesian inference algorithms with simulation-based calibration” In arXiv preprint arXiv:1804.06788, 2018
  74. Aleksei Tiulpin and Matthew B. Blaschko “Greedy Bayesian Posterior Approximation with Deep Ensembles” In Transactions on Machine Learning Research, 2022 URL: https://openreview.net/forum?id=P1DuPJzVTN
  75. Aki Vehtari, Andrew Gelman and Jonah Gabry “Practical Bayesian model evaluation using leave-one-out cross-validation and WAIC” In Statistics and computing 27 Springer, 2017, pp. 1413–1432
  76. “Pareto smoothed importance sampling” In arXiv preprint arXiv:1507.02646, 2022
  77. “Sequential sampling models with variable boundaries and non-normal noise: A comparison of six models” In Psychonomic bulletin & review 26.3 Springer, 2019, pp. 813–832
  78. Eric-Jan Wagenmakers, Alexandra Sarafoglou and Balazs Aczel “One statistical analysis must not rule them all” In Nature 605.7910 Nature Publishing Group UK London, 2022, pp. 423–425
  79. “Robust neural posterior estimation and statistical model criticism” In Advances in Neural Information Processing Systems 35, 2022, pp. 33845–33859
  80. “Hyperparameter ensembles for robustness and uncertainty quantification” In Advances in Neural Information Processing Systems 33, 2020, pp. 6514–6527
  81. “Degrees of Freedom in Planning, Running, Analyzing, and Reporting Psychological Studies: A Checklist to Avoid p-Hacking” In Frontiers in Psychology 7 Frontiers Media SA, 2016 DOI: 10.3389/fpsyg.2016.01832
  82. Eva Marie Wieschen, Andreas Voss and Stefan Radev “Jumping to conclusion? a lévy flight model of decision making” In The Quantitative Methods for Psychology 16.2, 2020, pp. 120–132
  83. Andrew G Wilson and Pavel Izmailov “Bayesian deep learning and a probabilistic perspective of generalization” In Advances in neural information processing systems 33, 2020, pp. 4697–4708
  84. “Partially exchangeable networks and architectures for learning summary statistics in approximate Bayesian computation” In International Conference on Machine Learning, 2019, pp. 6798–6807 PMLR
  85. “Meta-Amortized Variational Inference and Learning” In Proceedings of the AAAI Conference on Artificial Intelligence 34.04 Association for the Advancement of Artificial Intelligence (AAAI), 2020, pp. 6404–6412 DOI: 10.1609/aaai.v34i04.6111
  86. “Deep sets” In Advances in neural information processing systems 30, 2017
  87. “Causes of Higher Climate Sensitivity in CMIP6 Models” In Geophysical Research Letters 47.1, 2020, pp. e2019GL085782 DOI: 10.1029/2019GL085782
  88. “A Comprehensive Survey on Transfer Learning” In Proceedings of the IEEE 109.1 Institute of ElectricalElectronics Engineers (IEEE), 2021, pp. 43–76 DOI: 10.1109/jproc.2020.3004555
  89. Janis H. Zickfeld and Thomas W. Schubert “How to Identify and How to Conduct Research that Is Informative and Reproducible” In Social Philosophy of Science for the Social Sciences Springer International Publishing, 2019, pp. 147–168 DOI: 10.1007/978-3-030-33099-6˙9
Citations (3)
List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

Summary

We haven't generated a summary for this paper yet.

Ai Generate Text Spark Streamline Icon: https://streamlinehq.com

Paper Prompts

Sign up for free to create and run prompts on this paper using GPT-5.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-up Questions

We haven't generated follow-up questions for this paper yet.