ASPIRE: Iterative Amortized Posterior Inference for Bayesian Inverse Problems (2405.05398v1)
Abstract: Because they quantify uncertainty, Bayesian solutions to inverse problems are the framework of choice in risk-averse applications. These benefits come at the cost of computations that are, in general, intractable. Recent advances in machine learning and variational inference (VI) have lowered the computational barrier by learning from examples. Two VI paradigms have emerged that represent different tradeoffs: amortized and non-amortized. Amortized VI produces fast results, but because it must generalize across many observed datasets, its inference results are suboptimal. Non-amortized VI is slower at inference but finds better posterior approximations since it is specialized to a single observed dataset. Current amortized VI techniques run into a suboptimality wall that cannot be overcome without more expressive neural networks or extra training data. We present a solution that enables iterative improvement of amortized posteriors using the same network architectures and training data. The benefits of our method require extra computations, but these remain frugal since they are based on physics-hybrid methods and summary statistics. Importantly, these computations remain mostly offline, so our method maintains cheap and reusable online evaluation while bridging the approximation gap between these two paradigms. We call our proposed method ASPIRE: Amortized posteriors with Summaries that are Physics-based and Iteratively REfined. We first validate our method on a stylized problem with a known posterior, then demonstrate its practical use on a high-dimensional, nonlinear transcranial medical imaging problem with ultrasound. Compared with the baseline and previous methods from the literature, our method stands out as a computationally efficient and high-fidelity method for posterior inference.