Multi-Fidelity Residual Neural Processes for Scalable Surrogate Modeling (2402.18846v2)
Abstract: Multi-fidelity surrogate modeling aims to learn an accurate surrogate at the highest fidelity level by combining data from multiple sources. Traditional methods relying on Gaussian processes can hardly scale to high-dimensional data. Deep learning approaches utilize neural network based encoders and decoders to improve scalability. These approaches share encoded representations across fidelities without including corresponding decoder parameters. This hinders inference performance, especially in out-of-distribution scenarios when the highest fidelity data has limited domain coverage. To address these limitations, we propose Multi-fidelity Residual Neural Processes (MFRNP), a novel multi-fidelity surrogate modeling framework. MFRNP explicitly models the residual between the aggregated output from lower fidelities and ground truth at the highest fidelity. The aggregation introduces decoders into the information sharing step and optimizes lower fidelity decoders to accurately capture both in-fidelity and cross-fidelity information. We show that MFRNP significantly outperforms state-of-the-art in learning partial differential equations and a real-world climate modeling task. Our code is published at: https://github.com/Rose-STL-Lab/MFRNP
- The era5 global reanalysis. Quarterly Journal of the Royal Meteorological Society, 146(730):1999–2049, 2020.
- Smart transportation planning: Data, models, and algorithms. Transportation Engineering, 2:100013, 2020.
- Survey of multifidelity methods in uncertainty propagation, inference, and optimization. Siam Review, 60(3):550–591, 2018.
- Matthias Seeger. Gaussian processes for machine learning. International journal of neural systems, 14(02):69–106, 2004.
- Multifidelity information fusion algorithms for high-dimensional systems and massive data sets. SIAM Journal on Scientific Computing, 38(4):B521–B538, 2016.
- Multi-fidelity high-order gaussian processes for physical simulation. In International Conference on Artificial Intelligence and Statistics, pages 847–855. PMLR, 2021.
- Gaussian processes for regression. Advances in neural information processing systems, 8, 1995.
- Carl Edward Rasmussen. Gaussian processes in machine learning. In Summer school on machine learning, pages 63–71. Springer, 2003.
- Deep gaussian processes. In Artificial intelligence and statistics, pages 207–215. PMLR, 2013.
- Deep multi-fidelity gaussian processes. arXiv preprint arXiv:1604.07484, 2016.
- Doubly stochastic variational inference for deep gaussian processes. In NIPS, 2017.
- Deep kernel learning. In Artificial intelligence and statistics, pages 370–378. PMLR, 2016.
- Mfpc-net: Multi-fidelity physics-constrained neural process. arXiv preprint arXiv:2010.01378, 2020.
- Multi-fidelity modeling with different input domain definitions using deep gaussian processes. Structural and Multidisciplinary Optimization, 63(5):2267–2288, 2021.
- Deep multi-fidelity active learning of high-dimensional outputs. In International Conference on Artificial Intelligence and Statistics, pages 1694–1711. PMLR, 2022a.
- Batch multi-fidelity active learning with budget constraints. In Advances in Neural Information Processing Systems, 2022b.
- Disentangled multi-fidelity deep bayesian active learning. arXiv preprint arXiv:2305.04392, 2023.
- Neural processes. arXiv preprint arXiv:1807.01622, 2018a.
- Qi Wang and Herke Van Hoof. Doubly stochastic variational inference for neural processes with hierarchical latent variables. In International Conference on Machine Learning, pages 10018–10028. PMLR, 2020.
- The neural process family: Survey, applications and perspectives. arXiv preprint arXiv:2209.00517, 2022.
- Overview of gaussian process based multi-fidelity techniques with variable relationship between fidelities, application to aerospace systems. Aerospace Science and Technology, 107:106339, 2020.
- S. Hosking. Multifidelity climate modelling, github. https://github.com/scotthosking/mf_modelling, 2020.
- Multifidelity prediction in wildfire spread simulation: Modeling, uncertainty quantification and sensitivity analysis. Environmental Modelling & Software, 141:105050, 2021.
- Predicting the output from a complex computer code when fast approximations are available. Biometrika, 87(1):1–13, 2000.
- Recursive co-kriging model for design of computer experiments with multiple levels of fidelity. International Journal for Uncertainty Quantification, 4(5), 2014.
- Multi-fidelity modelling via recursive co-kriging and gaussian–markov random fields. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 471(2179):20150018, 2015.
- Nonlinear information fusion algorithms for data-efficient multi-fidelity modelling. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 473(2198):20160751, 2017.
- Deep gaussian processes for multi-fidelity modeling. arXiv preprint arXiv:1903.07320, 2019.
- Infinite-fidelity coregionalization for physical simulation. Advances in Neural Information Processing Systems, 35:25965–25978, 2022c.
- Multi-fidelity regression using artificial neural networks: efficient approximation of parameter-dependent output quantities. Computer methods in applied mechanics and engineering, 389:114378, 2022.
- A composite neural network that learns from multi-fidelity data: Application to function approximation and inverse pde problems. Journal of Computational Physics, 401:109020, 2020.
- Multi-fidelity bayesian neural networks: Algorithms and applications. Journal of Computational Physics, 438:110361, 2021.
- On transfer learning of neural networks using bi-fidelity data for uncertainty propagation. International Journal for Uncertainty Quantification, 10(6), 2020.
- Multi-fidelity bayesian optimization via deep neural networks. Advances in Neural Information Processing Systems, 33:8521–8531, 2020.
- Allocation strategies for high fidelity models in the multifidelity regime. SIAM/ASA Journal on Uncertainty Quantification, 7(1):203–231, 2019.
- Multi-fidelity bayesian optimisation with continuous approximations. In International Conference on Machine Learning, pages 1799–1808. PMLR, 2017.
- Conditional neural processes. In International Conference on Machine Learning, pages 1704–1713. PMLR, 2018b.
- Attentive neural processes. In International Conference on Learning Representations, 2018.
- The functional neural process. Advances in Neural Information Processing Systems, 2019.
- Sequential neural processes. Advances in Neural Information Processing Systems, 32:10254–10264, 2019.
- Evaluation of climate models. In Climate change 2013: the physical science basis. Contribution of Working Group I to the Fifth Assessment Report of the Intergovernmental Panel on Climate Change, pages 741–866. Cambridge University Press, 2014.
- Challenges in combining projections from multiple climate models. Journal of Climate, 23(10):2739–2758, 2010.
- Cmip1 evaluation and intercomparison of coupled climate models. Climate Dynamics, 17:83–106, 2001.
- Performance metrics for climate models. Journal of Geophysical Research: Atmospheres, 113(D6), 2008.
- The use of the multi-model ensemble in probabilistic climate projections. Philosophical transactions of the royal society A: mathematical, physical and engineering sciences, 365(1857):2053–2075, 2007.
- Calculation of average, uncertainty range, and reliability of regional climate changes from aogcm simulations via the “reliability ensemble averaging”(rea) method. Journal of climate, 15(10):1141–1158, 2002.
- Esd reviews: Model dependence in multi-model climate ensembles: Weighting, sub-selection and out-of-sample testing. Earth System Dynamics, 10(1):91–105, 2019.
- Downscaling multi-model climate projection ensembles with deep learning (deepesd): contribution to cordex eur-44. Geoscientific Model Development, 15(17):6747–6758, 2022.
- The era-interim reanalysis: Configuration and performance of the data assimilation system. Quarterly Journal of the royal meteorological society, 137(656):553–597, 2011.
- Multimodel ensemble analysis with neural network gaussian processes. The Annals of Applied Statistics, 17(4):3403–3425, 2023.
- Downscaling techniques for high-resolution climate projections: From global change to local impacts. Cambridge University Press, 2021.
- Louise Olsen-Kettle. Numerical solution of partial differential equations. Lecture notes at University of Queensland, Australia, 2011.
- Learning to control pdes with differentiable physics. arXiv preprint arXiv:2001.07457, 2020.
- Alexandre Joel Chorin. Numerical solution of the navier-stokes equations. Mathematics of computation, 22(104):745–762, 1968.
- Climatebench v1. 0: A benchmark for data-driven climate projections. Journal of Advances in Modeling Earth Systems, 14(10):e2021MS002954, 2022.
- The scenario model intercomparison project (scenariomip) for cmip6. Geoscientific Model Development, 9(9):3461–3482, 2016.
- Principal component analysis. Chemometrics and intelligent laboratory systems, 2(1-3):37–52, 1987.
- Pytorch: An imperative style, high-performance deep learning library. Advances in neural information processing systems, 32, 2019.
- Deep multi-fidelity active learning of high-dimensional outputs, 2021.
- Multi-fidelity hierarchical neural processes. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, pages 2029–2038, 2022.
- A climate model projection weighting scheme accounting for performance and interdependence. Geophysical Research Letters, 44(4):1909–1918, 2017.
- Configuration and spin-up of access-cm2, the new generation australian community climate and earth system simulator coupled model. Journal of Southern Hemisphere Earth Systems Science, 70(1):225–251, 2020.
- Bcc-csm2-hr: a high-resolution version of the beijing climate center climate system model. Geoscientific Model Development, 14(5):2977–3006, 2021.
- Global mean climate and main patterns of variability in the cmcc-cm2 coupled model. Journal of Advances in Modeling Earth Systems, 11(1):185–209, 2019.
- The gfdl earth system model version 4.1 (gfdl-esm 4.1): Overall coupled model description and simulation characteristics. Journal of Advances in Modeling Earth Systems, 12(11):e2019MS002015, 2020.
- EM Volodin. Possible climate change in russia in the 21st century based on the inm-cm5-0 climate model. Russian Meteorology and Hydrology, 47(5):327–333, 2022.
- Presentation and evaluation of the ipsl-cm6a-lr climate model. Journal of Advances in Modeling Earth Systems, 12(7):e2019MS002010, 2020.
- Nims-kma kace1. 0-g model output prepared for cmip6 scenariomip. 2019.
- Ronald Stouffer. Ipcc ddc: Ua mcm-ua-1-0 model output prepared for cmip6 scenariomip. 2019.
- The meteorological research institute earth system model version 2.0, mri-esm2. 0: Description and basic evaluation of the physical component. Journal of the Meteorological Society of Japan. Ser. II, 97(5):931–965, 2019.
- The canadian earth system model version 5 (canesm5. 0.3). Geoscientific Model Development, 12(11):4823–4873, 2019.
- Max planck institute earth system model (mpi-esm1. 2) for the high-resolution model intercomparison project (highresmip). Geoscientific Model Development, 12(7):3241–3281, 2019.
- Overview of the norwegian earth system model (noresm2) and key climate response of cmip6 deck, historical, and scenario simulations. Geoscientific Model Development, 13(12):6165–6200, 2020.
- Ukesm1: Description and evaluation of the uk earth system model. Journal of Advances in Modeling Earth Systems, 11(12):4513–4558, 2019.
- Adam: A method for stochastic optimization. arXiv preprint arXiv:1412.6980, 2014.
- Ruijia Niu (5 papers)
- Dongxia Wu (14 papers)
- Kai Kim (2 papers)
- Yi-An Ma (49 papers)
- Duncan Watson-Parris (21 papers)
- Rose Yu (84 papers)