Multifidelity linear regression for scientific machine learning from scarce data (2403.08627v2)
Abstract: Machine learning (ML) methods, which fit to data the parameters of a given parameterized model class, have garnered significant interest as potential methods for learning surrogate models for complex engineering systems for which traditional simulation is expensive. However, in many scientific and engineering settings, generating high-fidelity data on which to train ML models is expensive, and the available budget for generating training data is limited, so that high-fidelity training data are scarce. ML models trained on scarce data have high variance, resulting in poor expected generalization performance. We propose a new multifidelity training approach for scientific machine learning via linear regression that exploits the scientific context where data of varying fidelities and costs are available: for example, high-fidelity data may be generated by an expensive fully resolved physics simulation whereas lower-fidelity data may arise from a cheaper model based on simplifying assumptions. We use the multifidelity data within an approximate control variate framework to define new multifidelity Monte Carlo estimators for linear regression models. We provide bias and variance analysis of our new estimators that guarantee the approach's accuracy and improved robustness to scarce high-fidelity data. Numerical results demonstrate that our multifidelity training approach achieves similar accuracy to the standard high-fidelity only approach with orders-of-magnitude reduced high-fidelity data requirements.
- A multifidelity deep operator network approach to closure for multiscale systems. Computer Methods in Applied Mechanics and Engineering, 414:116161, 2023.
- Context-aware surrogate modeling for balancing approximation and sampling costs in multifidelity importance sampling and Bayesian inverse problems. SIAM/ASA Journal on Uncertainty Quantification, 11(1):285–319, 2023.
- Model reduction and neural networks for parametric PDEs. The SMAI Journal of Computational Mathematics, 7:121–157, 2021.
- Overview of gaussian process based multi-fidelity techniques with variable relationship between fidelities, application to aerospace systems. Aerospace Science and Technology, 107:106339, 2020.
- M. Buffoni and K. Willcox. Projection-based model reduction for reacting flows. In 40th Fluid Dynamics Conference and Exhibit, page 5008, 2010.
- Multifidelity uncertainty quantification and model validation of large-scale multidisciplinary systems. Journal of Astronomical Telescopes, Instruments, and Systems, 8(3):038001, 2022.
- Multilevel Bayesian deep neural networks. arXiv preprint arXiv:2203.12961, 2022.
- Sensitivity-based scaling for approximating structural response. Journal of Aircraft, 30(2):283–288, 1993.
- S. Chaturantabut and D. Sorensen. Nonlinear model reduction via discrete empirical interpolation. SIAM Journal on Scientific Computing, 32(5):2737–2764, 2010.
- Multifidelity optimization under uncertainty for a tailless aircraft. In 2018 AIAA Non-Deterministic Approaches Conference, page 1658, 2018.
- mfegra: Multifidelity efficient global reliability analysis through active learning for failure boundary location. Structural and Multidisciplinary Optimization, 64(2):797–811, 2021.
- B. Cuenot and T. Poinsot. Asymptotic and numerical study of diffusion flames with variable lewis number and finite rate chemistry. Combustion and Flame, 104(1-2):111–137, 1996.
- On transfer learning of neural networks using bi-fidelity data for uncertainty propagation. International Journal for Uncertainty Quantification, 10(6), 2020.
- Neural network training using ℓ1superscriptℓ1\ell^{1}roman_ℓ start_POSTSUPERSCRIPT 1 end_POSTSUPERSCRIPT-regularization and bi-fidelity data. Journal of Computational Physics, 458:111010, 2022.
- Bi-fidelity modeling of uncertain and partially unknown systems using DeepONets. arXiv preprint arXiv:2204.00997, 2022.
- The cost-accuracy trade-off in operator learning with neural networks. Journal of Machine Learning, 1(3):299–341, 2022.
- A DeepONet multi-fidelity approach for residual learning in reduced order modeling. arXiv preprint arXiv:2302.12682, 2023.
- Context-aware learning of hierarchies of low-fidelity models for multi-fidelity uncertainty quantification. Computer Methods in Applied Mechanics and Engineering, 406:115908, 2023.
- Linear regression-based multifidelity surrogate for disturbance amplification in multiphase explosion. Structural and Multidisciplinary Optimization, 60:2205–2220, 2019.
- Multilevel Monte Carlo variational inference. The Journal of Machine Learning Research, 22(1):12741–12784, 2021.
- Multilevel Monte Carlo learning. arXiv preprint arXiv:2102.08734, 2021.
- Michael B Giles. Multilevel Monte Carlo path simulation. Operations research, 56(3):607–617, 2008.
- Michael B Giles. Multilevel Monte Carlo methods. Acta numerica, 24:259–328, 2015.
- Multi-fidelity regression using artificial neural networks: Efficient approximation of parameter-dependent output quantities. Computer methods in applied mechanics and engineering, 389:114378, 2022.
- Raphael T Haftka. Combining global and local approximations. AIAA journal, 29(9):1523–1525, 1991.
- Non-intrusive reduced order modeling of nonlinear problems using neural networks. Journal of Computational Physics, 363:55–78, 2018.
- Matrix analysis. Cambridge university press, 2012.
- Multilayer feedforward networks are universal approximators. Neural networks, 2(5):359–366, 1989.
- Multifidelity deep operator networks. arXiv preprint arXiv:2204.09157, 2022.
- Multifidelity robust topology optimization for material uncertainties with digital manufacturing. In AIAA SCITECH 2023 Forum, page 2038, 2023.
- Su Jiang and Louis J Durlofsky. Use of multifidelity training data and transfer learning for efficient construction of subsurface flow surrogate models. Journal of Computational Physics, 474:111800, 2023.
- Predicting the output from a complex computer code when fast approximations are available. Biometrika, 87(1):1–13, 2000.
- Response surface models combining linear and euler aerodynamics for supersonic transport design. Journal of Aircraft, 36(1):75–86, 1999.
- Recursive co-kriging model for design of computer experiments with multiple levels of fidelity. International Journal for Uncertainty Quantification, 4(5), 2014.
- Fourier neural operator for parametric partial differential equations. ICLR 2021; arXiv:2010.08895, 2020.
- Multi-fidelity physics-constrained neural network and its application in materials modeling. Journal of Mechanical Design, 141(12):121403, 2019.
- Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature Machine Intelligence, 3(3):218–229, 2021.
- Multifidelity deep neural operators for efficient learning of partial differential equations with application to fast inverse design of nanoscale heat transport. Physical Review Research, 4(2):023210, 2022.
- Efficient PDE-constrained optimization under high-dimensional uncertainty using derivative-informed neural operators. arXiv preprint arXiv:2305.20053, 2023.
- Contour location via entropy reduction leveraging multiple information sources. Advances in neural information processing systems, 31, 2018.
- Prediction and uncertainty quantification of shale well performance using multifidelity Monte Carlo. Gas Science and Engineering, 110:204877, 2023.
- A composite neural network that learns from multi-fidelity data: Application to function approximation and inverse PDE problems. Journal of Computational Physics, 401:109020, 2020.
- I. Mezić. Analysis of fluid flows via spectral properties of the Koopman operator. Annual Review of Fluid Mechanics, 45:357–378, 2013.
- Igor Mezić. Spectral properties of dynamical systems, model reduction and decompositions. Nonlinear Dynamics, 41(1):309–325, 2005.
- Bayesian, multifidelity operator learning for complex engineering systems–a position paper. Journal of Computing and Information Science in Engineering, 23(6), 2023.
- The random feature model for input-output maps between banach spaces. SIAM Journal on Scientific Computing, 43(5):A3212–A3243, 2021.
- Multifidelity approaches for optimization under uncertainty. International Journal for numerical methods in Engineering, 100(10):746–772, 2014.
- Derivative informed neural operator: An efficient framework for high-dimensional parametric derivative learning. arXiv preprint arXiv:2206.10745, 2022.
- Multi-fidelity gaussian process regression for prediction of random fields. Journal of Computational Physics, 336:36–50, 2017.
- Optimal model management for multifidelity Monte Carlo estimation. SIAM Journal on Scientific Computing, 38(5):A3163–A3194, 2016.
- Data-driven operator inference for nonintrusive projection-based model reduction. Computer Methods in Applied Mechanics and Engineering, 306:196–215, 2016.
- Nonlinear information fusion algorithms for data-efficient multi-fidelity modelling. Proceedings of the Royal Society A: Mathematical, Physical and Engineering Sciences, 473(2198):20160751, 2017.
- Multi-information source optimization. Advances in neural information processing systems, 30, 2017.
- Reduced operator inference for nonlinear partial differential equations. SIAM Journal on Scientific Computing, 44(4), 2022.
- Lift & Learn: Physics-informed machine learning for large-scale nonlinear dynamical systems. Physica D: Nonlinear Phenomena, Volume 406, 2020.
- Multifidelity Monte Carlo estimation of variance and sensitivity indices. SIAM/ASA Journal on Uncertainty Quantification, 6(2):683–706, 2018.
- A data-driven multi-fidelity physics-informed learning framework for smart manufacturing: a composites processing case study. In 2022 IEEE 5th International Conference on Industrial Cyber-Physical Systems (ICPS), pages 01–07. IEEE, 2022.
- Efficiency of multivariate control variates in monte carlo simulation. Operations Research, 33(3):661–677, 1985.
- Projection-based multifidelity linear regression for data-poor applications. In AIAA SCITECH 2023 Forum, page 0916, 2023.
- A multi-fidelity surrogate model based on support vector regression. Structural and Multidisciplinary Optimization, 61:2363–2375, 2020.
- On multilevel Monte Carlo unbiased gradient estimation for deep latent variable models. In International Conference on Artificial Intelligence and Statistics, pages 3925–3933. PMLR, 2021.
- Transfer learning on multifidelity data. Journal of Machine Learning for Modeling and Computing, 3(1), 2022.
- Projection-based model reduction: Formulations for physics-based machine learning. Computers and Fluids, 179:704–717, 2019.
- Further analysis of multilevel monte carlo methods for elliptic PDEs with random coefficients. Numerische Mathematik, 125:569–600, 2013.
- A data–driven approximation of the Koopman operator: Extending dynamic mode decomposition. Journal of Nonlinear Science, 25:1307–1346, 2015.
- A kernel-based method for data-driven Koopman spectral analysis. Journal of Computational Dynamics, 2(2):247–265, 2015.
- Multifidelity surrogate based on single linear regression. AIAA Journal, 56(12):4944–4952, 2018.
- Elizabeth Qian (8 papers)
- Anirban Chaudhuri (8 papers)
- Dayoung Kang (2 papers)
- Vignesh Sella (1 paper)