Interpreting and generalizing deep learning in physics-based problems with functional linear models (2307.04569v2)
Abstract: Although deep learning has achieved remarkable success in various scientific machine learning applications, its opaque nature poses concerns regarding interpretability and generalization capabilities beyond the training data. Interpretability is crucial and often desired in modeling physical systems. Moreover, acquiring extensive datasets that encompass the entire range of input features is challenging in many physics-based learning tasks, leading to increased errors when encountering out-of-distribution (OOD) data. In this work, motivated by the field of functional data analysis (FDA), we propose generalized functional linear models as an interpretable surrogate for a trained deep learning model. We demonstrate that our model could be trained either based on a trained neural network (post-hoc interpretation) or directly from training data (interpretable operator learning). A library of generalized functional linear models with different kernel functions is considered and sparse regression is used to discover an interpretable surrogate model that could be analytically presented. We present test cases in solid mechanics, fluid mechanics, and transport. Our results demonstrate that our model can achieve comparable accuracy to deep learning and can improve OOD generalization while providing more transparency and interpretability. Our study underscores the significance of interpretable representation in scientific machine learning and showcases the potential of functional linear models as a tool for interpreting and generalizing deep learning.
- Super-resolution analysis via machine learning: A survey for fluid flows. arXiv preprint arXiv:2301.10937, 2023.
- Super-resolution and denoising of 4D-Flow MRI using physics-informed deep neural nets. Computer Methods and Programs in Biomedicine, page 105729, 2020.
- Data-driven discovery of coordinates and governing equations. Proceedings of the National Academy of Sciences, 116(45):22445–22451, 2019.
- K. Duraisamy. Perspectives on machine learning-augmented reynolds-averaged and large eddy simulation models of turbulence. Physical Review Fluids, 6(5):050504, 2021.
- On transfer learning of neural networks using bi-fidelity data for uncertainty propagation. International Journal for Uncertainty Quantification, 10(6), 2020.
- Deep neural operators can serve as accurate surrogates for shape optimization: a case study for airfoils. arXiv preprint arXiv:2302.00807, 2023.
- Towards out of distribution generalization for problems in mechanics. Computer Methods in Applied Mechanics and Engineering, 400:115569, 2022.
- Parsimony as the ultimate regularizer for physics-informed machine learning. Nonlinear Dynamics, 107(3):1801–1817, 2022.
- Genetic programming based symbolic regression for analytical solutions to differential equations. arXiv preprint arXiv:2302.03175, 2023.
- Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proceedings of the National Academy of Sciences, 113(15):3932–3937, 2016.
- Toward predictive digital twins via component-based reduced-order models and interpretable machine learning. In AIAA Scitech 2020 Forum, page 0418, 2020.
- Explaining deep neural networks and beyond: A review of methods and applications. Proceedings of the IEEE, 109(3):247–278, 2021.
- A. Thampi. Interpretable AI: Building explainable machine learning systems. Simon and Schuster, 2022.
- Explainable machine learning in materials science. Npj Computational Materials, 8(1):204, 2022.
- Explainable, trustworthy, and ethical machine learning for healthcare: A survey. Computers in Biology and Medicine, page 106043, 2022.
- Transparency of deep neural networks for medical image analysis: A review of interpretability methods. Computers in Biology and Medicine, 140:105111, 2022.
- Explainable AI in aerospace for enhanced system performance. In 2021 IEEE/AIAA 40th Digital Avionics Systems Conference (DASC), pages 1–7. IEEE, 2021.
- Convolutional neural network models and interpretability for the anisotropic Reynolds stress tensor in turbulent one-dimensional flows. Journal of Turbulence, 23(1-2):1–28, 2022.
- Interpretable deep learning for prediction of Prandtl number effect in turbulent heat transfer. Journal of Fluid Mechanics, 955:A14, 2023.
- Explaining wall-bounded turbulence through deep learning. arXiv preprint arXiv:2302.01250, 2023.
- Data driven turbulence modeling in turbomachinery–an applicability study. Computers & Fluids, 238:105354, 2022.
- Interpretable models for extrapolation in scientific machine learning. arXiv preprint arXiv:2212.10283, 2022.
- L. Horváth and P. Kokoszka. Inference for functional data with applications, volume 200. Springer Science & Business Media, 2012.
- Functional data analysis. Annual Review of Statistics and its application, 3:257–295, 2016.
- S. Ullah and C. F. Finch. Applications of functional data analysis: A systematic review. BMC Medical Research Methodology, 13:1–12, 2013.
- Machine learning for cardiovascular biomechanics modeling: challenges and beyond. Annals of Biomedical Engineering, 50(6):615–627, 2022.
- C. Borggaard and H. H. Thodberg. Optimal minimal neural interpretation of spectra. Analytical Chemistry, 64(5):545–551, 1992.
- Hypothesis testing in comparative and experimental studies of function-valued traits. Evolution, 62(5):1229–1242, 2008.
- F. Ferraty and Y. Romain. The Oxford handbook of functional data analysis, 2011.
- A review and comparison of bandwidth selection methods for kernel regression. International Statistical Review, 82(2):243–274, 2014.
- S. Ghosh. Kernel smoothing: Principles, methods and applications. John Wiley & Sons, 2018.
- Comparing different nonlinear dimensionality reduction techniques for data-driven unsteady fluid flow modeling. Physics of Fluids, 34(11), 2022.
- Kernel learning for robust dynamic mode decomposition: linear and nonlinear disambiguation optimization. Proceedings of the Royal Society A, 478(2260):20210830, 2022.
- Neural operator: Learning maps between function spaces with applications to PDEs. Journal of Machine Learning Research, 24(89):1–97, 2023.
- Fourier neural operator for parametric partial differential equations. arXiv preprint arXiv:2010.08895, 2020.
- Simulating progressive intramural damage leading to aortic dissection using DeepONet: an operator–regression neural network. Journal of the Royal Society Interface, 19(187):20210670, 2022.
- A physics-guided neural operator learning approach to model biological tissues from digital image correlation measurements. Journal of Biomechanical Engineering, 144(12):121012, 2022.
- Forecasting subcritical cylinder wakes with Fourier Neural Operators. arXiv preprint arXiv:2301.08290, 2023.
- Neural operator: Graph kernel network for partial differential equations. arXiv preprint arXiv:2003.03485, 2020.
- D. G. Duffy. Green’s functions with applications. CRC press, 2015.
- S. Nair. Advanced topics in applied mathematics: for engineering and the physical sciences. Cambridge University Press, 2011.
- Deep learning for computer vision: A brief review. Computational Intelligence and Neuroscience, 2018, 2018.
- A perspective on machine learning in turbulent flows. Journal of Turbulence, 21(9-10):567–584, 2020.
- Convolutional-network models to predict wall-bounded turbulence from wall quantities. Journal of Fluid Mechanics, 928:A27, 2021.
- TONR: An exploration for a novel way combining neural network with topology optimization. Computer Methods in Applied Mechanics and Engineering, 386:114083, 2021.
- C. C. Aggarwal. Neural Networks and Deep Learning: A Textbook. Springer, 2018.
- Gaussian processes for machine learning, volume 2. MIT press Cambridge, MA, 2006.
- R. M. Neal. Bayesian learning for neural networks, volume 118. Springer Science & Business Media, 2012.
- Physics-informed neural operators. arXiv preprint arXiv:2207.05748, 2022.
- An introduction to kernel and operator learning methods for homogenization by self-consistent clustering analysis. Computational Mechanics, pages 1–25, 2023.
- Lift & learn: Physics-informed machine learning for large-scale nonlinear dynamical systems. Physica D: Nonlinear Phenomena, 406:132401, 2020.
- H. G. Müller and U. Stadtmüller. Generalized functional linear models. Annals of Statistics, 33(2):774–805, 2005.
- Kernel Smoothing in MATLAB: theory and practice of kernel smoothing. World scientific, 2012.
- H. G. Müller and F. Yao. Functional additive models. Journal of the American Statistical Association, 103(484):1534–1544, 2008.
- Functional linear regression that’s interpretable. The Annals of Statistics, 37(5A):2083–2108, 2009.
- T. DeVries and G. W. Taylor. Learning confidence for out-of-distribution detection in neural networks. arXiv preprint arXiv:1802.04865, 2018.
- Generalized out-of-distribution detection: A survey. arXiv preprint arXiv:2110.11334, 2021.
- Mechanical MNIST–distribution shift. 2022.
- Gradient-based learning applied to document recognition. Proceedings of the IEEE, 86(11):2278–2324, 1998.
- EMNIST: Extending MNIST to handwritten letters. In 2017 international joint conference on neural networks (IJCNN), pages 2921–2926. IEEE, 2017.
- E. Lejeune. Mechanical MNIST: A benchmark dataset for mechanical metamodels. Extreme Mechanics Letters, 36:100659, 2020.
- Automated solution of differential equations by the finite element method, volume 84. Springer, Berlin, Heidelberg, 2012.
- Uncovering near-wall blood flow from sparse data with physics-informed neural networks. Physics of Fluids, 33(7), 2021.
- Ensemble physics informed neural networks: A framework to improve inverse transport modeling in heterogeneous domains. Physics of Fluids, 33:053616, 2023.
- S. De and A. Doostan. Neural network training using L1-regularization and bi-fidelity data. Journal of Computational Physics, 458:111010, 2022.
- Predicting high-fidelity multiphysics data from low-fidelity fluid flow and transport solvers using physics-informed neural networks. International Journal of Heat and Fluid Flow, 96:109002, 2022.
- N. Altman and C. Leger. Bandwidth selection for kernel distribution function estimation. Journal of Statistical Planning and Inference, 46(2):195–214, 1995.
- Reliable extrapolation of deep neural operators informed by physics or sparse observations. Computer Methods in Applied Mechanics and Engineering, 412:116064, 2023.
- Lassonet: A neural network with feature sparsity. The Journal of Machine Learning Research, 22(1):5633–5661, 2021.
- Deepgreen: deep learning of Green’s functions for nonlinear boundary value problems. Scientific Reports, 11(1):21614, 2021.
- Data-driven discovery of Green’s functions with human-understandable deep learning. Scientific Reports, 12(1):4824, 2022.
- Applied koopmanism. Chaos: An Interdisciplinary Journal of Nonlinear Science, 22(4):047510, 2012.
- I. Mezić. Analysis of fluid flows via spectral properties of the Koopman operator. Annual Review of Fluid Mechanics, 45:357–378, 2013.
- Learning koopman invariant subspaces for dynamic mode decomposition. Advances in neural information processing systems, 30, 2017.
- Sparsity-promoting dynamic mode decomposition. Physics of Fluids, 26(2):024103, 2014.
- Unsupervised discovery of interpretable hyperelastic constitutive laws. Computer Methods in Applied Mechanics and Engineering, 381:113852, 2021.
- Compressive sensing based machine learning strategy for characterizing the flow around a cylinder with limited pressure measurements. Physics of Fluids, 25(12):127102, 2013.
- A. Arzani and S. Dawson. Data-driven cardiovascular flow modelling: examples and opportunities. Journal of The Royal Society Interface, 18:20200802, 2021.
- Neural additive models: Interpretable machine learning with neural nets. Advances in Neural Information Processing Systems, 34:4699–4711, 2021.
- Generalized functional linear models with semiparametric single-index interactions. Journal of the American Statistical Association, 105(490):621–633, 2010.
- R. L. Panton. Incompressible flow. John Wiley & Sons, 2013.
- Neural vortex method: from finite Lagrangian particles to infinite dimensional Eulerian dynamics. Computers & Fluids, 258:105811, 2023.
- V. Isakov. Inverse problems for partial differential equations, volume 127. Springer, 2006.
- The detection of surface vibrations from interior acoustical pressure. Inverse Problems, 19(3):507, 2003.
- Learning nonlinear operators via DeepONet based on the universal approximation theorem of operators. Nature Machine Intelligence, 3(3):218–229, 2021.
- Learning green’s functions associated with time-dependent partial differential equations. Journal of Machine Learning Research, 23(218):1–34, 2022.
- Synergistic integration of deep neural networks and finite element method with applications for biomechanical analysis of human aorta. bioRxiv, pages 2023–04, 2023.
- Amirhossein Arzani (7 papers)
- Lingxiao Yuan (3 papers)
- Pania Newell (3 papers)
- Bei Wang (102 papers)