Stacked tensorial neural networks for reduced-order modeling of a parametric partial differential equation (2312.14979v1)
Abstract: Tensorial neural networks (TNNs) combine the successes of multilinear algebra with those of deep learning to enable extremely efficient reduced-order models of high-dimensional problems. Here, I describe a deep neural network architecture that fuses multiple TNNs into a larger network, intended to solve a broader class of problems than a single TNN. I evaluate this architecture, referred to as a "stacked tensorial neural network" (STNN), on a parametric PDE with three independent variables and three parameters. The three parameters correspond to one PDE coefficient and two quantities describing the domain geometry. The STNN provides an accurate reduced-order description of the solution manifold over a wide range of parameters. There is also evidence of meaningful generalization to parameter values outside its training data. Finally, while the STNN architecture is relatively simple and problem agnostic, it can be regularized to incorporate problem-specific features like symmetries and physical modeling assumptions.
- B. Haasdonk, Model reduction and approximation: theory and algorithms 15, 65 (2017).
- A. Miranville and S. Zelik, “Chapter 3 attractors for dissipative partial differential equations in bounded and unbounded domains,” in Handbook of Differential Equations: Evolutionary Equations (Elsevier, 2008) p. 103–200.
- J. C. Robinson, Cambridge texts in applied mathematics: Infinite-dimensional dynamical systems: An introduction to dissipative parabolic PDEs and the theory of global attractors series number 28, Cambridge texts in applied mathematics (Cambridge University Press, Cambridge, England, 2001).
- S. A. Faroughi, N. Pawar, C. Fernandes, M. Raissi, S. Das, N. K. Kalantari, and S. K. Mahjour, “Physics-guided, physics-informed, and physics-encoded neural networks in scientific computing,” (2022).
- S. Goswami, A. Bora, Y. Yu, and G. E. Karniadakis, “Physics-informed deep neural operator networks,” in Machine Learning in Modeling and Simulation: Methods and Applications, edited by T. Rabczuk and K.-J. Bathe (Springer International Publishing, Cham, 2023) pp. 219–254.
- S. Fresca and A. Manzoni, Comput. Methods. Appl. Mech. Eng. 388, 114181 (2022).
- C. Cercignani, Mathematical Methods in Kinetic Theory (Springer US, 1990).
- G. C. Pomraning, Nucl. Sci. Eng. 112, 239–255 (1992).
- R. G. McClarren, Trans. Theory Stat. Phys. 39, 73–109 (2010).
- A. Kokhanovsky, ed., Springer Series in Light Scattering: Volume 3: Radiative Transfer and Light Scattering (Springer International Publishing, 2019).
- D. Mumford, “Elastica and computer vision,” in Algebraic Geometry and its Applications, edited by C. L. Bajaj (Springer New York, New York, NY, 1994) pp. 491–506.
- C. G. Wagner and R. Beals, J. Phys. A 52, 115204 (2019).
- D. Zwillinger and V. Dobrushkin, Handbook of differential equations, 4th ed., Advances in Applied Mathematics (Chapman & Hall/CRC, Philadelphia, PA, 2021).
- “CuPy: Numpy & scipy for GPU (version 10.6.0),” https://github.com/cupy/cupy.
- J. A. Reyes and E. M. Stoudenmire, Mach. Learn. Sci. Technol. 2, 035036 (2021).
- T. G. Kolda and B. W. Bader, SIAM Review 51, 455–500 (2009).
- I. V. Oseledets, SIAM J. Sci. Comput. 33, 2295–2317 (2011).
- S. L. Brunton and J. N. Kutz, Data-Driven Science and Engineering (Cambridge University Press, 2019).
- K. Kormann, SIAM J. Sci. Comput. 37, B613–B632 (2015).
- M. Kiffner and D. Jaksch, Phys. Rev. Fluids 8, 124101 (2023).
- E. Ye and N. Loureiro, arXiv preprint arXiv:2311.07756 (2023).
- M. E. Widder and U. M. Titulaer, J. Stat. Phys. 56, 471–498 (1989).
- L. A. Viehland, “Moment methods for solving the boltzmann equation,” in Gaseous Ion Mobility, Diffusion, and Reaction (Springer International Publishing, Cham, 2018) pp. 127–154.
- M. Abadi et al., “TensorFlow: Large-scale machine learning on heterogeneous systems,” (2015), software available from tensorflow.org.
- “t3f (version 1.1.0),” https://github.com/Bihaqo/t3f.
- X. Glorot and Y. Bengio, in Proceedings of the Thirteenth International Conference on Artificial Intelligence and Statistics, Proc. Mach. Learn. Res., Vol. 9, edited by Y. W. Teh and M. Titterington (PMLR, 2010) pp. 249–256.
- C. G. Wagner, Mathematics of nonequilibrium steady states in dilute active matter, Ph.D. thesis (2020).
- P. Morse and H. Feshbach, Methods of Theoretical Physics (McGraw-Hill, 1953).
- N. Papamichael and N. Stylianopoulos, Numerical conformal mapping: Domain decomposition and the mapping of quadrilaterals (World Scientific Publishing, Singapore, Singapore, 2010).
- J. Berg and K. Nyström, Neurocomputing 317, 28–41 (2018).
- J.-K. Seo, Scientific Reports 12 (2022), 10.1038/s41598-022-18315-4.