SineNet: Learning Temporal Dynamics in Time-Dependent Partial Differential Equations (2403.19507v1)
Abstract: We consider using deep neural networks to solve time-dependent partial differential equations (PDEs), where multi-scale processing is crucial for modeling complex, time-evolving dynamics. While the U-Net architecture with skip connections is commonly used in prior studies to enable multi-scale processing, our analysis shows that the need for features to evolve across layers results in temporally misaligned features in skip connections, which limits the model's performance. To address this limitation, we propose SineNet, which consists of multiple sequentially connected U-shaped network blocks, referred to as waves. In SineNet, high-resolution features are evolved progressively through multiple stages, thereby reducing the amount of misalignment within each stage. We furthermore analyze the role of skip connections in enabling both parallel and sequential processing of multi-scale information. Our method is rigorously tested on multiple PDE datasets, including the Navier-Stokes equations and shallow water equations, showcasing the advantages of our proposed approach over conventional U-Nets with a comparable parameter budget. We further demonstrate that increasing the number of waves in SineNet while keeping the number of parameters fixed leads to monotonically improved performance. The results highlight the effectiveness of SineNet and the potential of our approach in advancing the state of the art in neural PDE solver design. Our code is available as part of AIRS (https://github.com/divelab/AIRS).
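To make the structure described in the abstract concrete, the following is a minimal toy sketch in plain Python of several U-shaped blocks ("waves") applied one after another to a 1D periodic signal. It is not the authors' implementation: the names `smooth`, `downsample`, `upsample`, `wave`, and `sinenet` are hypothetical, the fixed 3-point average stands in for learned convolutions, and merging skip connections by averaging is an assumption made only to keep the example short.

```python
from typing import List

Signal = List[float]


def smooth(x: Signal) -> Signal:
    """Stand-in for a learned convolution: 3-point periodic average (assumption)."""
    n = len(x)
    return [(x[(i - 1) % n] + x[i] + x[(i + 1) % n]) / 3.0 for i in range(n)]


def downsample(x: Signal) -> Signal:
    """Halve the resolution by averaging adjacent pairs."""
    return [(x[2 * i] + x[2 * i + 1]) / 2.0 for i in range(len(x) // 2)]


def upsample(x: Signal) -> Signal:
    """Double the resolution by repeating each value."""
    out: Signal = []
    for v in x:
        out.extend([v, v])
    return out


def wave(x: Signal, depth: int) -> Signal:
    """One U-shaped block: go down `depth` levels, then back up with skips."""
    skips: List[Signal] = []
    h = smooth(x)
    for _ in range(depth):
        skips.append(h)               # keep the high-resolution features
        h = smooth(downsample(h))     # process at the coarser scale
    for skip in reversed(skips):
        up = upsample(h)
        # Merge the skip connection by averaging (assumption, for illustration).
        h = smooth([0.5 * (a + b) for a, b in zip(up, skip)])
    return h


def sinenet(x: Signal, num_waves: int, depth: int) -> Signal:
    """Apply `num_waves` U-shaped blocks sequentially to the same-resolution signal."""
    h = x
    for _ in range(num_waves):
        h = wave(h, depth)
    return h


if __name__ == "__main__":
    # The signal length must be divisible by 2 ** depth.
    signal = [float(i) for i in range(8)]
    print(sinenet(signal, num_waves=3, depth=2))
```

In this sketch each wave returns to full resolution before the next one begins, so the high-resolution features advance in several smaller stages rather than in one deep U-shaped pass, which is the structural idea the abstract describes.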