Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data (1901.06314v1)

Published 18 Jan 2019 in physics.comp-ph, cs.CV, cs.LG, and stat.ML

Abstract: Surrogate modeling and uncertainty quantification tasks for PDE systems are most often considered as supervised learning problems where input and output data pairs are used for training. The construction of such emulators is by definition a small data problem which poses challenges to deep learning approaches that have been developed to operate in the big data regime. Even in cases where such models have been shown to have good predictive capability in high dimensions, they fail to address constraints in the data implied by the PDE model. This paper provides a methodology that incorporates the governing equations of the physical model in the loss/likelihood functions. The resulting physics-constrained, deep learning models are trained without any labeled data (e.g. employing only input data) and provide comparable predictive responses with data-driven models while obeying the constraints of the problem at hand. This work employs a convolutional encoder-decoder neural network approach as well as a conditional flow-based generative model for the solution of PDEs, surrogate model construction, and uncertainty quantification tasks. The methodology is posed as a minimization problem of the reverse Kullback-Leibler (KL) divergence between the model predictive density and the reference conditional density, where the later is defined as the Boltzmann-Gibbs distribution at a given inverse temperature with the underlying potential relating to the PDE system of interest. The generalization capability of these models to out-of-distribution input is considered. Quantification and interpretation of the predictive uncertainty is provided for a number of problems.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Yinhao Zhu (14 papers)
  2. Nicholas Zabaras (21 papers)
  3. Phaedon-Stelios Koutsourelakis (30 papers)
  4. Paris Perdikaris (61 papers)
Citations (806)

Summary

Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data

The paper "Physics-Constrained Deep Learning for High-dimensional Surrogate Modeling and Uncertainty Quantification without Labeled Data" by Yinhao Zhu, Nicholas Zabaras, Phaedon-Stelios Koutsourelakis, and Paris Perdikaris proposes a novel approach to surrogate modeling and uncertainty quantification for PDE systems. The methodology crucially relies on incorporating governing physical equations into deep learning models, effectively eliminating the need for labeled datasets—traditionally a substantial requirement in training surrogate models.

The paper provides a detailed development of physics-constrained deep learning models, showcasing their application in solving high-dimensional partial differential equations (PDEs), constructing surrogate models, and quantifying uncertainties. The significance of their approach lies in the way it circumnavigates one of the main challenges in deep learning for physical systems: the need for extensive labeled data, which is often infeasible to obtain in high-dimensional settings.

Key Contributions

  • Physics-Constrained Learning: The authors introduce a loss function that integrates the physical principles governed by PDEs directly into the training process of deep learning models. This is done by minimizing a custom loss function, involving the residuals of the PDEs and boundary conditions. This allows the model to learn to adhere to physical laws without requiring labeled output data, fundamentally differing from traditional data-driven learning approaches.
  • Surrogate Models: The paper deploys convolutional encoder-decoder neural networks to create surrogate models capable of high-dimensional input-output transformations. The training is framed as an optimization problem, with the objective of minimizing the discrepancies between model-predicted and physically consistent outputs.
  • Uncertainty Quantification: Addressing the requirement for uncertainty quantification, the authors propose a probabilistic surrogate model based on conditional flow-based generative models. The optimization leverages reverse Kullback-Leibler (KL) divergence to distill a generative model of the solution conditioned on input data, furnishing a mechanism to quantify predictive uncertainties.

Methodology and Numerical Experiments

The authors first illustrate the architecture of their convolutional encoder-decoder neural networks and compare these with fully connected networks (FC-NNs). The results underscore the efficiency of convolutional neural networks (CNNs) in capturing multiscale features of the solution fields robustly and efficiently. Extensive experiments demonstrate the fidelity of CNNs in predicting solutions for deterministic PDEs, notably outperforming FC-NNs in terms of accuracy and computational efficiency.

A significant experimental component involves solving Darcy flow problems to validate the methodology. Detailed network architectures and results for linear and nonlinear Darcy flow are presented. The ability of CNNs to capture complex spatial patterns in pressure and flux fields convincingly was shown through comparisons to conventional solvers.

Surrogate Models Evaluation: Surrogates trained without labeled outputs (PCS) displayed comparable prediction capabilities to those trained with labeled data (DDS). PCS showed superior generalization characteristics, particularly under out-of-distribution conditions, proving its robustness and practical value. PCS models trained with substantial input data (e.g., 8192 samples of GRF KLE512) consistently demonstrated better prediction accuracy compared to DDS.

Probabilistic Surrogate Models: The authors employ a flow-based generative model to construct a probabilistic surrogate capable of capturing predictive distributions faithfully. The conditional Glow model notably handles uncertainty propagation tasks efficiently, with validation revealing well-calibrated uncertainty estimates across various conditions.

Implications and Future Work

The presented framework establishes a crucial advancement in surrogate modeling for high-dimensional PDEs by fundamentally reducing dependency on labeled data and rigorously embedding physical laws into learning processes. This significantly lowers the cost and expands the feasibility of deploying machine learning in scientific and engineering contexts where obtaining labeled data is challenging or expensive.

Practical Implications: The methodology enables efficient real-time predictions for complex systems without pre-computed labeled data, facilitating applications such as optimization, design exploration, and risk assessment under uncertainty. The incorporation of physical constraints ensures that model predictions remain physically plausible, a key requirement for reliability in scientific and engineering applications.

Future Developments: Several avenues for future work are identified. These include extending physics-constrained surrogates to dynamic systems, improving model generalization under unfamiliar conditions via fine-tuning or meta-learning approaches, and integrating partial simulation data with known physics for enhanced data efficiency. Further developing and scaling flow-based generative models to higher dimensions and improving the expressiveness of FC-NNs to better capture multiscale features of solutions are also suggested.

In conclusion, the research presented in this paper aligns machine learning with fundamental physical principles, proposing a powerful approach to address high-dimensional surrogate modeling and uncertainty quantification without the prerequisite of labeled data. This could significantly impact various fields requiring robust and efficient surrogate models underpinned by strong physical realism.