Papers
Topics
Authors
Recent
Search
2000 character limit reached

Decoupled Latent Optimization of Diffusion Models for Full Waveform Inversion

Published 12 Jun 2026 in cs.LG | (2606.14139v1)

Abstract: Full waveform inversion (FWI) recovers subsurface velocity from seismic recordings by solving a severely ill-posed, nonconvex PDE-constrained optimization. Classical regularizers stabilize the inversion but fail to reproduce realistic geological structures; recent diffusion-prior methods improve realism at the cost of a fragile trade-off between data fidelity and prior consistency. We propose Decoupled Latent Optimization (DLO), which relaxes the standard latent-optimization formulation into a quadratic-penalty objective over an auxiliary physical variable and a latent variable. The data-fidelity gradient acts in physical space, the diffusion sampler contributes only through a decoded prior sample, and the standard smoothed-velocity initialization of classical FWI is preserved. On the OpenFWI benchmark, DLO outperforms classical regularizers and existing diffusion-based methods under clean, noisy, and missing-trace acquisitions. The prior, trained on 70*70 OpenFWI models, transfers directly to the Marmousi and Overthrust benchmarks, where DLO recovers intricate fault structures and remains robust to initialization smoothing and measurement noise.

Authors (2)

Summary

  • The paper proposes a quadratic-penalty strategy to decouple physical data fidelity from diffusion-based prior guidance.
  • It integrates PDE-constrained inversion with pretrained DDPM priors to achieve improved reconstruction accuracy and effective uncertainty quantification.
  • Experiments on OpenFWI and benchmarks like Marmousi and SEG/EAGE Overthrust confirm DLO’s robustness under noisy, incomplete, and out-of-distribution conditions.

Decoupled Latent Optimization of Diffusion Models for Full Waveform Inversion

Introduction and Motivation

This paper introduces Decoupled Latent Optimization (DLO), a framework that leverages pretrained diffusion models as priors for Full Waveform Inversion (FWI), a high-dimensional, ill-posed PDE-constrained inverse problem ubiquitous in seismic imaging. Classical regularizers such as Tikhonov and total variation (TV) offer stabilization but lack the expressive power to recover realistic geological structures, while recent advances in diffusion-prior regularization for PDE inverse problems have shown promise yet suffer from inconsistent data/prior trade-offs and optimization fragility. DLO proposes a principled relaxation of latent optimization: it introduces a quadratic-penalty scheme over an auxiliary physical variable and a latent variable, allowing the explicit decoupling of data-fidelity optimization (operating strictly in physical space) from guidance by the diffusion-generated prior.

Background: FWI and Diffusion Model Regularization

Standard FWI seeks to minimize the discrepancy between observed and simulated seismic data by solving a highly nonconvex optimization problem, with gradients typically obtained via the adjoint-state method. Due to nonuniqueness and sensitivity to initialization, regularization is essential but classical approaches are insufficient for capturing subsurface complexities evident in geological structures. Diffusion models, notably DDPMs, score-based SDEs, and their discretizations (e.g., DDIM), have emerged as high-quality priors with strong generative and generalization capacity, especially in image-based inverse problems. However, prior approaches embedding diffusion priors into FWI either perform constraint injection directly into the noisy sampling trajectory (risking inconsistency between physics and prior) or add diffusion-based regularization terms with only indirect influence on solution fidelity. Approaches optimizing in the diffusion latent space exclusively (e.g., D-Flow, DMPlug) suffer from optimization instability and forgo the advantages of physical-space initialization, both of which are crucial for highly nonlinear, PDE-governed settings such as FWI. Figure 1

Figure 1: Forward modeling and FWI inversion pipeline illustrating the wave equation with surface sources and simulated seismic surveys.

DLO Formulation

DLO recasts the latent optimization objective as a quadratic-penalty relaxation: rather than enforcing v=Rθ(z)v = R_\theta(z) as a hard constraint (with RθR_\theta the deterministic DDIM sampler for the latent zz), the objective is

minv,z  F(v)dobs2+λvRθ(z)2,\min_{v,\,z}\;\|\mathcal{F}(v) - \mathbf{d}_\text{obs}\|^2 + \lambda \|v - R_\theta(z)\|^2,

where F\mathcal{F} is the forward PDE operator and the penalty hyperparameter λ>0\lambda > 0 balances data fidelity and prior consistency. This decoupling admits alternating minimization: gradients for vv are computed via adjoint-state methods (not requiring backpropagation through the sampler chain), and the zz-updates are performed via gradient descent through the DDIM chain, using automatic differentiation. This approach preserves the advantages of classical smoothed-model initialization and avoids the instability of hard prior projection or backpropagation through deep PDE-sampler compositions. Figure 2

Figure 2: DLO algorithm overview with the physical and prior velocity fields jointly evolved via alternating minimization.

Convergence Analysis

Theoretical analysis confirms that as λ\lambda \to \infty, minimizers of the relaxed penalty formulation converge to minimizers of the original constrained problem, recovering strict prior consistency in the limit. Moreover, for finite (well-chosen) λ\lambda, DLO achieves a trade-off between numerical stability and prior adherence. The alternating block-coordinate minimization framework is robust to inexact updates, and the decoupling enables efficient application of the adjoint-state method for PDE gradients.

Experimental Results

OpenFWI Benchmarks

DLO is systematically benchmarked on OpenFWI, spanning four geologically distinct data families (FV-B, FF-B, CV-B, CF-B). For each, an independent DDPM prior is trained and reused (rather than finetuned). Comparisons include unregularized FWI, Tikhonov regularized, TV regularized FWI, and state-of-the-art diffusion-based baselines (DiffusionFWI and RED-DiffEq). Figure 3

Figure 3: Ground truth (left) vs diffusion-generated prior samples (right) across four OpenFWI dataset families.

DLO achieves superior or competitive performance under clean, noisy, and highly incomplete (missing-trace) data acquisition scenarios. Numerical results indicate that DLO consistently outperforms classical regularizers and matches or exceeds advanced diffusion-based regularization methods according to RMSE, MAE, and SSIM. Under extreme noise or acquisition sparsity, DLO maintains structural fidelity and avoids spurious artifacts prevalent in baseline results. Figure 4

Figure 4: Clean-data OpenFWI reconstruction comparisons showing faithful recovery of fine structure with DLO.

Figure 5

Figure 5: Quantitative comparison of normalized MAE, RMSE, and SSIM for all methods across OpenFWI families.

Robustness and Generalization

DLO demonstrates strong robustness to measurement noise and missing traces. Figure 6

Figure 6

Figure 6: DLO and baseline reconstructions under increasing Gaussian noise, showing maintenance of high-fidelity with DLO.

Figure 7

Figure 7

Figure 7: Reconstructions with increasing receiver removal; DLO's prior preserves geologically plausible results even with heavily subsampled data.

Uncertainty Quantification

Owing to its stochastic latent initializations, DLO enables natural ensemble-based uncertainty quantification. Pixelwise variability across latent initializations correlates strongly with prediction error, localizing model uncertainty to geologically ambiguous regions (often corresponding to faults or poorly illuminated subsurface zones). Figure 8

Figure 8: DLO ensemble sensitivity to latent initialization, highlighting uncertainty localization at faults.

Figure 9

Figure 9: Correlation between per-pixel ensemble standard deviation and absolute reconstruction error.

Large-Scale Out-of-Distribution Generalization

DLO is validated on Marmousi and SEG/EAGE Overthrust benchmarks—both out-of-distribution from the OpenFWI training set (notably for larger spatial domains and increased structural complexity). Without retraining, DLO delivers high-fidelity velocity inversions, capturing intricate faulting and stratigraphy, and maintains superiority over all tested baselines across varying initialization smoothness and increasing additive noise levels. Figure 10

Figure 10: DLO reconstructions on Marmousi, showing high-resolution recovery of complex geological features.

Figure 11

Figure 11: Qualitative Overthrust results; DLO preserves stratigraphic and fault complexity unseen in training.

Figure 12

Figure 12

Figure 12: Robustness sweep for Marmousi: DLO remains superior across varying initialization smoothness and noise.

Figure 13

Figure 13

Figure 13: Overthrust robustness: DLO exhibits minimal performance degradation with increased measurement noise.

Implementation

DDPM priors are trained per-family with a U-Net (as in Hugging Face Diffusers) and a standard VP linear beta schedule. The physical forward operator RθR_\theta0 is a fourth-order finite-difference acoustic wave solver with a realistic acquisition geometry. DLO alternates Adam updates for RθR_\theta1 (velocity field) and RθR_\theta2 (latent) variables over 300 iterations. The computational overhead of DLO is modest: per-iteration runtime is RθR_\theta32.2x standard FWI iteration, with moderate GPU memory consumption. No tuning of priors or re-initialization is performed for "transfer" experiments on Marmousi or Overthrust, demonstrating prior robustness.

Implications and Future Outlook

DLO presents a structured methodology for integrating high-fidelity diffusion-model priors with PDE-constrained inverse problems, offering improved inversion accuracy, robust uncertainty quantification, and substantial resilience to noise, incomplete acquisition, and initialization. The approach naturally admits extension to other high-dimensional inverse problems governed by physical constraints when pretrained generative priors are accessible. Notably, DLO preserves the practical advantages of physical-space initialization, which is essential in exploratory geophysics and other engineering contexts.

Future extensions include scaling to three-dimensional geometries, integration with more expressive or hybrid priors, adaptive penalty annealing, and application to additional scientific imaging modalities. DLO’s decoupling strategy may inform the design of general algorithms for combining generative models with physical inversion beyond FWI settings.

Conclusion

DLO establishes a new standard for physics-guided, generative-model-regularized FWI, consistently outperforming both classical and contemporary diffusion-based methods across challenging scenarios and demonstrating strong generalization without retraining. Its decoupled penalty approach bridges the gap between strict prior-consistency and physically-informed data optimization, achieving high-fidelity subsurface reconstructions critical for seismic imaging. The methodology is broadly applicable in the context of PDE-inverse problems with learnable priors, with clear theoretical justification and demonstrated practical effectiveness. Figure 14

Figure 14: Diffusion prior pretraining curve illustrating consistent noise-prediction loss convergence across training and validation.

Paper to Video (Beta)

No one has generated a video about this paper yet.

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Collections

Sign up for free to add this paper to one or more collections.

Tweets

Sign up for free to view the 1 tweet with 10 likes about this paper.