Schrödinger Bridge Problem
- Schrödinger Bridge is a framework that finds the most likely stochastic evolution between two endpoint distributions via KL divergence minimization.
- It connects stochastic control, entropy-regularized optimal transport, and diffusion processes, with explicit Gaussian solutions available in special cases.
- Algorithmic methods such as Fortet–Sinkhorn iterations and mirror descent enable effective high-dimensional generative modeling and robust statistical estimation.
The Schrödinger Bridge (SB) problem occupies a central role at the intersection of stochastic control, optimal transport, and information theory. Conceived originally in the context of large deviations and statistical physics, the SB problem seeks the most likely stochastic evolution (path measure) that matches two prescribed endpoint distributions, given a prior random evolution, by minimizing Kullback–Leibler divergence to the prior. Recent advances have propelled SB to the forefront of modern machine learning and generative modeling, both for its mathematical richness and its algorithmic connections to diffusion models, flow matching, and entropy-regularized optimal transport.
1. Problem Formulation and Duality
Given two probability distributions $\mu_0$ and $\mu_1$ on $\mathbb{R}^d$, and a reference path measure $Q$ (typically a reversible linear diffusion on $\mathbb{R}^d$), the dynamic SB problem is
$$P^\star \;=\; \operatorname*{arg\,min}_{P\,:\,P_0=\mu_0,\;P_1=\mu_1} \mathrm{KL}\bigl(P \,\|\, Q\bigr),$$
where $P$ ranges over all path measures with time-$0$ and time-$1$ marginals given by $\mu_0$ and $\mu_1$ (Bunne et al., 2022, Gushchin et al., 2023).
Through Girsanov's theorem and Fokker–Planck analysis, this dynamic optimization is equivalent to a stochastic control problem that minimizes expected control energy while enforcing the marginal constraints, and to the static entropy-regularized optimal transport (EOT) problem
$$\pi^\star \;=\; \operatorname*{arg\,min}_{\pi \in \Pi(\mu_0,\mu_1)} \int \tfrac{1}{2}\|x-y\|^2 \, d\pi(x,y) \;+\; \varepsilon\, \mathrm{KL}\bigl(\pi \,\|\, \mu_0 \otimes \mu_1\bigr),$$
where $\varepsilon$ denotes the entropic regularization parameter corresponding to the variance of the reference diffusion (Gushchin et al., 2023, Bunne et al., 2022). The equivalence between the dynamic (path-space) and static (coupling) SB problems is well established: the optimal path measure $P^\star$ has endpoint coupling matching the EOT plan $\pi^\star$, and, conditionally on its endpoints, $P^\star$ is a bridge of the reference process.
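The stochastic-control form mentioned above can be written as follows (a standard reformulation with generic notation, not transcribed from a specific cited paper): with $b$ the reference drift, $\sigma$ the diffusivity, and $\mu_0$, $\mu_1$ the endpoint marginals,

```latex
\min_{u}\;\; \mathbb{E}\!\left[\int_0^1 \frac{1}{2\sigma^2}\,\|u_t\|^2 \, dt\right]
\quad \text{subject to} \quad
dX_t = \bigl(b(X_t,t) + u_t\bigr)\,dt + \sigma\, dB_t,
\qquad X_0 \sim \mu_0,\;\; X_1 \sim \mu_1.
```

Girsanov's theorem identifies the expected control energy with the KL divergence of the controlled path measure from the reference, which is why this problem and the path-space KL minimization share the same optimizer.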
The solution is characterized by forward and backward Schrödinger potentials (or scaling functions), which solve coupled Kolmogorov equations or, equivalently, nonlinear Fredholm integral (Sinkhorn-like) equations.
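On discrete samples the Sinkhorn-like fixed point for the scaling functions can be sketched as follows (a minimal self-contained illustration with my own variable names, not code from the cited works):

```python
import numpy as np

def sinkhorn_potentials(x0, x1, eps, n_iter=2000):
    """Iterate the Sinkhorn scaling equations for the Schrodinger potentials
    on empirical marginals, returning the entropic coupling matrix."""
    n, m = len(x0), len(x1)
    # Gibbs kernel of the quadratic cost (Brownian reference)
    C = np.sum((x0[:, None, :] - x1[None, :, :]) ** 2, axis=-1) / 2.0
    K = np.exp(-C / eps)
    a, b = np.full(n, 1.0 / n), np.full(m, 1.0 / m)  # uniform marginal weights
    u, v = np.ones(n), np.ones(m)                    # scaling functions
    for _ in range(n_iter):
        u = a / (K @ v)       # enforce the first marginal
        v = b / (K.T @ u)     # enforce the second marginal
    return u[:, None] * K * v[None, :]               # entropic coupling

rng = np.random.default_rng(0)
x0 = 0.5 * rng.normal(size=(30, 2))
x1 = 0.5 * rng.normal(size=(40, 2)) + 0.5
P = sinkhorn_potentials(x0, x1, eps=1.0)
```

The converged coupling `P` approximates the EOT plan; its row and column sums reproduce the prescribed (here uniform) marginals, which is exactly the fixed-point condition on the potentials.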
2. Closed-Form Solutions: Gaussian Case and Riemannian Geometry
For Gaussian marginals $\mu_i = \mathcal{N}(m_i, \Sigma_i)$, $i \in \{0,1\}$, and linear reference diffusions $dX_t = c\,X_t\,dt + \sigma\,dB_t$ with drift coefficient $c$ and volatility $\sigma$, the SB solution remains within the class of Gaussian Markov processes. The time-dependent mean $m_t$ and covariance $\Sigma_t$ admit explicit formulas whose interpolation weights are explicit scalar functions of $c$, $\sigma$, and $t$, and in which $C_\sigma$ denotes the entropic-OT coupling cross-covariance (Bunne et al., 2022). The time-dependent drift is given in closed form in terms of these quantities, making simulation via SDE integrators (e.g., Euler–Maruyama) or transition kernels tractable.
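For the Brownian special case (zero reference drift, regularization $\varepsilon$), the entropic coupling cross-covariance has the well-known closed form $C_\varepsilon = \tfrac{1}{2}\Sigma_0^{1/2} D_\varepsilon \Sigma_0^{-1/2} - \tfrac{\varepsilon}{2} I$ with $D_\varepsilon = \bigl(4\,\Sigma_0^{1/2}\Sigma_1\Sigma_0^{1/2} + \varepsilon^2 I\bigr)^{1/2}$. A numpy sketch of this formula (my own implementation, not the cited authors' code; other papers parameterize the regularization differently, so constants may differ by a factor):

```python
import numpy as np

def sqrtm_psd(A):
    """Symmetric PSD matrix square root via eigendecomposition."""
    w, V = np.linalg.eigh(A)
    return (V * np.sqrt(np.clip(w, 0.0, None))) @ V.T

def entropic_gaussian_coupling(S0, S1, eps):
    """Cross-covariance of the entropic OT plan between N(0, S0) and N(0, S1)."""
    R0 = sqrtm_psd(S0)
    R0_inv = np.linalg.inv(R0)
    D = sqrtm_psd(4.0 * R0 @ S1 @ R0 + eps**2 * np.eye(len(S0)))
    return 0.5 * R0 @ D @ R0_inv - 0.5 * eps * np.eye(len(S0))

rng = np.random.default_rng(1)
A, B = rng.normal(size=(3, 3)), rng.normal(size=(3, 3))
S0, S1 = A @ A.T + np.eye(3), B @ B.T + np.eye(3)
C = entropic_gaussian_coupling(S0, S1, eps=0.5)
# Joint covariance of the coupling: [[S0, C], [C.T, S1]] must be PSD
J = np.block([[S0, C], [C.T, S1]])
```

As $\varepsilon \to 0$ this recovers the Monge cross-covariance of the deterministic OT map; as $\varepsilon \to \infty$ it vanishes, i.e., the coupling becomes independent.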
These closed forms follow from a Riemannian-geometric reduction: restricting Otto's infinite-dimensional Wasserstein calculus to Gaussian densities identifies the covariance path $(\Sigma_t)_{t \in [0,1]}$ with a geodesic on the manifold of symmetric positive-definite matrices, endowed with the Bures–Wasserstein metric. The SB action becomes a finite-dimensional Lagrangian with an explicit Euler–Lagrange (matrix ODE) system, integrable in terms of Lyapunov operators and generator theory for Gaussian Markov processes (Bunne et al., 2022).
3. Computational Methods and Algorithmics
Numerous computational strategies for the SB problem have emerged, each leveraging distinct aspects of its structure:
- Fortet–Sinkhorn Iteration: For continuous or sample-based marginals, one iterates nonlinear updates for the Schrödinger potentials, which are provably contractive in Hilbert's projective metric (Pavon et al., 2018, Teter et al., 2023).
- Iterative Markovian Fitting (IMF): Alternates between Markovian and reciprocal projections in path-space. For each step, the Markov projection is implemented by fitting a neural drift via supervised regression (bridge-matching) on simulated endpoints, while the reciprocal projection replaces process bridges by conditioned reference paths (Shi et al., 2023, Howard et al., 20 Jun 2025).
- Data-Driven and Score-Based Approaches: When only samples of $\mu_0$, $\mu_1$ are available, data-driven SB alternates constrained MLE steps and importance sampling for potential propagation (Pavon et al., 2018). Score matching, in both continuous and discrete time, is central to empirical SB and its unification with score-based generative models (SGMs) (Chen et al., 2021, Jing et al., 22 Mar 2025, Tang et al., 25 Aug 2025).
- Mirror Descent and Simulation-Free Variants: Recent advances recast Sinkhorn or SB as variational online mirror descent (OMD) in measure space with regret and convergence guarantees. Approximations over a Gaussian mixture variational family yield simulation-free, robust procedures (Han et al., 3 Apr 2025).
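The reciprocal projection used in IMF can be illustrated with Brownian bridges: given endpoint pairs from the current coupling, intermediate points are drawn from the pinned reference process, and these draws serve as regression targets for bridge matching. A toy sketch under a Brownian reference with diffusivity `sigma` (my own illustration, not the cited authors' code):

```python
import numpy as np

def sample_bridge(x0, x1, t, sigma, rng):
    """Sample X_t from a Brownian bridge pinned at (x0, x1), for t in (0, 1).

    The pinned reference has mean (1-t)*x0 + t*x1 and variance sigma^2 * t * (1-t);
    the resulting (x_t, t) pairs are the supervised targets for drift regression.
    """
    mean = (1.0 - t) * x0 + t * x1
    std = sigma * np.sqrt(t * (1.0 - t))
    return mean + std * rng.normal(size=np.shape(x0))

rng = np.random.default_rng(0)
x0 = np.zeros((5000, 2))          # endpoint pairs from the current coupling
x1 = np.ones((5000, 2))
xt = sample_bridge(x0, x1, t=0.3, sigma=1.0, rng=rng)
```

The Markov projection then fits a neural drift to these targets; the sketch above only shows the sampling half of one IMF step.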
A summary of algorithmic categories and features is provided below:
| Method | Core Operation | Notable Features |
|---|---|---|
| Fortet–Sinkhorn | Coupled potential scaling | Classical, contractive, simple |
| Data-driven SB | Sample-based MLE/regression | High-dimensional, sample efficient |
| IMF / DSBM | Markov/reciprocal path fits | Compatible with deep nets |
| OMD / VMSB | Mirror descent in coupling space | No simulation, robust to noise |
Simulation of the SB process in practice leverages the Markov-Gaussian property (explicit transition kernels) or SDE integration (time-dependent drift), with initialization via generative diffusion model fits yielding significant empirical gains (Bunne et al., 2022, Tang et al., 25 Aug 2025).
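SDE integration with a time-dependent drift can be sketched with Euler–Maruyama (a generic illustration; the drift below is the Brownian-bridge drift toward a pinned endpoint, the degenerate SB case with a Dirac target, not the general closed-form SB drift):

```python
import numpy as np

def euler_maruyama(drift, x0, sigma, n_steps, rng):
    """Integrate dX = drift(X, t) dt + sigma dB on [0, 1] with Euler-Maruyama."""
    dt = 1.0 / n_steps
    x = np.array(x0, dtype=float)
    for k in range(n_steps):
        t = k * dt
        x = x + drift(x, t) * dt + sigma * np.sqrt(dt) * rng.normal(size=x.shape)
    return x

# Sanity check: with a Dirac target, the SB w.r.t. Brownian motion is the
# Brownian bridge, whose drift steers every path to the pinned endpoint.
target = 2.0
bridge_drift = lambda x, t: (target - x) / (1.0 - t)
rng = np.random.default_rng(0)
x1 = euler_maruyama(bridge_drift, np.zeros(1000), sigma=1.0, n_steps=400, rng=rng)
```

All simulated paths land near the target at time $1$, up to discretization noise of order $\sigma\sqrt{\Delta t}$.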
4. Relation to Entropic OT, Diffusion Models, and Extensions
The SB problem provides the dynamic (path-space) analogue of static entropic OT. The zero-noise limit ($\varepsilon \to 0$) recovers the classical $2$-Wasserstein geodesic; positive $\varepsilon$ yields a stochastic interpolation that smooths out the deterministic OT path (Bunne et al., 2022, Gushchin et al., 2023).
Connections to generative modeling are especially pronounced:
- Learning SDE drifts by likelihood or score matching in SB generalizes SGM training objectives. In unconstrained limits, SB reduces to standard SGMs (Chen et al., 2021, Jing et al., 22 Mar 2025).
- Pretraining with score-based diffusion models or initializing via denoiser flows improves SB training speed and accuracy (Tang et al., 25 Aug 2025).
Generalizations include:
- Multi-marginal SB: Enforcing constraints at multiple times, with efficient algorithms (MSBM) that compose local solution steps (Park et al., 18 Oct 2025).
- Tree-structured SB and Barycenters: Generalization to trees (e.g., for Wasserstein barycenter computation) via edge-wise decoupled SB steps (Howard et al., 20 Jun 2025).
- Soft-constrained SB (SSB): KL-penalized endpoint constraints yielding solutions that interpolate between reference and target (Garg et al., 2024).
- Topological SB: SB formulated over graphs and simplicial complexes, with Laplacian-driven reference processes and neural drift parametrizations exploiting graph topology (Yang, 7 Apr 2025).
- Discrete-time and GAN-based SB Matching: Substituting SDEs with learned finite-step transition kernels, compatible with discrete GAN architectures (Gushchin et al., 2024).
5. Theoretical Guarantees, Benchmarks, and Convergence
Theoretical properties of SB formulations and algorithms include:
- Existence and Uniqueness: Under mild integrability, there exists a unique dynamic SB solution matching prescribed marginals, both in path-space and in static (coupling) formulations (Bunne et al., 2022, Gushchin et al., 2023, Howard et al., 20 Jun 2025).
- Contractivity and Convergence: Sinkhorn-type and dynamic Sinkhorn recursions in SB are contractive in Hilbert's projective metric, with explicit contraction coefficients computable from support geometry, controllability Gramian, and noise (Teter et al., 2023).
- Penalty Convergence: CNF-based SB solvers with terminal-constraint penalties provably converge to the SB solution as the penalty weight increases (Jing et al., 22 Mar 2025).
- Mirror Descent Regret: OMD formulations for SB achieve sublinear regret bounds and almost sure convergence in measure space (Han et al., 3 Apr 2025).
- Projection-based Algorithms: IMF and variants admit KL monotonicity and global convergence guarantees by generalized Pythagorean identities and uniqueness of SB as Markov–reciprocal process with correct marginals (Shi et al., 2023, Park et al., 18 Oct 2025).
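Contraction in Hilbert's projective metric can be observed numerically for the Sinkhorn scaling map (a small self-contained check in my own notation; the actual contraction coefficient depends on the kernel geometry as in the cited analysis):

```python
import numpy as np

def hilbert_metric(u, v):
    """Hilbert projective distance between positive vectors."""
    r = np.log(u) - np.log(v)
    return r.max() - r.min()

rng = np.random.default_rng(2)
K = np.exp(-rng.random((40, 40)))            # positive kernel
a = np.full(40, 1.0 / 40)
b = np.full(40, 1.0 / 40)
v, v_prev = np.ones(40), None
dists = []                                   # distances between successive iterates
for _ in range(30):
    u = a / (K @ v)
    v_new = b / (K.T @ u)
    if v_prev is not None:
        dists.append(hilbert_metric(v_new, v))
    v_prev, v = v, v_new
```

The recorded distances decay geometrically, consistent with strict contractivity of each Sinkhorn sweep for a strictly positive kernel.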
Benchmarking studies rely on analytic mixtures, Gaussian benchmarks, and high-dimensional tasks (e.g., SB potentials built from log-sum-exp quadratic functionals). Key metrics include Bures–Wasserstein distances (BW), FID, and conditional errors, emphasizing both transport/geodesic and generative accuracy (Gushchin et al., 2023). Neural methods matched or outperformed classical algorithms on high-dimensional non-Gaussian and image benchmarks, provided sufficient time discretization and careful solver design.
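The BW metric reported in these benchmarks is, for Gaussians, available in closed form: $W_2^2 = \|m_0 - m_1\|^2 + \operatorname{tr}\bigl(\Sigma_0 + \Sigma_1 - 2(\Sigma_0^{1/2}\Sigma_1\Sigma_0^{1/2})^{1/2}\bigr)$. A minimal numpy sketch of this standard formula (my own implementation):

```python
import numpy as np

def bures_wasserstein(m0, S0, m1, S1):
    """2-Wasserstein distance between N(m0, S0) and N(m1, S1)."""
    w, V = np.linalg.eigh(S0)
    R0 = (V * np.sqrt(np.clip(w, 0.0, None))) @ V.T       # S0^{1/2}
    w2 = np.linalg.eigvalsh(R0 @ S1 @ R0)
    cross = np.sqrt(np.clip(w2, 0.0, None)).sum()         # tr (S0^{1/2} S1 S0^{1/2})^{1/2}
    bures2 = np.trace(S0) + np.trace(S1) - 2.0 * cross
    return np.sqrt(max(np.sum((m0 - m1) ** 2) + bures2, 0.0))
```

For example, between $\mathcal{N}(0, I_2)$ and $\mathcal{N}(0, 4 I_2)$ the distance is $\sqrt{2}$, since each standard deviation moves from $1$ to $2$ in two dimensions.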
6. Applications and Limitations
Applications of SB span:
- Generative modeling: Sampling from interpolated or synthetic distributions using explicit SB SDE drift (Bunne et al., 2022, Chen et al., 2021).
- Single-cell genomics: Interpolation between populations, trajectory inference with improved stability over earlier methods (Bunne et al., 2022, Park et al., 18 Oct 2025).
- Topological signal processing: Distribution matching and interpolation on graphs, brain networks, and flow fields (Yang, 7 Apr 2025).
- Benchmarking of OT solvers: Ground-truth analytic construction of EOT/SB plans for neural model evaluation, revealing model performance disparities in complex regimes (high dimension, multimodality, large/small entropic regularization) (Gushchin et al., 2023).
- Barycenter computation: Extension of SB flow-matching to tree-structured multi-marginal OT and entropic Wasserstein barycenters (Howard et al., 20 Jun 2025).
Prominent limitations include:
- Stiffness and instabilities as $\varepsilon \to 0$ (low stochasticity), which can challenge SDE or neural solvers (Gushchin et al., 2023).
- Need for high discretization granularity and many inner-outer iterations for accurate marginal fitting in challenging geometries.
- Parametric flexibility and architecture choice remain essential, with failures reported for approaches relying on insufficient sampling or poor initialization (Gushchin et al., 2023, Tang et al., 25 Aug 2025).
7. Outlook and Current Directions
Ongoing directions for SB research include:
- Improved solver stability and computational scaling (e.g., mirror descent, tree-structured splits, simulation-free variational approaches) for high-dimensional and multi-marginal settings (Han et al., 3 Apr 2025, Park et al., 18 Oct 2025, Howard et al., 20 Jun 2025).
- Integration of pre-trained diffusion models for initialization and transfer, merging the strengths of SGMs and SB (Tang et al., 25 Aug 2025).
- Robustness under data noise, online or streaming updates (variational OMD), and adaptive algorithms for evolving data distributions (Han et al., 3 Apr 2025).
- Extension to non-Euclidean and topological domains, leveraging graph neural architectures and Laplacian-driven diffusions (Yang, 7 Apr 2025).
- Benchmarking and interpretability tools that expose the distinction between sample generation and accurate transport recovery (Gushchin et al., 2023).
- Theoretical analysis of contraction, regularity, and convergence rates, particularly for non-standard noise models and irregular supports (Teter et al., 2023).
The SB problem stands as a mathematically rigorous and algorithmically fertile paradigm bridging optimal transport, diffusion processes, and deep generative modeling. Its recent unification with neural computation and score-based generative models has extended its applicability and deepened its theoretical landscape (Bunne et al., 2022, Shi et al., 2023, Chen et al., 2021, Tang et al., 25 Aug 2025, Gushchin et al., 2023).