Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 174 tok/s
Gemini 2.5 Pro 51 tok/s Pro
GPT-5 Medium 38 tok/s Pro
GPT-5 High 34 tok/s Pro
GPT-4o 91 tok/s Pro
Kimi K2 205 tok/s Pro
GPT OSS 120B 438 tok/s Pro
Claude Sonnet 4.5 36 tok/s Pro
2000 character limit reached

Intermediate Sinkhorn Iterations

Updated 28 October 2025
  • Intermediate Sinkhorn iterations are a sequence of partial algorithmic updates that progress toward the entropy-regularized optimal transport solution while revealing continuous flow dynamics.
  • They bridge discrete matrix scaling steps with continuum limits, exemplified by parabolic Monge-Ampère equations and mirror gradient flows.
  • The iterations offer statistically optimal and computationally efficient approximations that underpin adaptive stopping criteria and error bounds in stochastic control tasks.

Intermediate Sinkhorn iterations are the sequence of partial updates generated by the Sinkhorn (or iterative proportional fitting) algorithm before convergence to its regularized optimal transport solution. These iterates play a vital role in both the analysis and practical computation of entropy-regularized optimal transport (OT), matrix scaling, and related stochastic control problems. Theoretical and computational studies have revealed that, under various regimes, intermediate Sinkhorn iterates exhibit structured dynamical properties, fast convergence, statistical optimality, and even admit continuum limits that connect to parabolic Monge-Ampère equations and mirror gradient flows in Wasserstein space.

1. Dynamical Trajectories of Sinkhorn Iterates

At each iteration, the Sinkhorn algorithm alternately updates scalings to approach prescribed marginals—yielding a sequence of couplings or scaling vectors whose marginals or potentials evolve discretely toward OT equilibrium. In the canonical entropic OT problem,

minπΠ(μ,ν)12xy2dπ(x,y)+εKL(πμν)\min_{\pi \in \Pi(\mu, \nu)} \int \tfrac12\|x-y\|^2\,d\pi(x,y) + \varepsilon\,\mathrm{KL}(\pi\mid \mu\otimes\nu)

the sequence of XX-marginals (or projections) from iterated Sinkhorn updates forms a discrete trajectory (ρkε)(\rho_k^\varepsilon). As the entropic regularization parameter ε0\varepsilon \to 0 and iteration count scales as kt/εk \sim t/\varepsilon, this sequence fills out an absolutely continuous curve in Wasserstein space P2(Rd)\mathcal{P}_2(\mathbb{R}^d) termed the Sinkhorn flow (Deb et al., 2023). The continuous limit is characterized by a continuity equation

tρt+(ρtvt)=0,\frac{\partial}{\partial t}\rho_t + \nabla\cdot(\rho_t v_t) = 0,

where the velocity field vtv_t is determined by the dual potential structure (the mirror coordinate, related to the Brenier map): vt(x)=(2ut(x))1x(f(x)+logρt(x)),v_t(x) = -(\nabla^2 u_t(x))^{-1} \nabla_x(f(x) + \log \rho_t(x)), with utu_t a convex potential such that (ut)#ρt=ν(\nabla u_t)_\#\rho_t = \nu.

2. Parabolic Monge-Ampère Equations and Continuum Limits

Intermediate Sinkhorn iterates, in suitable scaling limits, approximate the time-discretized solution to nonlinear parabolic equations of Monge-Ampère type. For quadratic cost and sufficiently regular data, the evolution of the potential is governed by

tut(x)=f(x)g(ut(x))+logdet2ut(x),\frac{\partial}{\partial t} u_t(x) = f(x) - g(\nabla u_t(x)) + \log\det \nabla^2 u_t(x),

where ff and gg are the negative log-densities of source and target. This equation, identified in various works (Berman, 2017, Deb et al., 2023), links the dynamics of Sinkhorn’s dual updates to geometric flows (e.g., Ricci-Kähler flow for the torus case). Practically, after mktm \sim k t iterations, the interpolated Sinkhorn potentials approximate the PDE solution ut(x)u_t(x) to within O(1/k)O(1/k), exhibiting uniform and explicit error bounds (Berman, 2017).

3. Statistical and Computational Properties of Intermediate Iterates

Intermediate iterates are not merely theoretical constructs: they are statistically and algorithmically optimal for a wide class of downstream tasks. In Schrödinger bridge estimation from finite samples, the squared total variation error between the law defined by kk Sinkhorn steps and the ideal bridge decomposes as O(1/m+1/n+r4k)O(1/m + 1/n + r^{4k}), where mm, nn are the sample sizes and r(0,1)r \in (0,1) a function of regularization and domain (Maeda et al., 26 Oct 2025). The first two terms are statistical estimation error floors; only the third is optimized by further Sinkhorn updates, and it decays exponentially. This provides principled guidance for early stopping: computational effort in increasing kk beyond

klog(1/m+1/n)4logrk \gtrsim \frac{\log(1/m + 1/n)}{4\log r}

is wasted, since further error is limited by data availability.

For matrix scaling, intermediate Sinkhorn steps converge in error metrics such as 1\ell_1, 2\ell_2 and KL-divergence at rates that are explicitly characterized (Chakrabarty et al., 2018, He, 13 Jul 2025). In dense matrices (γ>1/2\gamma > 1/2), error decreases logarithmically in nn and 1/ε1/\varepsilon, with upper complexity O(lognlogε)O(\log n - \log \varepsilon) iterations; for sparse matrices (γ<1/2\gamma < 1/2), convergence is polynomially slower, demonstrating a sharp phase transition (He, 13 Jul 2025). These rates allow certification of the accuracy of intermediate solutions without full convergence.

4. Analytical Structure and Flow Interpretations

Under the vanishing regularization limit (ε0\varepsilon \to 0), intermediate Sinkhorn iterates fill out a continuous "mirror flow" in Wasserstein space, minimizing the functional F(ρ)=KL(ρef)F(\rho) = \mathrm{KL}(\rho\,||\,e^{-f}) with mirror 12W22(ρ,ν)\tfrac12 W_2^2(\rho, \nu). The velocity norm along this flow corresponds to the metric derivative in the linearized optimal transport (LOT) metric, not in the canonical W2W_2 distance. The norm is given as

vtL2(ρt)=limδ01δLOTeg(ρt+δ,ρt),\|v_t\|_{L^2(\rho_t)} = \lim_{\delta \to 0} \frac{1}{\delta}\, \mathrm{LOT}_{e^{-g}}(\rho_{t+\delta}, \rho_t),

where LOTeg2(ρt+δ,ρt)=Tt+δ(y)Tt(y)2dν(y)\mathrm{LOT}^2_{e^{-g}}(\rho_{t+\delta}, \rho_t) = \int |T_{t+\delta}(y) - T_t(y)|^2\, d\nu(y) and TtT_t is the optimal transport map from ege^{-g} to ρt\rho_t (Deb et al., 2023). This positions the path of Sinkhorn iterates as geodesic in LOT geometry.

The flow admits an associated McKean-Vlasov stochastic differential equation

dXt=(fut(Xt)gut(Xtut)+htut(Xt))dt+2(2ut(Xt))1dBt,dX_t = \left( -\frac{\partial f}{\partial u_t}(X_t) - \frac{\partial g}{\partial u_t}(X_t^{u_t}) + \frac{\partial h_t}{\partial u_t}(X_t) \right) dt + \sqrt{2 (\nabla^2 u_t(X_t))^{-1}}\, dB_t,

whose marginals evolve according to Sinkhorn flow, revealing links to nonlinear diffusions and stochastic control.

5. Exponential Convergence and Regularity Conditions

Exponential convergence of intermediate iterates to the exact regularized OT or Schrödinger bridge solution holds under regularity and convexity assumptions. Essential conditions include:

KL(ρef)12αI(ρef),\mathrm{KL}(\rho\|e^{-f}) \le \frac{1}{2\alpha} I(\rho|e^{-f}),

where II is the relative Fisher information.

  • Uniform lower bounds on the determinant of Hessians of the intermediate potentials, infx,tdet2ut(x)c0>0\inf_{x,t} \det \nabla^2 u_t(x) \geq c_0 > 0.

These guarantee exponential decay in KL divergence,

KL(ρtef)KL(ρ0ef)ect,\mathrm{KL}(\rho_t\|e^{-f}) \leq \mathrm{KL}(\rho_0\|e^{-f}) e^{-ct},

and thus in Wasserstein distance W2(ρt,ef)=O(ect/2)W_2(\rho_t,e^{-f}) = O(e^{-ct/2}), for some c>0c>0 (Deb et al., 2023). Pointwise exponential rates also hold for iterates and their gradients on the torus, with constants tied to geometric parameters (Greco et al., 2023).

6. Applications and Broader Implications

Intermediate Sinkhorn iterates are practically significant as computationally efficient, provably accurate, and statistically optimal approximations. They inform solver design for high-dimensional OT, dynamic stochastic control, Schrödinger bridge problems, and generative modeling. The structural convergence results justify the adoption of early stopping or adaptive iteration schedules, especially in large-scale and sample-based scenarios, and underpin the analysis of variance reduction, numerical stability, and computational cost. By connecting discrete iterates to known geometric PDEs, mirror descent flows, and stochastic dynamics, this body of work reveals the foundational role of intermediate Sinkhorn iterates in both the analysis and practical implementation of entropic regularization and OT-based algorithms.


Conceptual Focus Main Result or Formula
Sinkhorn flow (scaling limit) ρtt+(ρtvt)=0\frac{\partial \rho_t}{\partial t} + \nabla\cdot(\rho_t v_t) = 0, vt=(2ut)1(f+logρt)v_t = -(\nabla^2 u_t)^{-1} \nabla(f + \log \rho_t), (ut)#ρt=ν(\nabla u_t)_\#\rho_t = \nu
Parabolic Monge-Ampère evolution utt(x)=f(x)g(ut(x))+logdet2ut(x)\frac{\partial u_t}{\partial t}(x) = f(x) - g(\nabla u_t(x)) + \log\det \nabla^2 u_t(x)
LOT metric derivative vtL2(ρt)=limδ01δLOTeg(ρt+δ,ρt)\|v_t\|_{L^2(\rho_t)} = \lim_{\delta\to 0} \frac{1}{\delta} \mathrm{LOT}_{e^{-g}}(\rho_{t+\delta},\rho_t)
Statistical error bound (Schr. bridge) E[TV2(P,,[0,τ],Pm,nk,[0,τ])]=O(1m+1n+r4k)\mathbb{E}\left[\mathrm{TV}^2(P_{\infty,\infty}^{*,[0,\tau]}, P_{m,n}^{k,[0,\tau]})\right] = O\left(\frac{1}{m}+\frac{1}{n}+r^{4k}\right), r<1r<1, kk= #Sinkhorn
Exponential convergence (KL) KL(ρtef)KL(ρ0ef)ect\mathrm{KL}(\rho_t \| e^{-f}) \le \mathrm{KL}(\rho_0 \| e^{-f}) e^{-ct}
McKean-Vlasov (Sinkhorn diffusion) dXt=(fut(Xt)gut(Xtut)+htut(Xt))dt+2(2ut(Xt))1dBtdX_t = \left(-\frac{\partial f}{\partial u_t}(X_t) - \frac{\partial g}{\partial u_t}(X_t^{u_t}) + \frac{\partial h_t}{\partial u_t}(X_t)\right)dt + \sqrt{2(\nabla^2 u_t(X_t))^{-1}}\,dB_t

In summary, intermediate Sinkhorn iterations constitute a rich mathematical object, governing the transient dynamics between arbitrary initialization and regularized OT equilibrium, and their scaling and continuum limits elucidate deep geometric, analytic, and statistical properties of entropic optimal transport algorithms (Deb et al., 2023, Maeda et al., 26 Oct 2025, Berman, 2017, Greco et al., 2023, He, 13 Jul 2025).

Forward Email Streamline Icon: https://streamlinehq.com

Follow Topic

Get notified by email when new papers are published related to Intermediate Sinkhorn Iterations.