Papers
Topics
Authors
Recent
Search
2000 character limit reached

Fisher–Rao Geodesic Optimal Schedule

Updated 3 March 2026
  • Fisher–Rao–Geodesic Optimal Schedule is a method for interpolating probability densities with constant speed under the Fisher–Rao metric.
  • It unifies statistical manifold geometry, optimal transport (WF-δ), and generative model interpolation through explicit closed-form solutions.
  • The approach minimizes the total information length via optimal time reparametrization and offers efficient numerical implementations in imaging and diffusion models.

The Fisher–Rao–Geodesic Optimal Schedule designates any interpolation schedule, time reparametrization, or discretization scheme for probability densities or general nonnegative measures that realizes constant-speed motion (“geodesic”) under the Fisher–Rao metric or related metrics on the space of densities, measures, or measure-valued data. This paradigm unifies the Fisher–Rao geometry of statistical manifolds, nonlinear interpolation on density spaces, geodesic scheduling in generative models such as masked diffusion, the transport–source (“WF-δ”) interpolation between optimal transport and Fisher–Rao, and related structures for matrix-valued measures and varifold metamorphosis. The optimal schedule is defined as the unique parametrization for which the Fisher–Rao metric length is traversed at constant rate, minimizing the total “information length” or squared Fisher–Rao energy, and in all principal geometries of densities admits a closed-form solution.

1. Dynamical Formulation and Geodesic Equations

The Fisher–Rao metric provides a canonical Riemannian structure on the space of positive densities or measures. For general measures ρ0,ρ1\rho_0,\rho_1 on a convex domain ΩRd\Omega\subset\mathbb{R}^d, the interpolation between optimal transport and Fisher–Rao is given by the “WF-δ” functional (Chizat et al., 2015): WFδ2(ρ0,ρ1)=inf(ρ,v,g)1201Ω[v(t,x)2ρ(t,x)+δ2g(t,x)2ρ(t,x)]dxdt\mathrm{WF}_\delta^2(\rho_0,\rho_1) = \inf_{(\rho, v, g)} \frac{1}{2} \int_0^1 \int_\Omega \left[|v(t,x)|^2 \rho(t,x) + \delta^2 |g(t,x)|^2 \rho(t,x)\right]\, dx dt subject to the augmented continuity equation

tρ+x(ρv)=gρ,ρ(0)=ρ0,ρ(1)=ρ1,\partial_t \rho + \nabla_x \cdot (\rho v) = g \rho,\qquad \rho(0)=\rho_0,\, \rho(1)=\rho_1,

with vv the velocity field and gg the source (mass generation) term. Euler–Lagrange analysis yields optimality conditions: M=ρφ,Z=δ2ρφ,tφ+12(φ2+φ2δ2)=0,M = \rho \nabla\varphi,\qquad Z = \delta^2 \rho \varphi,\qquad \partial_t\varphi + \frac{1}{2}(|\nabla\varphi|^2 + \frac{\varphi^2}{\delta^2}) = 0, and the constant Hamiltonian ensures the energy is equidistributed between kinetic and Fisher–Rao components at constant rate. In the pure Fisher–Rao regime (v0v\equiv0), the explicit geodesic schedule is

ρ(t,x)=((1t)ρ0(x)+tρ1(x))2,\rho(t, x) = \left((1-t)\sqrt{\rho_0(x)} + t \sqrt{\rho_1(x)}\right)^2,

which is the globally minimizing Fisher–Rao geodesic (Chizat et al., 2015).

2. Fisher–Rao Geodesics and Optimal Parametrization

The classical Fisher–Rao metric on densities μ\mu on a compact manifold MM is

GμFR(α,β)=Mαμβμμ,G^{\rm FR}_\mu(\alpha, \beta) = \int_M \frac{\alpha}{\mu} \frac{\beta}{\mu} \mu,

where α,β\alpha, \beta are tangent vectors (signed measures with zero total mass for probability densities) (Bruveris et al., 2016). In square-root coordinates f=μf = \sqrt{\mu}, geodesics are straight lines in L2L^2: f(t)=(1t)f0+tf1,μ(t)=[f(t)]2.f(t) = (1-t) f_0 + t f_1, \qquad \mu(t) = [f(t)]^2. The Fisher–Rao distance is f1f0L2\| f_1 - f_0 \|_{L^2}, and the unique optimal schedule is the identity s=ts = t, since the speed in the Fisher–Rao metric is already constant along this path. For non-geodesic families ν(t)\nu(t), the arc-length reparametrization

s(t)=1d(ν(0),ν(1))0tν˙(τ)GFRdτs(t) = \frac{1}{d(\nu(0),\,\nu(1))} \int_0^t \left\| \dot{\nu}(\tau) \right\|_{G^{\rm FR}} d\tau

yields a schedule with constant Fisher–Rao speed (Bruveris et al., 2016), and sμ(t(s))s\mapsto \mu(t(s)) is an arc-length parameterization.

3. Fisher–Rao Optimal Schedules in Discretized and Statistical Models

In discretized and information-theoretic settings, the Fisher–Rao–geodesic schedule governs the reparametrization of time for processes on the statistical manifold of probability distributions. For the 1D family qt(x)q_t(x) of discrete distributions as in masked diffusion models, the Fisher–Rao line element is

ds2=x[tqt(x)]2qt(x)dt2=I(t)dt2,ds^2 = \sum_x \frac{[\partial_t q_t(x)]^2}{q_t(x)} dt^2 = I(t) dt^2,

where I(t)I(t) is the Fisher information in tt (Zhang, 6 Aug 2025). The unique Fisher–Rao geodesic schedule φ(s)\varphi(s) solves

I(φ(s))φ(s)=C,\sqrt{I(\varphi(s))} \varphi'(s) = C,

with solution, for I(t)=Nα˙t2/[αt(1αt)]I(t) = N \dot\alpha_t^2 / [\alpha_t (1-\alpha_t)] and αt\alpha_t monotonic, given by the closed-form “cosine schedule”

αφ(s)=cos2(sπ/2),\alpha_{\varphi(s)} = \cos^2(s\pi / 2),

with the stepwise discretization ti=φ(i/T)t_i = \varphi(i/T) for i=0,,Ti=0, \dotsc, T, ensuring each step has equal Fisher–Rao length (Zhang, 6 Aug 2025).

Alternative heuristics (linear or quadratic schedules in α\alpha or tt) are strictly suboptimal, incurring larger total information length, as shown by their explicit L>Λ(1)L > \Lambda(1) gap in geodesic length. The Fisher–Rao schedule equalizes information-geometric change per step, yielding optimal discrete approximation to the continuous Fisher–Rao geodesic.

4. Generalizations: LpL^p Fisher–Rao Schedules and Non-commutative Extensions

Extensions to LpL^p Fisher–Rao geometries use the pp-root map to globally linearize geodesic flow on the Fréchet Lie group $\Diff^{-\infty}(\mathbb{R})$: Ψp(φ)=p(φ1/p1),\Psi_p(\varphi) = p (\varphi'^{1/p} - 1), yielding constant-speed geodesics f(t)=(1t)f0+tf1f(t) = (1-t)f_0 + t f_1 in LpL^p, with density interpolant

ρt(x)=(φt(x))ρ0(φt(x)).\rho_t(x) = (\varphi_t'(x)) \rho_0(\varphi_t(x)).

For p=2p=2, this recovers the square-root Fisher–Rao geodesic (Lam, 7 Feb 2026). In the non-commutative matrix-valued setting, Fisher–Rao geodesics GtG_t evolve as

Gt=G01/2(G01/2G1G01/2)tG01/2,G_t = G_0^{1/2} (G_0^{-1/2} G_1 G_0^{-1/2})^t G_0^{1/2},

with constant “velocity” in the Fisher–Rao metric and linear interpolation in the logarithmic coordinates. The optimal ε\varepsilon-entropic schedule in Schrödinger-type problems consists in adding Fisher–Rao heat flow near endpoints proportional to h(t)=εmin{t,1t}h(t) = \varepsilon\, \min\{t, 1-t\} (Monsaingeon et al., 2020).

5. Schedules in Metamorphosis and Varifold Metrics

The Fisher–Rao–geodesic schedule arises in the LDDMM–Fisher–Rao framework for varifold metamorphosis, where point masses (Diracs) or geometric measures interpolate via weight scaling and drift: r(t)=((1t)r0+tr1)2,x(t)=x0+(x1x0)t.r(t) = \left((1-t)\sqrt{r_0} + t\sqrt{r_1}\right)^2, \quad x(t) = x_0 + (x_1-x_0)t. Geodesic schedules are explicitly computable in this framework for both spatial and weight evolution, and the optimal schedule achieves constant-speed motion in the composite Riemannian metric (Hsieh et al., 2021).

6. Numerical Schemes and Implementation

Efficient computation of Fisher–Rao–geodesic optimal schedules utilizes proximal-splitting or shooting methods adapted to the metric’s structure. For the WF-δ metric, the staggered-grid discretization enforces the coupled transport–source continuity equation using Douglas–Rachford or least-squares projections, while local proximal operators update state variables by solving decoupled cellwise cubics (Chizat et al., 2015). Similarly, geodesic shooting in LDDMM-FR metamorphosis integrates forward ODEs for state variables, backward for adjoints, and employs gradient-based optimization constrained to constant-speed schedules, with practical implementation relying on kernel acceleration or automatic differentiation as in recent frameworks (Hsieh et al., 2021).

7. Limiting Behavior and Structural Properties

The structural interpolation in the WF-δ metric recovers Fisher–Rao as δ0\delta \to 0 (pure source, no transport), and quadratic Wasserstein as δ\delta \to \infty (pure transport, no creation). The Fisher–Rao metric space of densities is isometric to the positive orthant of the sphere in L2L^2; geodesics are globally unique, and the geodesic schedule traverses constant arc-length in this geometry (Bruveris et al., 2016).

These structures ensure stability and completeness of geodesic schedules for optimal information-theoretic transport and for geometric applications in medical imaging, probabilistic modeling, and statistical inference. The closed-form geodesic schedule, in each geometric setting, provides both practical numerical advantages and conceptual clarity for optimal schedule design in generative and statistical models.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Fisher–Rao–Geodesic Optimal Schedule.