Riemannian Score-Based Generative Models
- Riemannian Score-Based Generative Models are frameworks that extend diffusion methods to complex data manifolds using intrinsic and score-derived Riemannian metrics.
- They integrate stochastic differential equations with geometric constructs such as geodesics and the Laplace–Beltrami operator to perform interpolation, sampling, and editing.
- Empirical evaluations show that RSGMs achieve competitive performance in image interpolation and generative tasks on both synthetic and structured manifolds.
Riemannian Score-Based Generative Models (RSGMs) generalize diffusion-based generative modeling beyond the Euclidean setting by endowing data spaces with nontrivial Riemannian metrics derived from either intrinsic data geometric structure or from the learned score function produced by neural score networks. This approach enables the principled treatment of interpolation, sampling, and generation of data lying on complex manifolds—whether these are known a priori (e.g., spheres, rotation groups, or mesh surfaces) or are implicitly defined by the learned density of powerful diffusion models. RSGMs combine the stochastic analysis of noising and denoising SDEs on manifolds with the geometric machinery of Riemannian metrics, geodesics, and associated differential operators.
1. Geometric Foundations of RSGMs
RSGMs operate on a smooth, possibly high-dimensional Riemannian manifold . The Riemannian metric is a smoothly varying family of inner products on each tangent space , represented in local coordinates by a symmetric positive definite matrix . The metric induces a volume form , a gradient operator , divergence , and the Laplace–Beltrami operator . These structures underlie both the forward diffusion ("noising") and the reverse-time generative ("denoising") processes in RSGMs (Bortoli et al., 2022).
A prototypical modeling choice is to let the manifold be a submanifold of ambient space: for some smooth map of full rank on . The tangent space at is , with the projection operator (Liu et al., 7 May 2025).
2. Riemannian Metrics Derived from the Score Function
For models in which data lies in with an unknown, complex structure (e.g., images), RSGMs define a data-space metric using the score function . Two main constructions are found in recent work:
- Score Jacobian Metric: , where . In coordinates,
This metric compresses directions with low change in the score and expands those with sharp score variations, aligning geodesics with natural semantic changes on the data manifold (Saito et al., 28 Apr 2025).
- Stein Score Metric Family: , , where . This metric leaves tangential directions to the score unchanged and stretches normal directions, effectively encoding the data manifold geometry implicitly learned by the diffusion model (Azeglio et al., 16 May 2025).
These score-derived metrics are symmetric positive definite when the score is nondegenerate and produce meaningful geometric structure for downstream tasks such as interpolation, extrapolation, and image editing.
3. Forward and Reverse Diffusions on Manifolds
The general RSGM paradigm involves a forward process (diffusion), which gradually corrupts data with noise, and a reverse-time process that generatively reconstructs data samples. On a Riemannian manifold, the forward SDE can be written as
where is manifold Brownian motion and is a (possibly time-dependent) scalar or matrix controlling the noise level. For compact manifolds, pure Brownian noising (drift , ) is common; for noncompact manifolds, an Ornstein–Uhlenbeck drift or a Brownian bridge may be more appropriate (Bortoli et al., 2022).
The time-reversal of the diffusion yields a generative process: with the so-called Riemannian score . In submanifold settings, all drift and diffusion terms are projected onto the tangent space using (Liu et al., 7 May 2025).
Score approximation is by neural networks , trained with denoising score matching (DSM) or sliced score matching (SSM) loss: where the transition kernel may be tractable only in a few special cases.
4. Numerical Methods: Geodesic Computation and Interpolation
A distinguishing feature of recent data-space RSGMs is the explicit use of geodesics in the score-based metric for interpolation, editing, and exploration. Given a Riemannian metric , geodesics minimize the path energy
subject to boundary conditions. The geodesic equations involve Christoffel symbols: with explicit expressions for depending on the local behavior of the score (Saito et al., 28 Apr 2025, Azeglio et al., 16 May 2025).
In practical high-dimensional settings, geodesics are discretized, and the length or energy functional is minimized numerically. Energy-based optimization (using gradient-based solvers such as Adam and, in some cases, Riemannian Adam for normalization/transport) is favored over explicit ODE integration. Regularization by the variance of Euclidean step-lengths is often included to prevent path collapse (Saito et al., 28 Apr 2025).
For interpolation, real or generated data points are first encoded into the noise space (e.g., by DDIM inversion), geodesics are computed under the current metric, and the intermediates are decoded back to data space. For extrapolation, manifold-aware walks are performed by initializing a momentum along the last geodesic segment and updating with a convex combination of the momentum and the local score, enabling plausible traversal beyond empirical data (Azeglio et al., 16 May 2025).
5. Empirical Results and Comparisons
Experimental evaluations have demonstrated that RSGMs achieve superior or competitive performance to previous generative and interpolation schemes on diverse manifolds and high-dimensional data.
- Image Interpolation: On Stable Diffusion and MNIST, geodesic interpolation in the score-based metric yields smoother, more realistic, and semantically coherent transitions compared to linear (Lerp/Slerp) or latent-noise correction methods (NAO, NoiseDiffusion). Quantitatively, RSGMs match or outperform baselines on metrics such as MSE, LPIPS, DreamSim, and CLIP-based scores; for instance, the highest "Reality" (0.716), "Noisiness" (0.818), and competitive "Fidelity" (0.810) scores on Stable Diffusion were observed for the proposed RSGM approach (Saito et al., 28 Apr 2025).
- Synthetic and Structured Data: On synthetic spheres (), RSGMs nearly recover ground-truth geodesics with relative error dropping from 4.9% (no geometric term) to 0.07% for large metric strength () (Azeglio et al., 16 May 2025). On Rotated MNIST, improvements in PSNR and SSIM favor the geodesic method.
- Generative Modeling on Known Manifolds: For explicit Riemannian manifolds (e.g., Earth data on , SO(10), or molecular configuration spaces), RSGMs achieve NLL and sample-quality competitive with or exceeding mixtures of Kent, Riemannian CNF, Moser flows, and Riemannian flow models (Bortoli et al., 2022, Liu et al., 7 May 2025).
A summary of typical quantitative results is given in the table below, highlighting image interpolation metrics on Stable Diffusion (four test examples) (Saito et al., 28 Apr 2025):
| Method | MSE [] | LPIPS [] | Reality | Noisiness | Fidelity |
|---|---|---|---|---|---|
| Lerp | 7.89 | 1.71 | 0.389 | 0.406 | 0.686 |
| Slerp | 7.89 | 1.71 | 0.704 | 0.765 | 0.784 |
| NAO | 64.04 | 6.03 | 0.617 | 0.766 | 0.815 |
| NoiseDiff | 14.13 | 2.35 | 0.600 | 0.646 | 0.783 |
| Proposed RSGM | 7.89 | 1.71 | 0.716 | 0.818 | 0.810 |
6. Extensions and Algorithmic Variants
RSGMs have been extended and adapted in several directions:
- Riemannian Schrödinger Bridge: The Riemannian Diffusion Schrödinger Bridge (RDSB) generalizes classical Schrödinger bridge algorithms to non-Euclidean settings, enabling entropic interpolation between distributions on manifolds. Alternating optimization of forward and backward drifts via iterative proportional fitting (IPF) reduces the number of simulation steps required and enhances sample quality, particularly for interpolation tasks (Thornton et al., 2022).
- Projection Schemes for General Submanifolds: Riemannian Denoising Diffusion Probabilistic Models (RDDPMs) leverage only the level-set definition and its derivatives to implement forward and reverse SDEs on general submanifolds, circumventing the need for explicit geodesic, eigenfunction, or heat kernel computations. Score approximation is conducted with projected neural networks, and stepwise projection via Newton's method enforces manifold constraints (Liu et al., 7 May 2025).
- Conditional/Richer Geodesics and Sampling: Potential extensions include Riemannian Langevin dynamics for curvature-aware generation, class-conditional or text-guided geodesics via additional forcing, and the application of frame-wise geodesic parallel transport for tasks such as video editing (Saito et al., 28 Apr 2025).
- Open Mathematical and Computational Challenges: Key directions for future research include efficient approximation of geometric quantities (e.g., Christoffel symbols, divergence corrections) in high dimension, metric extension to the full time-dependent SDE, and the analysis of manifold singularities arising from degenerate scores (Saito et al., 28 Apr 2025).
7. Computational Complexity, Limitations, and Outlook
RSGMs incur additional computational overhead relative to Euclidean SGMs due to the need for geometric operations (metric calculations, projections, geodesic shooting), but complexity is generally dominated by the cost of score network evaluation and optimization steps. For submanifold projection methods, the chief cost is Newton iterations for constraint satisfaction per SDE step (Liu et al., 7 May 2025). Sampling costs scale as or , where is latent dimension, is data dimension, and is the number of discretization steps.
Limitations include the tractability of heat kernels or transition densities for generic manifolds, sensitivity to discretization choices, and the challenges posed by manifolds with boundary or singularity loci. Empirically, RSGMs achieve robust sample quality across a range of data modalities and geometries; a plausible implication is that score-based metrics capture semantically meaningful variations aligned with the true data manifold.
In summary, Riemannian Score-Based Generative Models synthesize differential geometry and modern generative modeling, providing a principled framework for generative tasks and interpolation on complex data manifolds, both known and implicitly learned (Bortoli et al., 2022, Saito et al., 28 Apr 2025, Azeglio et al., 16 May 2025, Liu et al., 7 May 2025, Thornton et al., 2022).