Papers
Topics
Authors
Recent
Search
2000 character limit reached

Riemannian Brownian Motion Prior

Updated 5 March 2026
  • Riemannian Brownian Motion Prior is a probabilistic model defined on paths over Riemannian manifolds, generalizing Euclidean Brownian motion by incorporating manifold geometry via the heat kernel.
  • It underpins nonparametric Bayesian regression and generative modeling, ensuring that inference on manifold-valued data respects intrinsic curvature and topology.
  • Discretized implementations use piecewise-geodesic paths and retraction-based approximations, guaranteeing convergence to the continuous Brownian motion as the mesh size decreases.

A Riemannian Brownian motion prior is a probability measure on paths or random variables valued in a Riemannian manifold, canonically induced by the Brownian motion associated with the Laplace–Beltrami operator. Such priors generalize the classical Gaussian prior (Euclidean Brownian motion) to nonlinear spaces and are governed by the manifold’s geometry. They underlie inference for manifold-valued data, nonparametric Bayesian regression with geometric targets, and generative modeling in machine learning, such as manifold-structured variational autoencoders. The prior is fundamentally determined by the heat kernel on the manifold, encoding curvature, topology, and local volume form.

1. Mathematical Definition and Construction

Let (M,g)(M, g) be a compact DD-dimensional Riemannian manifold with Riemannian volume μ\mu and Laplace–Beltrami operator Δ\Delta. The Riemannian Brownian motion XtX_t on MM is the diffusion whose generator is 12Δ\tfrac12 \Delta and which solves the Stratonovich stochastic differential equation (SDE)

dXt=i=1DVi(Xt)dBti,X0=x,dX_t = \sum_{i=1}^D V_i(X_t) \circ dB^i_t, \quad X_0 = x,

where {V1,,VD}\{V_1, \ldots, V_D\} is any local orthonormal frame and BtiB^i_t are independent standard Brownian motions (Wang et al., 2015, Lee et al., 22 Oct 2025).

The fundamental solution to the heat equation,

tu(t,x)=12Δxu(t,x),u(0,x)=δy(x),\partial_t u(t,x) = \tfrac12\,\Delta_x u(t,x), \quad u(0,x) = \delta_y(x),

is the heat kernel pt(x,y)p_t(x,y), which is the transition density at time tt of Brownian motion from xx to yy (Wang et al., 2015). Spectrally,

pt(x,y)=k=0eλktϕk(x)ϕk(y),p_t(x, y) = \sum_{k=0}^\infty e^{-\lambda_k t} \phi_k(x)\phi_k(y),

where {ϕk}\{\phi_k\} are the eigenfunctions and {λk}\{\lambda_k\} the eigenvalues of Δ-\Delta.

The canonical Brownian motion prior on the path-space C([0,1],M)C([0,1],M) is constructed by prescribing finite-dimensional distributions using ptp_t, then extending uniquely to a measure via Kolmogorov’s extension theorem (Wang et al., 2015).

2. Discretized Priors and Practical Implementation

In inference settings, continuous Brownian motion is discretized for computation. A piecewise-geodesic path with mesh size h>0h>0 is represented by evaluating f(0),f(h),,f(1)f(0), f(h), \ldots, f(1). The discretized Brownian prior assigns a density with respect to μ(1/h+1)\mu^{\otimes (1/h+1)}: πh(f)=1μ(M)k=11/hph(f((k1)h),f(kh)).\pi_h(f) = \frac{1}{\mu(M)} \prod_{k=1}^{1/h} p_h(f((k-1)h), f(kh)). This construction is equivalent to sampling a geodesic random walk with transition kernels given by the heat kernel at each step (Wang et al., 2015, Schwarz et al., 2022).

Efficient sampling can be achieved by replacing the exact exponential map with second-order retractions, providing a local approximation at a fraction of the computational cost. When the retraction is accurate to O(v3)O(\|v\|^3), the resulting random walk converges in law (in the Skorokhod topology) to the Brownian motion as h0h \to 0 (Schwarz et al., 2022).

For embedded or implicit manifolds, projection-based retractions allow for fast computation of proposal points, and for Lie groups, group-exponential or Cayley-type retractions offer efficient alternatives.

Theoretical results ensure that as the mesh size h0h \to 0, the law of the piecewise-geodesic path converges in distribution to the law of the continuous Brownian path on MM (Wang et al., 2015, Schwarz et al., 2022).

3. Posterior Consistency and Contraction for Manifold-Valued Regression

In nonparametric Bayesian regression on manifolds, consider data (ti,xi)(t_i, x_i), where predictors ti[0,1]t_i \in [0,1] and responses xix_i in a compact Riemannian manifold MM, modeled as

xitipσ2(f0(ti),xi),x_i \mid t_i \sim p_{\sigma^2}(f_0(t_i), x_i),

for some (unknown) Lipschitz function f0:[0,1]Mf_0 : [0,1] \to M and fixed σ2\sigma^2 (Wang et al., 2015).

The Riemannian Brownian motion prior is used on the space of regression functions: the prior on C([0,1],M)C([0,1], M) is the law of a Brownian path. For practical inference, one employs a discretized Brownian motion prior on piecewise-geodesic paths, as above.

The main statistical result establishes that the posterior contracts around the true regression function f0f_0 at rate O(n1/4+ϵ)O(n^{-1/4+\epsilon}) (for any small ϵ>0\epsilon>0) in LqL_q metrics: $d_q(f, g) = \left( \int_0^1 \dist_M(f(t), g(t))^q p(t) dt \right)^{1/q}.$ Consistency is verified for all 1q<1 \le q < \infty for discretized priors, and weak consistency for the continuous path-space prior (Wang et al., 2015).

Proof techniques rely on metric-entropy for Hölder sieves, concentration of prior mass (via small-heat-kernel balls), and Kullback–Leibler neighborhood arguments, leveraging the equivalence between Hellinger/KL distances for induced data densities and sup-distances in path space.

4. Geometric and Algorithmic Aspects

The stochastic differential equation on MM can be interpreted in both Stratonovich and Itô forms. On an intrinsic Riemannian manifold, the SDE

dxt=12i=1n(eiei)(xt)dt+i=1nei(xt)dWtid x_t = -\tfrac12 \sum_{i=1}^n (\nabla_{e_i} e_i)(x_t) dt + \sum_{i=1}^n e_i(x_t) \circ dW^i_t

generates Brownian motion with generator 12ΔLB\tfrac12\Delta_{\rm LB}, where \nabla is the Levi–Civita connection (Lee et al., 22 Oct 2025). The Stratonovich drift is geometrically tied to the divergence of the vector fields spanning the tangent space. Conversion to the Itô form removes the explicit drift; Itô Brownian motion lacks a drift but accumulates "curvature-induced" drift through the Stratonovich-to-Itô correction.

In the context of embedded submanifolds or specific geometric structures (Lie groups, submanifolds) the drift and frame-fields encode the response of Brownian motion to curvature, mean-curvature, or group coadjoint structure (Lee et al., 22 Oct 2025).

5. Extensions to Manifold-Learned and Data-Driven Settings

When the ambient prior is not Euclidean but manifold-valued (e.g., in latent variable models), one assigns a Brownian motion prior based on the induced metric g(z)g(z) from a decoder mapping f(z)f(z) in a VAE architecture. The Brownian motion prior generalizes the classic Gaussian prior via the Riemannian heat kernel: p(z)(2πtp)N/2exp(dg(μp,z)22tp)detg(z),p(z) \propto (2 \pi t_p)^{-N/2} \exp\left(-\frac{d_g(\mu_p,z)^2}{2t_p}\right) \sqrt{\det g(z)}, where dg(,)d_g(\cdot,\cdot) is the geodesic distance for the pull-back metric (Kalatzis et al., 2020).

In variational inference, differences in log-densities render normalization constants unnecessary, and Riemannian reparameterization enables gradient-based optimization. Computational cost arises mainly from metric inversion and geodesic computations per sample, but the approach is practical for moderate dimensions. Empirical results on image datasets show improved likelihood estimates and latent space regularization compared to Euclidean Gaussian or learned mixture priors, notably at low latent dimensions (Kalatzis et al., 2020).

6. Generalizations and Convergence of Random Walks

Beyond Riemannian cases, geodesic random walks in Finsler or more general geometric settings converge to limiting diffusions governed by a Riemannian-type Laplace operator. For a Finsler manifold (M,F)(M, F) of bounded geometry and a smooth family of tangent-space distributions {νp}\{\nu_p\}, geodesic random walks (scaled and centered) converge to a limiting diffusion with generator

A=12ΔgA+B,A = \frac12 \Delta^{\mathbf{g}_A} + B,

where gA\mathbf{g}_A is the symmetric part of the covariance induced by {νp}\{\nu_p\}, and BB is a drift term. When νp\nu_p are centered and isotropic, the limit is classical Riemannian Brownian motion. This establishes the robustness of the Riemannian Brownian prior as the canonical limit in a broad class of geometric random walks (Ma et al., 2021).

7. Applications and Empirical Properties

Riemannian Brownian motion priors play a key role in:

  • Nonparametric regression on manifolds, yielding posterior contraction rates and weak/strong consistency results in metric function spaces (Wang et al., 2015).
  • Variational autoencoders with manifold-valued latent spaces, resulting in improved model capacity, better fit to latent data manifolds, and sharper, geometrically valid generative samples (Kalatzis et al., 2020).
  • Latent-trajectory and time-series modeling, where the prior ensures sample paths stay on the manifold, and the law is efficiently sampled via discrete or retraction-based geodesic walks (Schwarz et al., 2022).
  • Bayesian smoothing and filtering in systems evolving on MM, where the prior is combined with data likelihood via state-space models, and inference leverages the Brownian heat kernel or its discretization (Lee et al., 22 Oct 2025).

A plausible implication is that the ubiquity of the Riemannian Brownian motion prior in modeling arises from its invariance property, geometric consistency, and the limiting behavior of a wide range of discrete random walks even in non-Riemannian small-time regimes (Ma et al., 2021). The prior’s contraction properties scale with the mesh size and ambient geometry, with computational bottlenecks primarily in geodesic or metric inversion, but feasible via well-adapted numerical schemes.

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Riemannian Brownian Motion Prior.