Papers
Topics
Authors
Recent
Search
2000 character limit reached

Riemannian Geometry for Compound Gaussian Models

Updated 27 January 2026
  • Riemannian geometry for compound Gaussian distributions is a framework that uses Fisher information metrics to define smooth parameter manifolds for models with sample-dependent scaling.
  • It enables closed-form geodesic paths, efficient distance computations, and effective gradient-based optimization for change detection and classification tasks.
  • The approach leverages matrix and vector exponentials to manage complex invariances and decoupling between scatter shape and texture in signal processing applications.

Riemannian geometry for compound Gaussian distributions establishes a differential-geometric framework for the statistical analysis, optimization, and inference with compound Gaussian and related non-centered mixtures of scaled Gaussian models. These families incorporate heteroscedasticity through sample-dependent scaling (or “texture” variables), producing parameter spaces that are smooth manifolds equipped with natural Riemannian metrics derived from Fisher information. This geometry enables the definition of geodesics, distance functions, barycenters, and recursive estimation algorithms, which are utilized in change detection, classification, and other signal processing tasks (Bouchard et al., 2020, Collas et al., 2022).

1. Compound Gaussian Family and Parameter Manifold

A pp-dimensional complex compound Gaussian model draws each observation by scaling a zero-mean complex Gaussian vector with a positive random or deterministic texture variable: xiCN(0,τiΣ),τi>0,ΣHp++\mathbf x_i \sim \mathcal{CN}(0, \tau_i \Sigma),\qquad \tau_i > 0, \quad \Sigma \in \mathcal H_p^{++} where Hp++\mathcal H_p^{++} denotes the cone of p×pp \times p Hermitian positive-definite matrices. Identifiability is enforced by the unit-determinant constraint

ΣSHp++={ΣHp++:detΣ=1},\Sigma \in \mathcal{SH}_p^{++} = \{\Sigma \in \mathcal H_p^{++}: \det\Sigma = 1\},

leading to the parameter manifold

Mp,n=SHp++×R++n\mathcal M_{p,n} = \mathcal{SH}_p^{++} \times \mathbb R_{++}^n

with real dimension (p(p+1)/21)+n(p(p+1)/2 - 1) + n (Bouchard et al., 2020).

In the “non-centered mixture of scaled Gaussian” (NC-MSG) variant,

xi=μ+τiyi,yiN(0,Σ),x_i = \mu + \sqrt{\tau_i} y_i,\qquad y_i \sim \mathcal N(0, \Sigma),

with μRp\mu \in \mathbb R^p, ΣSymp+\Sigma \in \mathrm{Sym}_p^+, and iτi=1\prod_i \tau_i = 1 for identifiability, providing a manifold

Θ=Rp×Symp+×{τR++n:i=1nτi=1}\Theta = \mathbb R^p \times \mathrm{Sym}_p^+ \times \{\tau \in \mathbb R_{++}^n: \prod_{i=1}^n \tau_i = 1\}

of dimension p+p(p+1)2+(n1)p + \frac{p(p+1)}{2} + (n-1) (Collas et al., 2022).

2. Fisher Information Metric Structure

The Fisher information defines a Riemannian metric on the parameter manifold of any parametric statistical model. For the compound Gaussian,

ξ,ηθ=1pξΣ,ηΣΣ+1nξτ,ηττ\langle \xi, \eta \rangle_\theta = \frac{1}{p} \langle \xi_\Sigma, \eta_\Sigma \rangle_{\Sigma} + \frac{1}{n} \langle \xi_\tau, \eta_\tau \rangle_{\tau}

with

ξΣ,ηΣΣ=tr(Σ1ξΣΣ1ηΣ),ξτ,ηττ=(ξττ1)T(ηττ1)\langle \xi_\Sigma, \eta_\Sigma \rangle_{\Sigma} = \mathrm{tr}(\Sigma^{-1} \xi_\Sigma \Sigma^{-1} \eta_\Sigma),\qquad \langle \xi_\tau, \eta_\tau \rangle_\tau = (\xi_\tau \odot \tau^{-1})^\mathrm{T} (\eta_\tau \odot \tau^{-1})

where \odot denotes entrywise product (Bouchard et al., 2020).

For NC-MSG, the Fisher–Rao metric is

gθ((δμ,δΣ,δτ),(δμ,δΣ,δτ))=i=1n1τiδμTΣ1δμ+n2Tr(Σ1δΣΣ1δΣ)+p2i=1nδτiτiδτiτig_\theta((\delta\mu,\delta\Sigma,\delta\tau), (\delta\mu',\delta\Sigma',\delta\tau')) = \sum_{i=1}^n \frac{1}{\tau_i} \delta\mu^\mathrm{T} \Sigma^{-1} \delta\mu' + \frac{n}{2} \mathrm{Tr}(\Sigma^{-1} \delta\Sigma\, \Sigma^{-1} \delta\Sigma') + \frac{p}{2}\sum_{i=1}^n \frac{\delta\tau_i}{\tau_i} \frac{\delta\tau_i'}{\tau_i}

with tangent spaces projected to enforce the unit-product constraint (Collas et al., 2022).

3. Geodesics and Distance Computation

For the centered CG manifold Mp,n\mathcal{M}_{p,n}, geodesics decouple across factors:

  • Closed-form (initial velocity):

Σ(t)=Σexp(tΣ1ξΣ),τ(t)=τexp(tτ1ξτ)\Sigma(t) = \Sigma\, \exp(t\,\Sigma^{-1}\xi_\Sigma),\qquad \tau(t) = \tau \odot \exp(t\,\tau^{-1} \odot \xi_\tau)

  • Closed-form (endpoint, for constant-speed geodesic):

Σ(t)=Σ01/2(Σ01/2Σ1Σ01/2)tΣ01/2,τ(t)=τ0(1t)τ1t\Sigma(t) = \Sigma_0^{1/2} \big(\Sigma_0^{-1/2} \Sigma_1 \Sigma_0^{-1/2} \big)^t \Sigma_0^{1/2},\qquad \tau(t) = \tau_0^{\odot(1-t)}\odot \tau_1^{\odot t}

These curves satisfy γ˙γ˙=0\nabla_{\dot\gamma}\dot\gamma=0 for the product Levi-Civita connection (Bouchard et al., 2020).

For models with non-zero location μ\mu, such as NC-MSG, the full geodesic equations—including all variables—do not have a closed form. Instead, second-order retractions are employed: Rθ(tξ)=(μ+tδμ+t22aμ,  Σ+tδΣ+t22aΣ,  τ+tδτ+t22aτ)R_\theta(t\xi) = \big(\mu + t\,\delta\mu + \tfrac{t^2}{2} a_\mu,\; \Sigma + t\,\delta\Sigma + \tfrac{t^2}{2} a_\Sigma,\; \tau + t\,\delta\tau + \tfrac{t^2}{2} a_\tau\big) with (aμ,aΣ,aτ)(a_\mu, a_\Sigma, a_\tau) chosen to preserve manifold constraints (Collas et al., 2022).

The Riemannian squared distance for centered CG is

d2(θ0,θ1)=1plog(Σ01/2Σ1Σ01/2)F2+1nlog(τ01τ1)22d^2(\theta_0,\theta_1) = \frac{1}{p} \|\log(\Sigma_0^{-1/2} \Sigma_1 \Sigma_0^{-1/2})\|_F^2 + \frac{1}{n} \|\log(\tau_0^{-1} \odot \tau_1)\|_2^2

reflecting distances in the affine-invariant metric on SHp++\mathcal{SH}_p^{++} and log-Euclidean structure on textures (Bouchard et al., 2020).

4. Geometric Properties and Invariances

Both component manifolds are Riemannian symmetric spaces: (SHp++,,)(\mathcal{SH}_p^{++}, \langle\cdot,\cdot\rangle) is nonpositively curved (isomorphic to SL(p)/SU(p)SL(p)/SU(p)), and (R++n,,)(\mathbb R_{++}^n, \langle\cdot,\cdot\rangle) is flat. The product geometry is geodesically complete and primarily nonpositively curved. The Fisher–Rao metric is invariant under congruence transformations (induced by SL(p)SL(p) on Σ\Sigma) and texture rescalings. The orthogonality between scatter shape and texture reflects their statistical decoupling (Bouchard et al., 2020). For NC-MSG models, explicit formulas for sectional curvature are not generally available, and even in the Gaussian case with nonzero location parameter, curvature expressions evade closed-form representation (Collas et al., 2022).

5. Computation: Algorithms and Gradient Methods

Numerical algorithms for geodesics and distances leverage matrix and vector exponentials. Each geodesic evaluation on Mp,n\mathcal{M}_{p,n} requires one matrix exponential (or SVD/eigen-decomposition) and one vector exponential, with log-determinant and entrywise logarithm operations for distance computation (Bouchard et al., 2020).

For optimization (e.g., recursive detection, maximum likelihood, and barycenter computation), the Riemannian geometry yields efficient gradient-descent algorithms:

  • Natural (Riemannian) gradient: Given cost h(θ)h(\theta) on Θ\Theta, the Riemannian gradient is obtained via I(θ)1h(θ)I(\theta)^{-1}\nabla h(\theta), with an orthogonal projection onto the tangent space as required by parameter constraints.
  • Retraction-based updates: Parameter updates are performed via Rθk(αkgradh(θk))R_{\theta_k}(-\alpha_k\,\mathrm{grad}\,h(\theta_k)) with step-size αk\alpha_k selected by Armijo backtracking (Collas et al., 2022).

For NC-MSG, the processes of regularized maximum likelihood estimation and symmetrized KL barycenter minimization both employ the Fisher–Rao Riemannian gradient and (second-order) retraction. Each iteration has complexity O(np2+p3)O(np^2 + p^3), and convergence is attained in $5$–$60$ iterations, substantially outpacing naive steepest descent or ad-hoc metric approaches (Collas et al., 2022).

6. Kullback–Leibler Divergence and Statistical Barycenters

A closed-form Kullback–Leibler divergence is available for NC-MSG distributions. Given p1p_1 and p2p_2 parameterized by (μ1,Σ1,τ(1))(\mu_1, \Sigma_1, \tau^{(1)}) and (μ2,Σ2,τ(2))(\mu_2, \Sigma_2, \tau^{(2)}), the divergence is

DKL(p1p2)=12i=1n[τi(1)τi(2)Tr(Σ21Σ1)+1τi(2)(μ2μ1)TΣ21(μ2μ1)p]+12i=1nlogΣ2(τi(2))pΣ1(τi(1))pD_{KL}(p_1 \| p_2) = \frac{1}{2} \sum_{i=1}^n \left[ \frac{\tau_i^{(1)}}{\tau_i^{(2)}} \mathrm{Tr}(\Sigma_2^{-1} \Sigma_1) + \frac{1}{\tau_i^{(2)}} (\mu_2 - \mu_1)^T \Sigma_2^{-1} (\mu_2 - \mu_1) - p \right] + \frac{1}{2} \sum_{i=1}^n \log \frac{|\Sigma_2| (\tau_i^{(2)})^p}{|\Sigma_1| (\tau_i^{(1)})^p}

A symmetrized version, δ(θ1,θ2)=12(DKL(p1p2)+DKL(p2p1))\delta(\theta_1, \theta_2) = \frac{1}{2}(D_{KL}(p_1\|p_2) + D_{KL}(p_2\|p_1)), serves as a divergence for barycenter computation. The barycenter (Fréchet mean) is computed by minimizing the average symmetrized divergence, also using Riemannian gradient descent with retraction (Collas et al., 2022).

7. Applications and Practical Implications

Riemannian geometry for compound Gaussian models underpins recursive change detection, with algorithms leveraging the explicit geodesic and exponential map for low-complexity updates of sufficient statistics, e.g., in Constant False Alarm Rate (CFAR) detectors, where the overall per-block update cost is O(p3+n)O(p^3 + n) (Bouchard et al., 2020). For non-centered models, the KL divergence enables nearest-centroid classification that remains robust under affine transformations due to model invariances. Numerical experiments, such as on hyperspectral time series (e.g., Breizhcrops), demonstrate high accuracy and computational efficiency, with Fisher–Rao RGD converging significantly faster and using less computation per step than non-geometric or ad-hoc alternatives (Collas et al., 2022).

These geometric approaches have been integrated and contextualized within a lineage of Riemannian statistics on symmetric space models, recursive estimation, and SPD matrix analysis (e.g., Moakher, Bhatia, Smith, Zhou & Said). Ongoing research addresses existence and uniqueness criteria for regularized Riemannian objectives and seeks closed-form results for curvature and geodesics in more general parameter settings (Bouchard et al., 2020, Collas et al., 2022).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Riemannian Geometry for Compound Gaussian Distributions.