Geometric Median-of-Means Estimation
- Geometric Median-of-Means is a robust multivariate location estimator that combines the geometric median's resilience with the bias-variance balance of the median-of-means method.
- It computes block means and uses their geometric median to achieve exponential concentration and sub-Gaussian error rates under minimal moment or tail assumptions.
- The estimator naturally extends to high-dimensional, Banach, and metric spaces, ensuring robust performance even in heavy-tailed and adversarial noise environments.
The geometric median-of-means (GMOM) estimator is a robust multivariate location estimator that generalizes the classical univariate median-of-means principle to metric and Banach space valued data. It combines the resilience of the geometric median with the bias-variance trade-offs of the median-of-means, achieving exponentially concentrated error bounds under minimal moment or tail assumptions. The GMOM framework extends naturally to high-dimensional, Banach, and metric spaces, including settings with heavy-tailed or adversarial noise, and is accompanied by non-asymptotic statistical guarantees, efficient computation algorithms, and inference tools.
1. Construction of the Geometric Median-of-Means
Given independent observations in (or in a general Banach or metric space), the GMOM estimator forms by partitioning the sample indices into disjoint blocks of size and computing the block means: The GMOM estimator of the mean is the geometric median of the block means: This extends to general metric spaces , where the estimator minimizes the maximum distance required to "defeat" more than half the block means, formalized through a tournament characterization (Minsker et al., 2023, Yun et al., 2022, Minsker, 2013).
2. Statistical Guarantees and Deviation Inequalities
GMOM achieves sub-Gaussian-type error rates and exponential tail bounds under weak moment or curvature assumptions. In Euclidean settings, with covariance , the following deterministic and probabilistic guarantees hold:
- Bias Bound: For absolutely continuous with mean and geometric median ,
Under log-concave or Gaussian distributions, the bias is , independent of (Minsker et al., 2023).
- Finite-Sample Error Rate: For suitable and , with probability at least ,
Choosing yields nearly sub-Gaussian rates even under heavy tails (Minsker et al., 2023).
- Banach Space Deviation: If weakly concentrated estimators exist with , the GMOM satisfies
for any , with explicit constants (Minsker, 2013).
- Metric Space Extension: In non-positively curved (NPC/CAT(0)) spaces and under entropy conditions, the GMOM achieves
with explicit , depending only on the geometry and VC entropy of the metric space (Yun et al., 2022).
3. Algorithmic Considerations
- Euclidean/ Hilbert Space: The geometric median is computed via Weiszfeld’s algorithm, defined iteratively by
with linear convergence under non-degeneracy (Minsker, 2013).
- Quadratic Growth: The median objective
satisfies a local quadratic growth condition. Specifically, for near the median ,
for explicit constants and , inducing sharp convergence criteria for first-order optimization methods (Minsker et al., 2023).
- Stopping Rule: A practical criterion for halting gradient-based algorithms is
guaranteeing -approximation of the geometric median (Minsker et al., 2023).
- Banach/Metric Space: Subgradient or smoothing algorithms are applicable, and in general metric spaces, one may use minimum enclosing ball (centerpoint) and “median tournament” algorithms, but generic polynomial-time solvers are only available in Euclidean settings (Yun et al., 2022, Minsker, 2013).
4. High-Dimensional and Inference Framework
- Bahadur Representation: For ultrahigh-dimensional with under sub-exponential tails and moment regularity, the spatial median (and thus GMOM of block means) admits an expansion
with for rectangles , facilitating simultaneous confidence intervals and global testing (Cheng et al., 2023).
- Multiplier Bootstrap: Introduces random sign multipliers for empirical spatial medians, enabling law approximation and valid inferences in high-dimensions with no explicit estimation of variance structures (Cheng et al., 2023).
- Multiple Testing: FDR-controlling procedures and global tests using coordinatewise statistics derived from the spatial or GMOM estimators are established under weak dependence and high-dimensional scaling (Cheng et al., 2023).
5. Extensions, Robustness, and Non-Euclidean Settings
GMOM applies to problems in Banach spaces and infinite-dimensional settings with only the requirement of weak block-level concentration. The construction admits finite adversarial contamination, maintaining exponential concentration by adjusting the effective (Minsker, 2013). In NPC metric spaces (e.g., manifolds, tree spaces), GMOM is defined as the center of the smallest ball defeating more than half the blocks, and achieves exponential deviation bounds and error rates, in contrast to polynomial rates for empirical means (Yun et al., 2022).
The table below summarizes the generalization scope:
| Space | GMOM Definition | Statistical Guarantee |
|---|---|---|
| Geometric median of means | Dimension-free, sub-Gaussian tails | |
| Banach/Hilbert space | Geometric median in | Exponential deviation, |
| NPC/CAT(0) metric space | Median-of-means tournament | Exponential tail, explicit constants |
Outlier robustness follows from the geometric median’s resistance properties; adversarial contamination in up to fraction of blocks can be tolerated at bounded cost (Minsker, 2013). Parallelizability arises naturally from block autonomy.
6. Applications and Empirical Behavior
Practical applications include mean and location estimation under heavy-tailed distributions, sparse linear regression, low-rank matrix recovery, and empirical studies with real datasets such as log-returns of financial assets. GMOM achieves tighter risk bounds than the coordinatewise median or empirical mean in these regimes:
- Empirical Performance: Synthetic high-dimensional data confirms predicted dimension dependence of local median curvature () (Minsker et al., 2023).
- Financial Data: On S&P 500 daily log-returns, GMOM outperforms coordinatewise median and sample mean in both small-sample and heavy-tailed scenarios, with error bounds validated empirically (Minsker et al., 2023).
- Statistical Inference: Large-scale genomic studies employ GMOM with multiplier bootstrap for simultaneous confidence interval construction and FDR-controlled multiple testing, demonstrating validity even as grows exponentially with (Cheng et al., 2023).
7. Limitations and Open Problems
Known computational limitations arise primarily in general metric spaces, where no fully polynomial-time implementation exists outside finite-dimensional Euclidean settings (Yun et al., 2022). Fast algorithms for the “median tournament” step in arbitrary NPC spaces remain an open problem. In Banach space extensions, controlling the constants in deviation inequalities and addressing non-unique medians for non-strictly convex norms are ongoing challenges (Minsker, 2013).
GMOM requires solving convex but non-smooth optimization problems—smoothed relaxations (e.g., Charbonnier loss) and tailored stopping rules mitigate computational difficulties (Minsker et al., 2023). In practice, block sizes and number, as well as weakest-block variance, must be chosen to satisfy requisite moment or concentration properties for the guarantees to hold.
Geometric median-of-means estimation offers a unifying framework for robust, high-dimensional, and distribution-free mean estimation, with broad applicability and theoretically grounded performance guarantees in both Euclidean and non-Euclidean domains (Minsker et al., 2023, Cheng et al., 2023, Minsker, 2013, Yun et al., 2022).