Papers
Topics
Authors
Recent
2000 character limit reached

α-Scatter Median Matrix Estimation

Updated 10 December 2025
  • α-scatter median matrix is a robust, affine-equivariant estimator of scatter for multivariate data following an α-symmetric law.
  • It generalizes the classical scatter halfspace median using the α-norm geometry to achieve minimax-optimal concentration under contamination.
  • Key properties include a 50% breakdown point, affine invariance, and consistency, making it effective for heavy-tailed and non-elliptical distributions.

The α-scatter median matrix is a robust, affine-equivariant estimator of scatter (dispersion) for multivariate data distributed according to an α-symmetric law. Extending the core principles of halfspace depth to data whose geometry is governed by the α-norm, the α-scatter median matrix achieves minimax-optimal rates of concentration under contamination and incorporates invariance and breakdown properties suitable for multivariate heavy-tailed or non-elliptical data (Bočinec et al., 8 Dec 2025). This estimator generalizes the classical scatter halfspace median matrix, aligning the estimation process with the geometry inherent to α-symmetric distributions.

1. α-Symmetric Distributions and α-Norm Geometry

An α-symmetric distribution on Rd\mathbb{R}^d is defined by its characteristic function depending solely on the α-norm of its argument:

xα=(i=1dxiα)1/α,ψP(t)=E[eitTX]=φ(tα),tRd.\|x\|_\alpha= \Bigl(\sum_{i=1}^d |x_i|^\alpha \Bigr)^{1/\alpha},\quad \psi_P(t)=\mathbb{E}[e^{i t^T X}] = \varphi(\|t\|_\alpha),\quad t\in\mathbb{R}^d.

This family subsumes both elliptical distributions (α=2\alpha=2) and broader classes with heavy tails or non-Euclidean geometry. Key properties include:

  • For any unit vector uu, uTXduαX1u^T X \equiv_d \|u\|_\alpha X_1 (where X1X_1 is a coordinate of XX).
  • Assumption P({0})=0P(\{0\})=0 and d>1d>1 ensure smoothness and guarantee a unique location median at μhs=0\mu_\text{hs}=0 (Bočinec et al., 8 Dec 2025).

2. Classical and α-Scatter Halfspace Depths

The classical scatter halfspace depth for a candidate positive-definite matrix Σ\Sigma and distribution PP under elliptical symmetry (α=2) is:

SD(Σ;P)=infuSd1min{P(uT(Xμhs)uTΣu),  P(uT(Xμhs)uTΣu)}SD(\Sigma;P) = \inf_{u\in S^{d-1}} \min \left\{ P(u^T (X-\mu_\text{hs}) \leq \sqrt{u^T\Sigma u}),\; P(u^T(X-\mu_\text{hs}) \geq \sqrt{u^T\Sigma u}) \right\}

Maximizing this yields the scatter halfspace median. For α2\alpha \neq 2, quadratic forms do not respect the underlying geometry. The α-scatter halfspace depth is defined as:

SDα(Σ;P)=infuSd1min{P(uT(Xμhs)Σ1/2uα),  P(uT(Xμhs)Σ1/2uα)},SD_\alpha(\Sigma;P) = \inf_{u\in S^{d-1}} \min \left\{ P(u^T (X-\mu_\text{hs}) \leq \|\Sigma^{1/2} u\|_\alpha),\; P(u^T(X-\mu_\text{hs}) \geq \|\Sigma^{1/2} u\|_\alpha) \right\},

where Σ1/2\Sigma^{1/2} is the matrix square root in S+d\mathcal{S}_+^d and α\|\cdot\|_\alpha replaces the Euclidean norm (Bočinec et al., 8 Dec 2025). This formulation ensures that the depth contours align with directions imposed by α-symmetry.

3. Construction and Characterization of the α-Scatter Median Matrix

The α-scatter median matrix Σhs,α(P)\Sigma_{hs,\alpha}(P) is any maximizer of SDα(;P)SD_\alpha(\cdot;P):

Σhs,α(P)argmaxΣS+dSDα(Σ;P)\Sigma_{hs,\alpha}(P) \in \arg \max_{\Sigma \in \mathcal{S}_+^d} SD_\alpha(\Sigma;P)

Theorem 6.4 (Bočinec et al., 8 Dec 2025) demonstrates that for α-symmetric PP, the maximizer is spherical:

Σhs,α(P)=σ2Id,σ=F1(3/4)\Sigma_{hs,\alpha}(P) = \sigma^2 I_d, \quad \sigma = F^{-1}(3/4)

where FF denotes the common marginal distribution of X1X_1. Uniqueness and sphericity derive from affine and sign-permutation invariance, restricting maximizers to scalar multiples of the identity. The value σ\sigma is chosen to balance the probabilities in the halfspace depth definition, solving F(σd1/21/α)1/2=1F(σ)F(\sigma d^{1/2-1/\alpha}) - 1/2 = 1 - F(\sigma).

4. Robustness, Equivariance, and Breakdown Properties

The α-scatter median matrix retains several desirable robustness features:

  • Affine Equivariance: For nonsingular AA and any location μ\mu, Σhs,α(PAX+μ)=AΣhs,α(PX)AT\Sigma_{hs,\alpha}(P_{AX+\mu}) = A\Sigma_{hs,\alpha}(P_X) A^T.
  • Sign-Permutation Invariance: For α-symmetric PP, Σhs,α\Sigma_{hs,\alpha} is invariant under signed permutations.
  • ½ Breakdown Point: The use of probability minima over halfspaces ensures the estimator is robust against up to 50% contamination in the sample.
  • Continuity and Consistency: The α-sHD mapping (Σ,P)SDα(Σ;P)(\Sigma, P) \mapsto SD_\alpha(\Sigma;P) is jointly continuous, so sample maximizers converge almost surely to the population value (Bočinec et al., 8 Dec 2025).

5. Concentration and Robustness under Contamination

In the Huber ε-contamination model (P=(1ϵ)P+ϵQP' = (1-\epsilon)P + \epsilon Q), the sample α-scatter median Σ^hs,α,n\widehat{\Sigma}_{hs,\alpha,n} achieves minimax-optimal error bounds. With enough samples (nn large), there exist constants C1,C2>0C_1, C_2>0 such that for any δ(0,1/2)\delta \in (0,1/2):

P{supuSd1(Σ^hs,α,n1/2u)αuασC(ϵ+d/n+ln(1/δ)/n)}12δP\Bigg\{ \sup_{u\in S^{d-1}} \Bigg| \frac{(\widehat{\Sigma}_{hs,\alpha,n}^{1/2}u)_\alpha}{\|u\|_\alpha} - \sigma \Bigg| \leq C\left(\epsilon + \sqrt{d/n} + \sqrt{\ln(1/\delta)/n}\right) \Bigg\} \geq 1-2\delta

Thus, the estimator converges at rate O(ϵ+d/n)O(\epsilon + \sqrt{d/n}) (up to logδ\log\delta factors) in the pseudometric supu(Σ1/2uα/uαΣ1/2uα/uα)\sup_u(|\|\Sigma^{1/2}u\|_\alpha/\|u\|_\alpha - \|\Sigma'^{1/2}u\|_\alpha/\|u\|_\alpha|) (Bočinec et al., 8 Dec 2025).

6. Computational Procedure and Practical Implications

For α-symmetric laws, only scalar multiples of the identity matrix are candidates for Σhs,α(P)\Sigma_{hs,\alpha}(P). In practice:

  • One discretizes the sphere Sd1S^{d-1} to approximate the depth infimum across directions uu.
  • Empirical versions of the depth can be evaluated using plug-in sample statistics and quantile estimation for FF at $3/4$.

This approach efficiently estimates scatter in multivariate models where the underlying distribution deviates from elliptical symmetry, accommodating heavy tails and more complex dependency structures via the α-norm geometry.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Follow Topic

Get notified by email when new papers are published related to α-Scatter Median Matrix.