Papers
Topics
Authors
Recent
2000 character limit reached

Esseen's Anti-Concentration Bound in Tensors

Updated 27 November 2025
  • Esseen's anti-concentration bound is a precise quantitative measure of the Kolmogorov distance between a normalized tensor sum and a Gaussian distribution.
  • The framework leverages intrinsic parameters such as oscillation, partial-sum seminorms, and correlation measures to extend classical limit theorems to high-dimensional and degenerate settings.
  • The approach unifies results from Esseen, Bolthausen, and Barbour–Chen by using exchangeable-pair coupling and Stein’s method to provide robust error estimates.

The anti-concentration bound of Esseen describes a quantitative, non-asymptotic estimate for the Kolmogorov distance between a normalized statistic and the Gaussian, highlighting the extent to which a linear combination of symmetric, exchangeable random variables (or tensors) avoids being too concentrated around any single value. The work of Dodos–Tyros (Dodos et al., 2022) establishes sharp anti-concentration (Berry–Esseen-type) bounds for sums of the form S=θ,X=i[n]dθiXiS = \langle \theta, X \rangle = \sum_{i\in[n]^d} \theta_i X_i, where XX is a random tensor with strong symmetry, and provides explicit error terms in terms of intrinsic parameters. This framework recovers and extends classical results—such as Esseen’s and Bolthausen’s theorems—to arbitrary tensor order dd, encompassing high-dimensional and degenerate regimes and elucidating the transition between classical independence and combinatorial dependence structures.

1. Framework and Statement of the Anti-Concentration Bound

Let n,dNn,d \in \mathbb{N}; X=(Xi:i[n]d)X = (X_i : i \in [n]^d) a real-valued random tensor; and θ=(θi:i[n]d)\theta = (\theta_i : i\in[n]^d) a deterministic tensor (vector of coefficients). The main object is the sum

S=θ,X=i[n]dθiXiS = \langle \theta, X \rangle = \sum_{i\in[n]^d} \theta_i X_i

Let σ2=Var(S)\sigma^2 = \operatorname{Var}(S) and suppose a nondegeneracy condition on a parameter δ1\delta_1:

δ1>max{osc(X)α,  Bα,  (κn)α}\delta_1 > \max \left\{ \operatorname{osc}(X)^{\alpha},\; B^\alpha,\; \left(\frac{\kappa}{n}\right)^\alpha \right\}

for some α(0,1)\alpha \in (0,1), with κ=20d318d(2d)!\kappa = 20d^3 18^d (2d)! and BB, osc(X)\operatorname{osc}(X) as defined below.

The anti-concentration (Kolmogorov) bound [Theorem 1.4; (Dodos et al., 2022)] is:

dK(S,N(0,σ2))E1+E2+E3d_K(S, N(0, \sigma^2)) \leq E_1 + E_2 + E_3

with E1E_1, E2E_2, E3E_3 given explicitly by: E1=5osc(X)1α+5δ01α+δ0(θ021)d2δ1+6κn1α+4θ02/n E2=236E[X(1,,d)3]δ13/2j=1ni(1)=jθi3 E3=3κdδ1s=2d(ds)s!Σs+16d22d/nθs\begin{align*} E_1 &= 5\,\operatorname{osc}(X)^{1-\alpha} + 5\,|\delta_0|^{1-\alpha} + \left| \frac{\delta_0(\|\theta\|_0^2 - 1)}{d^2 \delta_1} \right| + \frac{6\kappa}{n^{1-\alpha} + 4\|\theta\|_0^2 / n} \ E_2 &= \frac{2^{36} \mathbb{E}[|X_{(1,\ldots,d)}|^3]}{ \delta_1^{3/2} \cdot \sum_{j=1}^n \left| \sum_{i(1)=j} \theta_i \right|^3 } \ E_3 &= \frac{3\kappa}{ d \sqrt{\delta_1} \cdot \sum_{s=2}^d \binom{d}{s} \sqrt{s!} \sqrt{ \Sigma_s + 16 d^2 2^d / n }\, \|\theta\|_s } \end{align*}

2. Parameter Definitions

The error terms depend on several intrinsic parameters and seminorms:

  • ss-Partial Sum Seminorm: For 0sd0\leq s \leq d,

θs=(j[n]s(ji[n]dθi)2)1/2\|\theta\|_s = \left( \sum_{j\in[n]^s} \left( \sum_{j\sqsubseteq i\in[n]^d} \theta_i \right)^2 \right)^{1/2}

Special cases: θ0=iθi\|\theta\|_0 = |\sum_{i}\theta_i| (total sum), θd=θ2\|\theta\|_d = \|\theta\|_2 (Euclidean norm).

  • Exchangeability/Correlation Parameters δt\delta_t: For 0td0 \leq t \leq d,

δs=E[X(1,,d)X(1,,s,d+1,,2ds)]\delta_s = \mathbb{E}\left[ X_{(1,\ldots,d)} X_{(1,\ldots,s,\,d+1,\ldots,2d-s)} \right]

Notably, δ0=E[X(1,,d)X(d+1,,2d)]\delta_0 = \mathbb{E}[X_{(1,\ldots,d)} X_{(d+1,\ldots,2d)}].

  • Finite-population Hoeffding analogues Σs\Sigma_s: For 0sd0\leq s\leq d,

Σs=t=0s(1)st(st)δt\Sigma_s = \sum_{t=0}^s (-1)^{s-t} \binom{s}{t} \delta_t

  • Oscillation: The L1_1 deviation of coordinate block averages,

osc(X)=n1j=1n[n(d1)i(1)=jXi]2δ1L1\operatorname{osc}(X) = \left\| n^{-1}\sum_{j=1}^n \left[ n^{-(d-1)}\sum_{i(1)=j} X_i \right]^2 - \delta_1 \right\|_{L_1}

  • Global Mean-Deviation: B=ndiXiL22B = \| n^{-d} \sum_i X_i \|_{L_2}^2
  • Explicit Constant: κ(d)=20d318d(2d)!\kappa(d) = 20 d^3 18^d (2d)!

3. Structural Hypotheses and Nondegeneracy

The bound requires the following properties for XX and θ\theta:

  • (A1) Moment and variance control: E[Xi]=0\mathbb{E}[X_i]=0, E[Xi2]1\mathbb{E}[X_i^2]\leq 1, EXi3<\mathbb{E}|X_i|^3<\infty
  • (A2) Symmetry, exchangeability, diagonal-free:
    • Xi1idX_{i_1\ldots i_d} invariant under coordinate permutations
    • Distribution of XX invariant under any permutation of [n][n]
    • Xi=0X_{i}=0 if ii has repeated coordinates
  • (A3) Identical symmetry and diagonal-free assumptions for θ\theta
  • Nondegeneracy: δ1\delta_1 must be bounded below as specified above to avoid division by nearly zero denominators.

These structural conditions generalize beyond the i.i.d. setup to highly dependent, symmetric arrays where standard independence-based CLTs do not apply.

4. Connections to Classical Esseen, Bolthausen, and Barbour–Chen Bounds

The Dodos–Tyros bound generalizes several pivotal prior results:

  • i.i.d. (Esseen/Berry–Esseen):

dK(Xin,N(0,1))=O(EX3σ3n1/2)d_K\left( \frac{\sum X_i}{\sqrt{n}}, N(0,1) \right) = O \left( \frac{ \mathbb{E}|X|^3 }{ \sigma^3 n^{1/2} } \right)

for third-moment finite, independent entries.

  • Bolthausen’s combinatorial CLT: For order-1 permutation statistics, e.g. sums ξ(i,π(i))\sum \xi(i, \pi(i)), the optimal rate is

dK(statistic,N(0,1))=O(n1ξ(i,j)3)d_K(\text{statistic}, N(0,1)) = O \left( n^{-1}\sum |\xi(i,j)|^3 \right)

  • Barbour–Chen: For two-dimensional permutation U-statistics, the bound is

dK(W,N(0,1))=O(n1ξ133)+O(Var(ξ2)/n)d_K(W, N(0,1)) = O(n^{-1} \|\xi_1\|_3^3) + O(\operatorname{Var}(\xi_2)/n)

For d=1,2d=1,2, the Dodos–Tyros E2E_2 term matches the “linear” Berry–Esseen rates, while E3E_3 captures the degenerate variance–ratio perturbation, merging these regimes in a unified framework. For d>2d>2, the bound accommodates further tensor structure.

5. Combinatorial Central Limit Theorem for High-Dimensional Tensors

The key methodological advance is a combinatorial CLT tailored to random tensors and permutation statistics [Theorem 2.2]:

Let ξs:[n]s×[n]sR\xi_s: [n]^s \times [n]^s \to \mathbb{R} be Hoeffding-type symmetric, zero-average kernels with βs=i,p[n]sξs(i,p)2\beta_s = \sum_{i,p\in[n]^s} \xi_s(i,p)^2 and β1=n1\beta_1 = n-1. The statistic

W=s=1di[n]Injsξs(i,π(i))W = \sum_{s=1}^d \sum_{i \in [n]^s_{\mathrm{Inj}}} \xi_s(i, \pi(i))

where πUniform(Sn)\pi \sim \mathrm{Uniform}(\mathbb{S}_n), obeys

dK(W,N(0,1))218C1ni,jξ1(i,j)3+Cds=2dβsnsd_K(W,N(0,1)) \leq \frac{2^{18} C_1}{n} \sum_{i,j} |\xi_1(i,j)|^3 + C_d \sum_{s=2}^d \sqrt{ \frac{\beta_s}{n^s} }

with C1451C_1 \approx 451 (Bolthausen’s constant), Cd=5d2ed(2d)!C_d = 5d^2 e^d (2d)!.

The proof constructs an exchangeable-pair coupling (π1,π2)(\pi_1, \pi_2) via random transpositions and exploits Stein’s method—ensuring the required linearity and variance control—supported by a generalized Hoeffding multi-index variance decomposition and direct moment bounds. The approach extends Barbour–Chen’s Stein concentration-inequality methods to high-rank tensors.

6. Optimal Regimes and Theoretical Implications

Sharpness and optimality of the anti-concentration bound depend on intricate relationships between independence, degeneracy, and symmetry:

  • Oscillation-dominated regime: If osc(X)\operatorname{osc}(X) is large (weak dissociation), E1E_1 dominates and cannot be improved beyond O(osc(X))O(\operatorname{osc}(X)).
  • Nearly-linear regime: If XiX_i are almost independent and all δ0,δs0\delta_0, \delta_s \approx 0 (for s2s \geq 2), E2E_2 matches Bolthausen’s O(n1θi3)O(n^{-1} \sum |\theta_i|^3) term.
  • Partially degenerate regime: If degeneracy at some s2s \geq 2 is present but small, E3E_3 yields an error smaller than O(n1/2)O(n^{-1/2}), as in U-statistics of small effective rank.
  • High-dimensional/mixed regime: For large dd, regimes interpolate smoothly between independence and fully degenerate (Hoeffding) structures.

For i.i.d. entries, the bound recovers the classical O(n1/2)O(n^{-1/2}) Berry–Esseen rates when θ1\|\theta\|_1 is small, and Bolthausen’s O(n1)O(n^{-1}) rate when θi=0\sum \theta_i = 0. Fully degenerate U-statistics (δ1==δd1=0\delta_1 = \cdots = \delta_{d-1} = 0) achieve even faster rates in small effective-rank settings.

7. Significance and Extensions

The anti-concentration bound of Esseen for random tensors unifies and extends statistical normal approximation in highly symmetric, exchangeable, and high-dimensional settings. The explicit dependence on the oscillation, correlation, mean-deviation, and partial-sum seminorms provides practically computable error estimates that are minimax optimal in several key regimes. This anti-concentration framework is instrumental in analyzing linear and nonlinear permutation statistics, and provides a rigorous foundation for statistical inference in combinatorial and high-order data analytic scenarios (Dodos et al., 2022).

Definition Search Book Streamline Icon: https://streamlinehq.com
References (1)

Whiteboard

Follow Topic

Get notified by email when new papers are published related to Anti-Concentration Bound of Esseen.