Chatterjee's Rank Correlation Tests

Updated 9 October 2025
  • Chatterjee’s rank correlation is a nonparametric measure that equals 0 if and only if the variables are independent and 1 if and only if one variable is almost surely a measurable function of the other; its rank-based estimator is consistent and asymptotically normal under the null.
  • The estimator is computed in O(n log n) time from differences of consecutive ranks; bias corrections and the m-out-of-n bootstrap support valid inference, especially for non-i.i.d. or noncontinuous data.
  • The test framework is extended to multivariate and high-dimensional settings through nearest-neighbor graphs and spectral analysis, with boosted and combined methods to improve power in detecting complex dependencies.

Chatterjee's rank correlation-based tests constitute a family of nonparametric and distribution-free procedures for quantifying and detecting dependencies between random variables, with particular emphasis on directed (functional) rather than symmetric association. Unlike classical rank-based measures such as Spearman's rho or Kendall's tau, Chatterjee's correlation $\xi$ is tailored to measure the strength of functional dependence: $\xi=0$ if and only if the variables are independent, and $\xi=1$ if and only if one is an (almost sure) measurable function of the other. Since its introduction, the theory, properties, extensions, comparative behavior, implementation protocols, and inferential implications of $\xi$ have been the subject of intensive research across probability, statistics, and applied computational mathematics.

1. Definition, Population Properties, and Computable Form

Chatterjee’s rank correlation is formally defined for a pair of continuous random variables $(X, Y)$ with copula $C$ as

$$\xi(C) = 6 \int_0^1 \int_0^1 \left[\partial_1 C(u,v)\right]^2 \, du \, dv - 2,$$

where $\partial_1 C(u,v)$ is the partial derivative of $C$ with respect to its first argument. Equivalently, it can be written in terms of conditional variances as

$$\xi(X, Y) = \frac{ \int_\mathbb{R} \operatorname{Var}\left(\mathbb{E}[1_{Y \geq y} \mid X]\right) dP_Y(y) }{ \int_\mathbb{R} \operatorname{Var}[1_{Y \geq y}] \, dP_Y(y) }.$$

For independent $(X, Y)$, $\xi=0$; for perfect functional dependence $Y = f(X)$, $\xi=1$. Intermediate values correspond to intermediate degrees of predictability, but $\xi$ does not scale linearly with the strength of association. For a sample without ties, a consistent estimator (exactly unbiased under independence) is

$$\xi_n = 1 - \frac{3}{n^2-1} \sum_{i=1}^{n-1} |r_{[i+1]} - r_{[i]}|,$$

where $[i]$ indexes the observations sorted by $X$ and $r_{[i]}$ is the rank of the corresponding (concomitant) $Y$ value. This computation can be performed in $O(n \log n)$ time for continuous data with no ties.
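The estimator can be sketched in a few lines of NumPy. This is a minimal illustration that assumes no ties in either variable; the function name `chatterjee_xi` is a label chosen for this example and is not taken from any of the cited papers.

```python
import numpy as np

def chatterjee_xi(x, y):
    """xi_n for a sample without ties, measuring how well y is a function of x."""
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    if n < 2 or y.size != n:
        raise ValueError("x and y must have the same length n >= 2")
    order = np.argsort(x, kind="stable")       # sort the pairs by x: O(n log n)
    y_sorted = y[order]
    ranks = np.empty(n, dtype=np.int64)
    # ranks[i] is the rank (1..n) of the y value attached to the i-th smallest x
    ranks[np.argsort(y_sorted, kind="stable")] = np.arange(1, n + 1)
    return 1.0 - 3.0 * np.abs(np.diff(ranks)).sum() / (n**2 - 1)

x = np.arange(1.0, 6.0)
print(chatterjee_xi(x, x))   # 0.5 for a perfectly monotone sample of size 5
```

For a strictly monotone sample the consecutive rank differences are all 1, so the estimate equals $(n-2)/(n+1)$, which is $0.5$ at $n=5$ and approaches 1 only as $n$ grows.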

2. Inference: Asymptotic Laws, Bootstrap, and Bias Correction

For i.i.d. data and continuous margins, the normalized statistic is asymptotically normal under the null,

$$\sqrt{n} \, \xi_n \xrightarrow{d} N\left(0, \frac{2}{5}\right),$$

as established via Stein’s method and projection representations (Auddy et al., 2021, Lin et al., 2022, Kroll, 21 Aug 2024). Consistency and asymptotic normality extend to non-i.i.d. (strongly mixing) data under suitable assumptions, after a vanishing bias correction. Bootstrap inference for $\xi$ must use the $m$-out-of-$n$ bootstrap rather than the standard $n$-out-of-$n$ bootstrap to obtain valid coverage and a correct limit approximation, especially with noncontinuous data or under dependence (Dette et al., 2023, Dalitz et al., 2023). Normalizing $\xi_n$ by its finite-sample upper bound further reduces finite-sample bias.
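The two inferential tools mentioned here can be sketched as follows. This is an illustration under stated assumptions, not the exact procedure of the cited papers: the function names are hypothetical, the resample is drawn with replacement, the interval is a generic root-based bootstrap interval, and the choice of `m` is left to the caller. Because resampling creates ties, the sketch uses Chatterjee's tie-aware formula $\xi_n = 1 - n \sum_i |r_{i+1} - r_i| / (2 \sum_i l_i (n - l_i))$, which reduces to the no-ties formula when all values are distinct.

```python
import math
import numpy as np

def xi_with_ties(x, y, rng):
    """Tie-aware xi_n; ties in x are broken uniformly at random."""
    n = x.size
    order = np.lexsort((rng.random(n), x))      # primary key x, random secondary key
    y_sorted = y[order]
    y_all = np.sort(y)
    r = np.searchsorted(y_all, y_sorted, side="right")       # #{j: y_j <= y_i}
    l = n - np.searchsorted(y_all, y_sorted, side="left")    # #{j: y_j >= y_i}
    denom = 2.0 * np.sum(l * (n - l))
    if denom == 0:                               # all y equal: xi_n is undefined
        return float("nan")
    return 1.0 - n * np.abs(np.diff(r)).sum() / denom

def independence_pvalue(xi_n, n):
    """One-sided p-value from sqrt(n) * xi_n ~ N(0, 2/5) under independence."""
    z = math.sqrt(n) * xi_n / math.sqrt(2.0 / 5.0)
    return 0.5 * math.erfc(z / math.sqrt(2.0))   # equals 1 - Phi(z)

def m_out_of_n_interval(x, y, m, n_boot=500, alpha=0.05, seed=0):
    """Interval for xi from m-out-of-n resampling (m much smaller than n)."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    n = x.size
    xi_hat = xi_with_ties(x, y, rng)
    roots = np.empty(n_boot)
    for b in range(n_boot):
        idx = rng.integers(0, n, size=m)         # m draws with replacement (assumption)
        roots[b] = math.sqrt(m) * (xi_with_ties(x[idx], y[idx], rng) - xi_hat)
    q_lo, q_hi = np.nanquantile(roots, [alpha / 2, 1 - alpha / 2])
    return xi_hat, xi_hat - q_hi / math.sqrt(n), xi_hat - q_lo / math.sqrt(n)

rng = np.random.default_rng(1)
x = rng.normal(size=400)
y = np.sin(3 * x) + 0.3 * rng.normal(size=400)
print(m_out_of_n_interval(x, y, m=60))           # m = 60 is an arbitrary choice
```

The normal p-value is only appropriate for testing independence with continuous i.i.d. data; the resampling interval is the kind of tool the text recommends away from the null or with ties.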

3. Power, Detection Boundaries, and Comparison with Competing Methods

Chatterjee's test is distribution-free under the null and consistent against every fixed alternative, but it is rate sub-optimal for local (contiguous) alternatives to independence: the detection threshold against local parametric alternatives scales as $n^{-1/4}$ (e.g., for local correlations $\rho_n$ with $\xi \sim \rho_n^2$) (Shi et al., 2020, Auddy et al., 2021). Competing methods such as Hoeffding’s $D$, Blum–Kiefer–Rosenblatt’s $R$, or Bergsma–Dassios–Yanagimoto’s $\tau^*$ achieve the $n^{-1/2}$ detection boundary and are thus preferred for local weak alternatives. Nevertheless, for testing non-trivial levels of association (e.g., distinguishing $\xi=\xi_0$ vs $|\xi-\xi_0| \geq c_n$), the Chatterjee-based test is minimax optimal with $c_n$ at the $n^{-1/2}$ rate (Auddy et al., 2021).
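The $n^{-1/4}$ threshold can be read off from a heuristic signal-versus-noise comparison. The sketch below is informal and only uses the two facts already stated in the text, namely $\xi \sim \rho_n^2$ under the local alternative and the $\sqrt{n}$ null limit; the cited papers give the rigorous argument.

```latex
\begin{aligned}
\text{signal:}\quad & \xi \asymp \rho_n^2, \\
\text{null fluctuation:}\quad & \xi_n = O_P\!\left(n^{-1/2}\right)
  \quad \text{since } \sqrt{n}\,\xi_n \xrightarrow{d} N\!\left(0, \tfrac{2}{5}\right), \\
\text{detectable when:}\quad & \rho_n^2 \gg n^{-1/2} \iff \rho_n \gg n^{-1/4}.
\end{aligned}
```

A statistic whose population value is linear in $\rho_n$ would instead need only $\rho_n \gg n^{-1/2}$, which is the boundary attained by the competing methods.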

Recent theoretical and algorithmic advances have addressed this shortcoming:

  • Boosted versions that aggregate information over $M$ right-nearest neighbors, $\xi_{n,M}$, can achieve near-parametric detection boundaries, especially when $M$ is scaled appropriately with $n$ (Lin et al., 2021).
  • Combined tests (e.g., $\max\{|S_n|, \sqrt{5/2} \, \xi_n\}$ with $S_n$ Spearman's rank correlation) yield uniform type I control and substantially improved power across monotonic and nonmonotonic alternatives (Zhang, 2023, Zhang, 24 Jun 2024).
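The combined statistic from the second item can be sketched as below. Two things here are assumptions of this example rather than the cited method: the function names are hypothetical, and the p-value is calibrated by permuting $y$ (valid under independence for i.i.d. data) instead of using the asymptotic null distribution derived in the cited papers. The sketch also assumes no ties.

```python
import numpy as np

def _ranks(v):
    """Ranks 1..n of a vector without ties."""
    r = np.empty(v.size)
    r[np.argsort(v, kind="stable")] = np.arange(1, v.size + 1)
    return r

def combined_statistic(x, y):
    """sqrt(n) * max(|S_n|, sqrt(5/2) * xi_n), S_n being Spearman's correlation."""
    n = x.size
    rx, ry = _ranks(x), _ranks(y)
    s_n = np.corrcoef(rx, ry)[0, 1]                    # Spearman = Pearson on ranks
    r_sorted = ry[np.argsort(x, kind="stable")]        # y-ranks ordered by x
    xi_n = 1.0 - 3.0 * np.abs(np.diff(r_sorted)).sum() / (n**2 - 1)
    return np.sqrt(n) * max(abs(s_n), np.sqrt(2.5) * xi_n)

def permutation_pvalue(x, y, n_perm=199, seed=0):
    """Permutation p-value for the combined statistic (a swapped-in calibration)."""
    rng = np.random.default_rng(seed)
    x = np.asarray(x, dtype=float)
    y = np.asarray(y, dtype=float)
    t_obs = combined_statistic(x, y)
    exceed = sum(combined_statistic(x, rng.permutation(y)) >= t_obs
                 for _ in range(n_perm))
    return (1 + exceed) / (n_perm + 1)

rng = np.random.default_rng(1)
x = rng.uniform(-1.0, 1.0, size=200)
print(permutation_pvalue(x, x**2))   # non-monotone: Spearman is weak, xi_n is strong
```

The factor $\sqrt{5/2}$ puts $\xi_n$ on the same unit-variance scale as $S_n$ under the null, so neither component dominates the maximum by construction.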

4. Multivariate and High-Dimensional Extensions

Extensions of Chatterjee's methodology to multivariate (vector-valued) responses and predictors have been developed at two levels:

  • Azadkia–Chatterjee correlation: Employs nearest-neighbor graph constructions for multivariate variables. Recent versions use rank-based nearest-neighbor graphs to guarantee scale invariance, consistency, and asymptotic normality (Tran et al., 3 Dec 2024).
  • Multi-response dependence measure $T$: For multivariate responses $Y = (Y_1,\ldots,Y_q)$ and predictors $X$, the measure

$$T(Y|X) = 1 - \frac{q - \sum_{i=1}^q \xi(Y_i \mid (X, Y_1,\ldots,Y_{i-1}))}{q - \sum_{i=1}^q \xi(Y_i \mid (Y_1,\ldots,Y_{i-1}))}$$

generalizes $\xi$ and is permutation-invariant under certain symmetrizations. The corresponding estimator $T_n$ is strongly consistent and asymptotically normal (Ansari et al., 2022).

  • Tests of joint and complete independence: In high dimensions, quadratic sum statistics and extreme-value type statistics based on pairwise $\xi_{kl}$ are developed for composite testing, enhanced with variable screening to address dense and sparse alternative regimes (Xia et al., 16 Sep 2024, Olivares et al., 27 Mar 2025). In all cases, the null distribution is derivable or accurately estimated by block-multiplier or $m$-out-of-$n$ bootstraps.
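As a rough illustration of the last item, the pairwise coefficients $\xi_{kl}$ can be collected in a matrix and summarized by a sum-type and a max-type statistic. This is only a sketch: the function names are hypothetical, the standardization uses the bivariate $N(0, 2/5)$ null limit entry by entry, and the exact centering, scaling, screening steps, and critical values of the cited papers are not reproduced. No ties are assumed.

```python
import numpy as np

def xi_n(x, y):
    """No-ties Chatterjee coefficient of y on x."""
    n = x.size
    y_sorted = y[np.argsort(x, kind="stable")]
    ranks = np.empty(n, dtype=np.int64)
    ranks[np.argsort(y_sorted, kind="stable")] = np.arange(1, n + 1)
    return 1.0 - 3.0 * np.abs(np.diff(ranks)).sum() / (n**2 - 1)

def pairwise_xi(data):
    """data has shape (n, p); entry [k, l] is xi_n of column l on column k."""
    p = data.shape[1]
    out = np.zeros((p, p))              # diagonal left at 0 as a placeholder
    for k in range(p):
        for l in range(p):
            if k != l:
                out[k, l] = xi_n(data[:, k], data[:, l])
    return out

def sum_and_max_statistics(data):
    """Sum of squared and maximum standardized off-diagonal coefficients."""
    n, p = data.shape
    xi = pairwise_xi(data)
    z = np.sqrt(5.0 * n / 2.0) * xi[~np.eye(p, dtype=bool)]   # approx N(0, 1) entries
    return float(np.sum(z**2)), float(np.max(z))

rng = np.random.default_rng(0)
data = rng.normal(size=(300, 4))
data[:, 3] = np.sin(4.0 * data[:, 0])   # plant one functional dependence
print(sum_and_max_statistics(data))
```

A sum-type statistic accumulates many weak signals (dense alternatives), while a max-type statistic reacts to a few strong pairs (sparse alternatives), which is why both appear in the literature cited above.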

A further generalization employs the distance-based Chatterjee correlation, where data are mapped to real-valued “distance transformed” representations (Szekely et al. transformation), allowing $\xi$-type dependence measurement and causal inference for general multivariate or even complex-valued data (Pascual-Marqui et al., 24 Jun 2024).

5. Mathematical Relations to Classical Rank Correlations and Structural Constraints

Chatterjee's $\xi$ has fundamentally different structural and functional properties compared to Spearman’s $\rho$ and Spearman’s footrule $\psi$:

  • For continuous bivariate copulas $C$, $\xi(C)$ is a quadratic functional of the copula's derivative, in contrast to $\rho(C)$ or $\psi(C)$, which are linear functionals.
  • The attainable $(\xi, \rho)$ region over all (or stochastically increasing/decreasing) copulas is convex, with boundaries defined by a family of piecewise-linear, absolutely continuous, asymmetric copulas (Ansari et al., 18 Jun 2025). For stochastically monotonic copulas,

$$\xi \leq |\rho|$$

always holds. The maximal possible gap $\rho-\xi$ is $0.4$ for a specific copula with $\xi=0.3$, $\rho=0.7$.

  • The region $(\xi, \psi)$ for stochastically increasing copulas is exactly $\{(x,y): x \leq y \leq \sqrt{x}\}$ (Rockel, 8 Sep 2025). The upper bound is uniquely achieved by the Fréchet copula, and the lower bound is approached by a newly constructed two-parameter copula family.
  • The Markov product of the copula and its transpose provides the essential link: $\xi(C) = \psi(C^{\top} * C)$, connecting $\xi$ to $\psi$ via copula operations.

6. Spectral Analysis and Random Matrix Behavior

For large random vectors of independent variables, the empirical spectral distribution (ESD) of the symmetrized Chatterjee rank correlation matrix converges to the Wigner semicircle law, rather than the Marchenko–Pastur law found for Pearson, Spearman, or Kendall matrices (Dong et al., 8 Oct 2025). This deviation marks the first example of such a phenomenon among correlation matrices and is foundational for developing CLTs for linear spectral statistics and for tests of high-dimensional complete independence based on eigenvalue distributions.

7. Controversies and Statistical Limitations

A key theoretical limitation of Chatterjee’s $\xi$ is its lack of weak continuity (Bücher et al., 15 Oct 2024). That is, any $(X,Y)$ with continuous margins can be approximated arbitrarily closely (in distribution) by random pairs for which $\xi=1$, and thus for any $\xi_0 \in [\xi(X,Y), 1]$ there exists a sequence converging in distribution to $(X,Y)$ along which $\xi$ is constantly equal to $\xi_0$. Therefore,

  • No test based solely on $\xi$ can have nontrivial power uniformly separating $\xi=0$ from $\xi=1$; in particular, tests based on smooth functions of the empirical statistic have trivial power against such "perfect dependence yet nearby" alternatives.
  • Uniform asymptotic confidence intervals for $\xi$ must necessarily degenerate, covering the entire interval $[\xi(P),1]$ with high probability.
  • This impossibility is not a defect particular to $\xi$ but is fundamental to any rank-based association measure that attains $1$ only for measurable functions.
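A small simulation can make the lack of weak continuity concrete. The construction below is an illustrative choice for this example, not one taken from the cited paper: with $X$ uniform on $[0,1)$ and $Y_k = kX \bmod 1$, every $Y_k$ is a deterministic function of $X$, so the population value is $\xi = 1$ for each $k$, yet $(X, Y_k)$ converges in distribution to an independent uniform pair as $k$ grows.

```python
import numpy as np

def xi_n(x, y):
    """No-ties Chatterjee coefficient of y on x."""
    n = x.size
    y_sorted = y[np.argsort(x, kind="stable")]
    ranks = np.empty(n, dtype=np.int64)
    ranks[np.argsort(y_sorted, kind="stable")] = np.arange(1, n + 1)
    return 1.0 - 3.0 * np.abs(np.diff(ranks)).sum() / (n**2 - 1)

rng = np.random.default_rng(0)
x = rng.uniform(size=2000)
for k in (1, 10, 100, 10007):
    y = (k * x) % 1.0            # deterministic function of x: population xi = 1
    print(k, round(xi_n(x, y), 3))
```

At a fixed sample size the estimate is close to 1 for small $k$ and typically close to 0 once the sawtooth oscillates faster than the sample can resolve, even though the population value never changes. No finite sample can tell the large-$k$ case from independence, which is the obstruction behind the statements listed above.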

These facts emphasize the importance of careful interpretation, the necessity for alternative or combined inferential strategies in certain regimes, and motivate the ongoing research in boosting, combining, and generalizing Chatterjee’s framework for practical and theoretical robustness across varied dependence structures.


Table: Comparative Aspects of Chatterjee’s Rank Correlation and Related Quantities

| Quantity | Null limiting law | Power (local alternatives) | Maximal value | Functional target |
| --- | --- | --- | --- | --- |
| $\xi$ (Chatterjee) | Normal, variance $2/5$ | rate sub-optimal ($n^{-1/4}$) | $1$ | functional dependence ($Y=f(X)$) |
| $\rho$ (Spearman) | Normal, variance $1$ | rate-optimal ($n^{-1/2}$) | $1$ (monotone) | concordance (monotonic) |
| $\tau^*$, $D$, $R$ | Non-normal (degenerate) | rate-optimal ($n^{-1/2}$) | less than $1$ | concordance/symmetric dependence |

This synthesis incorporates pivotal results on estimation, asymptotic and finite-sample performance, extension to multivariate and high-dimensional analysis, algebraic structure, and known theoretical limitations, with references to the primary literature where each point is established (Shi et al., 2020, Auddy et al., 2021, Ansari et al., 2022, Zhang, 2023, Dalitz et al., 2023, Zhang, 24 Jun 2024, Kroll, 21 Aug 2024, Xia et al., 16 Sep 2024, Bücher et al., 15 Oct 2024, Tran et al., 3 Dec 2024, Olivares et al., 27 Mar 2025, Ansari et al., 18 Jun 2025, Rockel, 8 Sep 2025, Dong et al., 8 Oct 2025).
