Higher Criticism Statistic
- Higher Criticism Statistic is a method that aggregates and evaluates small p-values to detect sparse, weak signals in high-dimensional data.
- It leverages both empirical distributions and asymptotic theory to establish optimal detection boundaries in multiple-testing frameworks.
- Practical applications include multi-stream change-point detection with careful threshold calibration to minimize detection delays.
The higher criticism (HC) statistic is a central tool for large-scale detection of rare and weak signals, particularly within high-dimensional multiple-testing, change-point detection, and signal recovery problems. Introduced by Donoho and Jin, HC quantifies the aggregate excess of small -values relative to the null expectation, enabling detection of alternatives where only a small, unknown fraction of features or streams are affected. The statistic has a rich mathematical structure, precise asymptotic theory, and connections to optimal detection boundaries in sparse regimes.
1. Definition and Formulation
Let be one-sided -values, sorted so that . The canonical higher criticism statistic is
In large-scale testing applications, it is customary to restrict to for some fixed ; this restricts attention to the smallest -values, which are most informative under sparse alternatives.
HC can also be written functionally in terms of the empirical distribution function : and .
In the context of sequential or multi-stream problems, the statistic is applied at each time to the set of per-stream -values, leading to a sequence .
2. Operational Principle and Detection Boundary
HC is designed to optimally detect the presence of a sparse mixture, where an unknown, vanishing fraction of the population exhibits a weak deviation. Under the rare/weak normal means model— with —there is a sharp "detection boundary" in the parameter space: For , HC is fully powered—i.e., asymptotic type I plus type II error tends to 0. Below this curve, no test, including HC, is powerful.
In heteroscedastic or multistream settings, the boundary generalizes. Let be the affected stream fraction, , and for post-change variance : This boundary governs the minimum detection delay in multi-stream fastest change-point detection (Gong et al., 2024).
3. Calculation of Stream-wise -values and Multi-stream Aggregation
In multi-stream change-point detection, consider observations , , , where under the null , and under the alternative a sparse unknown subset of streams undergoes a post- shift to . The per-stream -values depend on the underlying detection statistic:
- CUSUM / Likelihood-Ratio (LR) Statistic ( known, ):
The -value is .
- Generalized Likelihood-Ratio (GLR) Statistic ( unknown, ):
The -value is .
For each time , the set is constructed, ordered, and HC is applied to aggregate evidence across all streams.
4. Stopping Rule, False Alarm Control, and Detection Delay
The global detection procedure is
where is the HC statistic at time over streams and is a threshold to guarantee a desired false-alarm rate (often taken constant).
Threshold Calibration:
- Under the null, one chooses so that as .
- This can be achieved via Monte Carlo or from the large-sample null theory of HC, which gives asymptotic Gumbel-type distributions.
Detection Delay:
- When a change occurs at unknown time , with affected streams and mean shift , the delay converges in distribution:
- Under , no alarm occurs with probability tending to 1.
Key Theorem ((Gong et al., 2024), Gong–Kipnis–Xie):
- There exists such that (i) , and (ii) .
- Uniformly over , the worst-case expected detection delay satisfies
5. Proof Techniques and Moderate Deviations Analysis
The proof combines:
- Uniformity under : the per-stream -values are i.i.d. Uniform(0,1), so the maximal HC is bounded by a threshold with high probability.
- Under : the affected streams yield -values exhibiting moderate-deviation (or log-) behavior:
where is standard normal. This drives a localized excess of small -values detectable by HC.
- Classical HC power analysis (Donoho–Jin framework) demonstrates that the detection occurs as soon as , pinning down the minimal delay.
This approach generalizes to the heteroscedastic case (), accommodating unknown post-change variances.
6. Implementation, Calibration, and Tuning Considerations
Algorithmic Steps:
- For each time , and each stream , compute a change-point detection statistic ( or ).
- Calculate per-stream -values using the exact null distribution.
- Collect and sort these -values; compute using a rank cutoff (e.g., top smallest -values).
- Signal a detected change if .
Threshold determination:
- Can be set empirically via Monte Carlo under the null model, or via asymptotic approximations: for large , is approximately Gumbel, scaling as .
- Limiting null distributions may be slow to set in finite ; empirical calibration is often preferred for stringent control.
Practical recommendations:
- For large , restrict maximization to (e.g., or $0.5$) to avoid instability from extreme-order -values.
- Under strong dependence or heteroscedastic variance, ensure uniformity of null -values holds.
- Computational cost is per time step (due to sorting and scan).
7. Significance, Limitations, and Comparison to Information-Theoretic Bounds
Significance:
- HC attains the optimal (information-theoretic) detection delay for sparse change-point detection, without requiring knowledge of which streams are affected or the precise value of .
- The approach extends to general settings (unknown variance, weak or moderate signals, heteroscedasticity) as long as streamwise -values are exactly or approximately uniform under the null.
Limitations:
- When the fraction of affected streams is not sparse (i.e., near 0), HC is suboptimal compared to bulk averaging procedures.
- Under heavy-tailed or serially dependent data, uniformity of -values may fail, requiring cautious model checking or adaptation.
- In the very low-count regime, phase transitions in detectability become non-Gaussian, and HC may require thresholding or cell selection for optimality (see (Chan, 2023)).
Comparison:
- In the special case , the HC-based procedure matches the delay lower bound derived in prior work (Chan, 2017), achieving minimax optimality among sequential detectors.
- The derived phase diagram in precisely coincides with the Donoho–Jin boundary, generalizing the result from mean-shift detection in high-dimensional mean testing to quickest change detection in multi-stream scenarios.
Summary Table: Detection Delay and Boundary
| Model Parameterization | Detection Delay | Detection Boundary |
|---|---|---|
| , , | <br> | |
| , , | See definition in Section 2 |
HC for multi-stream change-point detection achieves the theoretical detection delay lower bound under general settings, provided careful calibration and accurate -value computation are maintained. This framework is robust, adaptive, and achieves rate-optimal performance without requiring explicit signal localization.