Adaptive FDP Estimators
- Adaptive FDP estimators are statistical procedures that use data-driven methods to accurately estimate the false discovery proportion while ensuring finite-sample guarantees.
- They employ adaptive techniques, including linear step-up procedures, competition-based bounds, and high-dimensional regression adjustments, to improve power over classical methods.
- Empirical evaluations and theoretical guarantees demonstrate that these estimators offer tight simultaneous confidence envelopes and scalability, even under differential privacy constraints.
An adaptive FDP estimator is a statistical procedure or formula that estimates or bounds the false discovery proportion (FDP), defined as the ratio of the number of false discoveries (true nulls rejected) to the total number of rejections, in a way that leverages data-driven or parameter-adaptive choices, often with finite-sample and/or distributional guarantees. The recent literature examines adaptive FDP estimation under classical multiple testing (via p-values), competition-based FDR control (e.g., knockoffs), high-dimensional regression, and differential privacy, with several lines of inquiry on estimator consistency, simultaneous confidence envelopes, post-hoc flexibility, and efficiency.
1. Fundamental Definitions and Context
Let denote null hypotheses tested simultaneously. Given a selection rule that rejects hypotheses (among which are falsely rejected true nulls), the false discovery proportion is
and the false discovery rate is . Adaptive FDP estimation is distinguished by procedures that depend on data-driven estimators of nuisance parameters, typically the proportion of true nulls, or involve post hoc bounds informed by the observed data structure (Ditzhaus et al., 2018, Ebadi et al., 2023, Hemerik et al., 2022, Jeng et al., 2018).
2. Adaptive FDP Estimators in Classical Multiple Testing
Adaptive linear step-up procedures, generalizing the Benjamini-Hochberg (BH) procedure, use an estimator for the (unknown) number of true nulls. This enables more powerful testing by adapting critical values: with a tuning constant. Estimating (e.g., via convex combinations of generalized Storey-type estimators) and restricting to p-values below yields exact formulas for all moments of the FDP:
and similarly for higher moments (Ditzhaus et al., 2018). Estimator stability and abundance of rejections are necessary and sufficient for consistency ( in probability).
A large class of adaptive estimators involves bin-wise partitions and weights ,
These are shown to control FDR at finite , with strictly improved asymptotic FDR and power over BH when (Ditzhaus et al., 2018).
3. Data-Adaptive FDP Bounds and Simultaneous Envelopes
Recent developments provide median-unbiased and simultaneous envelopes for the FDP, valid over entire rejection paths and allowing post hoc choice of target FDP (denoted ). Under symmetry and mild stochastic ordering of true null p-values, one has
yielding the adaptive bound
with . To validly select post hoc, the envelope is strengthened: ensuring (Hemerik et al., 2022). These approaches yield computationally efficient algorithms (linear time after sorting) and mFDP-controlling procedures, with enhanced flexibility and interpretable adjusted p-values.
4. Competition-Based Adaptive FDP Bounds
In competition-based FDR control (e.g., knockoffs, target-decoy competition), adaptive FDP bounds are constructed via negative-binomial processes. For each hypothesis , one defines target and decoy wins, with the label or . For any cutoff , denote the decoy count and the target wins, with FDP .
Two principal adaptive upper bands for FDP are developed:
- Standardized band (TDC-SB):
with and the quantile of the standardized process.
- Uniform band (TDC-UB):
where is the largest threshold guaranteeing .
Both bands are shown to adapt tightly to the data (especially for small decoy counts) and empirically outperform the Katsevich–Ramdas bound in diverse settings, maintaining finite-sample exactness and scalability (Ebadi et al., 2023).
5. High-Dimensional Regression: Adaptive FDP Estimation
In high-dimensional regression, the de-sparsified Lasso (“DLasso”) estimator provides adaptive FDP estimation for variable selection. The statistic
is used to rank predictors. The plug-in FDP estimator
approximates the expected number of false discoveries via Normal tail probabilities. The threshold yields consistent FDP control under standard design and sparsity assumptions (Jeng et al., 2018).
6. Adaptive Estimation under Differential Privacy Constraints
Federated differential privacy (FDP) introduces new challenges for adaptation. In federated density estimation, servers add carefully tuned exponential noise in a multiscale oscillation norm to wavelet coefficient estimates, yielding (ε,0)-FDP privacy: Post-processing via block thresholding produces estimators attaining sharp adaptive rates: with analogous bounds for pointwise estimation. Lower bounds demonstrate that adaptation in global risk under FDP incurs an intrinsic factor in the privacy term, and the pointwise risk incurs two such factors, reflecting the unavoidable privacy-adaptation trade-off (Cai et al., 16 Dec 2025).
7. Comparative Properties and Empirical Performance
Adaptive FDP estimators expand on classical FDR control by:
- Providing exact finite-sample moment formulas for the FDP (Ditzhaus et al., 2018).
- Enabling post hoc selection of target FDP levels with simultaneous envelope guarantees (Hemerik et al., 2022).
- Offering tighter bounds, especially at low decoy counts, when benchmarking against existing simultaneous FDP-control bands (Ebadi et al., 2023).
- Achieving consistency and minimax rates, even in challenging regimes (high-dimensional, privacy constrained), where classical approaches may suffer conservatism or inefficiency (Jeng et al., 2018, Cai et al., 16 Dec 2025).
Empirical evaluations confirm that adaptive procedures maintain nominal control and improve power or tightness against competitors. A plausible implication is that envelope-based or negative-binomial-process bounds can yield sharper guarantees and computational tractability, even in the presence of dependence or unknown null proportions.
In summary, adaptive FDP estimators encompass a broad family of data-driven procedures for simultaneous multiple testing, variable selection, competition frameworks, and privacy-preserving inference. They achieve finite-sample validity, enhanced flexibility, and improved performance relative to traditional mean-FDP control, with rigorous consistency and minimax adaptivity in diverse statistical models.