False Discovery Rate Control
- False Discovery Rate (FDR) is the expected ratio of false rejections to total rejections, providing a foundational metric in multiple testing correction.
- Competition-based methods such as target–decoy and knockoff frameworks use score comparisons to estimate and manage the false discovery proportion effectively.
- Advanced procedures like FDP-SD and prediction bands offer probabilistic guarantees on the realized FDP, ensuring high-confidence control in practical applications.
False Discovery Rate (FDR) control is the cornerstone of modern multiple testing, designed to bound the expected proportion of false positives among rejected hypotheses. While procedures such as the Benjamini–Hochberg method ensure that the expected false discovery proportion (FDP) does not exceed a nominal level α, the realized FDP in any specific dataset may substantially exceed α. In the context of competition-based FDR control—chiefly, target–decoy competition (TDC) in proteomics and "knockoff"-based variable selection in regression—a combinatorial structure enables robust FDR control, but further tools are needed for high-confidence control of the realized FDP. Methods such as FDP-SD ("FDP step-down") provide probabilistic guarantees on the FDP, closing the gap between average-case and probabilistic control.
1. Formal Definitions and Competition-Based Frameworks
Let hypotheses be tested, with the number of rejections and the number of false rejections (true nulls rejected). The False Discovery Proportion is
and the False Discovery Rate is its expectation, . FDR control at level ensures .
In Target–Decoy Competition (TDC) frameworks, each hypothesis (e.g., peptide-spectrum match or feature) is assigned a “target” score (signal) and a “decoy” score (null), typically constructed via data shuffling or synthetically generating knockoff variables. For each hypothesis:
- is the “winning” score.
- if (target wins), if (decoy wins).
The hypotheses are sorted in decreasing . For any ,
TDC estimates the FDP among the top hypotheses as and reports all target wins up to
Under the key assumption that, for true nulls, is i.i.d. Rademacher (exchangeability), the procedure guarantees FDR control at level (Luo et al., 2020).
In the knockoff framework for linear regression, a synthetic "knockoff" feature matrix is constructed with exchangeability: null features are indistinguishable from their knockoffs. Each feature/knockoff pair yields statistics , and the entire procedure mirrors the TDC logic (Luo et al., 2020).
2. Limitation of FDR and the Need for High-Confidence FDP Control
While competition-based FDR procedures robustly control the expected value , they provide no guarantee that the realized FDP in any given discovery list is near . Instances where FDP can occur with non-negligible probability—a concern for practitioners requiring strong guarantees on false findings within specific results (Luo et al., 2020, Ebadi et al., 2023).
A probabilistic strengthening of FDR control is False Discovery Proportion-exceedance control (FDX):
for a user-specified tolerance . While classic FDR procedures do not address FDX, post-hoc prediction bands or step-down methods can provide explicit bounds on the realized FDP with high probability.
3. The FDP-SD Step-Down Procedure
FDP-SD is an adaptation of generalized step-down control for the competition context. Its core innovation is a set of data-dependent bounds that guarantee, with prescribed confidence , that the FDP does not exceed . The method proceeds as follows (Luo et al., 2020):
Critical values construction: Given the sorted sequence , for each , FDP-SD computes
where .
Algorithm:
- For each , compute .
- For , define as the largest such that for all .
- Report all target wins among the top .
Guarantee: Under exchangeability, the realized FDP among reported discoveries is controlled:
The method generalizes to multiple decoys and can accommodate further tuning via parameters and for aggressive TDC/knockoff variants.
4. Alternative FDP Prediction Bands: TDC-SB and TDC-UB
Instead of a step-down rule, prediction bands can wrap around any competition-based FDR method to provide a running upper confidence bound on the realized FDP (Ebadi et al., 2023). The TDC-SB (Standardized Band) and TDC-UB (Uniform Band) procedures consider, for each number of decoy wins :
- : number of true-null target-wins prior to the th decoy win.
- The key stochastic upper bound on involves augmenting the observed process so that is stochastically dominated by a negative-binomial process , with a function of TDC/knockoff parameters.
Two constructions:
- TDC-SB: Uses normal approximations to produce
where is the quantile of the standardized process, .
- TDC-UB: Uses the quantile transformation of to define
where is chosen so .
These bounds are then mapped back onto the sorted hypotheses; for any , an upper bound on FDP is provided such that (Ebadi et al., 2023).
Empirically, both SB and UB bands are much tighter than the Katsevich–Ramdas band, and in practical scenarios (proteomics, GWAS, model-X knockoff regression) the upper FDP bounds are near sharp, especially using UB (Ebadi et al., 2023).
5. Theoretical Underpinnings and Assumptions
The key probabilistic structure underlying competition-based FDR control is:
- Exchangeability: For each true-null, the distribution of is invariant to permutation, so is equally likely or and independent across hypotheses.
- Independence: Labels for true nulls are independent, and their distribution does not depend on the scores of false-nulls or their own winning scores (Luo et al., 2020, Ebadi et al., 2023).
These assumptions support the exact calibration of binomial or negative-binomial upper bounding distributions, enabling both expectation control (FDR) and tail probability control (FDX/prediction bands).
6. Practical Implications, Computation, and Extensions
FDP-SD and prediction bands (TDC-SB, TDC-UB) are computationally tractable:
- Sorting requires , while computation of or can be made per or via precomputed tables or incremental updates.
- Flexibility to trade off the nominal level and the tail probability is explicit, in contrast to basic FDR procedures.
Applications include:
- Proteomics: direct substitution of TDC final thresholding with FDP-SD for strong FDP control.
- High-dimensional regression: model-X knockoff pipelines may use the same combinatorial logic, with prediction bands for post-hoc validation.
- GWAS and simulation studies consistently demonstrate that FDP-SD dominates competing methods (e.g., Katsevich–Ramdas band) in terms of power and tightness of FDP bounds, with only minor sacrifice relative to ordinary FDR control.
Generalizations to multiple decoys and randomization for exact calibration are feasible. Future extensions suggested include the optimization of aggressiveness parameters and exploring improved sharpenings for multi-decoy frameworks (Luo et al., 2020, Ebadi et al., 2023).
7. Historical and Methodological Context
Competition-based FDR control, formalized in proteomics (target–decoy) and generalized in statistics through model-X knockoffs, represents a distinctive approach leveraging explicit null labeling and exchangeability. Standard FDR control operates in expectation; methods such as FDP-SD and its prediction-band analogues elevate the guarantee to the probability that the realized FDP never exceeds a threshold, thus filling a fundamental gap between average-case and post-hoc, reproducible false discovery control. The combination of combinatorial structure, probabilistic null modeling, and analytical prediction bands sets competition-based control apart as a unifying theme for rigorous, interpretable multiple testing (Luo et al., 2020, Ebadi et al., 2023).