Boosting e-BH via conditional calibration (2404.17562v1)
Abstract: The e-BH procedure is an e-value-based multiple testing procedure that provably controls the false discovery rate (FDR) under any dependence structure between the e-values. Despite this appealing theoretical FDR control guarantee, the e-BH procedure often suffers from low power in practice. In this paper, we propose a general framework that boosts the power of e-BH without sacrificing its FDR control under arbitrary dependence. This is achieved by the technique of conditional calibration, where we take as input the e-values and calibrate them to be a set of "boosted e-values" that are guaranteed to be no less -- and are often more -- powerful than the original ones. Our general framework is explicitly instantiated in three classes of multiple testing problems: (1) testing under parametric models, (2) conditional independence testing under the model-X setting, and (3) model-free conformalized selection. Extensive numerical experiments show that our proposed method significantly improves the power of e-BH while continuing to control the FDR. We also demonstrate the effectiveness of our method through an application to an observational study dataset for identifying individuals whose counterfactuals satisfy certain properties.
- A survey of anomaly detection techniques in financial domain. Future Generation Computer Systems, 55:278–288.
- Controlling the false discovery rate via knockoffs. The Annals of Statistics, 43(5):2055 – 2085.
- Derandomized novelty detection with fdr control via conformal e-values. Advances in Neural Information Processing Systems, 36.
- Testing for outliers with conformal p-values. The Annals of Statistics, 51(1):149–178.
- Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal statistical society: series B (Methodological), 57(1):289–300.
- The control of the false discovery rate in multiple testing under dependency. Annals of statistics, pages 1165–1188.
- Panning for gold:‘model-x’knockoffs for high dimensional controlled variable selection. Journal of the Royal Statistical Society Series B: Statistical Methodology, 80(3):551–577.
- Assessing treatment effect variation in observational studies: Results from a data challenge. Observational Studies, 5(2):21–35.
- Novelty detection-based approach for alzheimer’s disease and mild cognitive impairment diagnosis from eeg. Medical & Biological Engineering & Computing, 59:2287–2296.
- Multiple testing under negative dependence. arXiv preprint arXiv:2212.09706.
- Farcomeni, A. (2007). Some results on the control of the false discovery rate under dependence. Scandinavian Journal of Statistics, 34(2):275–297.
- On the benjamini–hochberg method.
- Conditional calibration for false discovery rate control under dependence. The Annals of Statistics, 50(6):3091 – 3118.
- Gao, Z. (2023). Adaptive storey’s null proportion estimator. arXiv preprint arXiv:2310.06357.
- Simultaneous hypothesis testing using internal negative controls with an application to proteomics. arXiv preprint arXiv:2303.01552.
- A stochastic process approach to false discovery control.
- Safe testing. Journal of the Royal Statistical Society, Series B (Methodology), with discussion.
- Grünwald, P. D. (2023). The e-posterior. Philosophical Transactions of the Royal Society A, 381(2247):20220146.
- Time-uniform chernoff bounds via nonnegative supermartingales. Probability Surveys.
- Time-uniform, nonparametric, nonasymptotic confidence sequences. The Annals of Statistics, 49(2):1055 – 1080.
- A two-sample conditional distribution test using conformal prediction and weighted rank sum. Journal of the American Statistical Association, pages 1–19.
- E-values as unnormalized weights in multiple testing. Biometrika, page asad057.
- Causal inference in statistics, social, and biomedical sciences. Cambridge university press.
- Model-free selective inference under covariate shift via weighted conformal p-values. arXiv preprint arXiv:2307.09291.
- The numeraire e-variable. arXiv preprint arXiv:2402.18810.
- E-values, multiple testing and beyond. arXiv preprint arXiv:2312.02905.
- Integrative conformal p-values for powerful out-of-distribution testing with labeled outliers. arXiv preprint arXiv:2208.11111.
- Improving knockoffs with conditional calibration. arXiv preprint arXiv:2208.09542.
- Semi-supervised multiple testing. Electronic Journal of Statistics, 16(2):4926–4981.
- Scikit-learn: Machine learning in Python. Journal of Machine Learning Research, 12:2825–2830.
- Game-theoretic statistics and safe anytime-valid inference. Statistical Science, 38(4):576–601.
- Randomized and exchangeable improvements of markov’s, chebyshev’s and chernoff’s inequalities. arXiv preprint arXiv:2304.02611.
- Derandomised knockoffs: leveraging e-values for false discovery rate control. Journal of the Royal Statistical Society Series B: Statistical Methodology, 86(1):122–154.
- Rubin, D. B. (1974). Estimating causal effects of treatments in randomized and nonrandomized studies. Journal of educational Psychology, 66(5):688.
- Sarkar, S. K. (2023). On controlling the false discovery rate in multiple testing of the means of correlated normals against two-sided alternatives. arXiv preprint arXiv:2304.05261.
- Shafer, G. (2021). Testing by betting: A strategy for statistical and scientific communication. Journal of the Royal Statistical Society Series A: Statistics in Society, 184(2):407–431.
- Test Martingales, Bayes Factors and p-Values. Statistical Science, 26(1):84 – 101.
- In situ cell-type-specific cell-surface proteomic profiling in mice. Neuron, 110(23):3882–3896.
- Powerful knockoffs via minimizing reconstructability. The Annals of Statistics, 50(1):252–276.
- Storey, J. D. (2002). A direct approach to false discovery rates. Journal of the Royal Statistical Society Series B: Statistical Methodology, 64(3):479–498.
- Strong control, conservative point estimation and simultaneous conservative consistency of false discovery rates: a unified approach. Journal of the Royal Statistical Society Series B: Statistical Methodology, 66(1):187–205.
- Novelty detection for the identification of masses in mammograms.
- Conformal prediction under covariate shift. Advances in neural information processing systems, 32.
- Nonparametric e-tests of symmetry. New England Journal of Statistics in Data Science.
- Algorithmic learning in a random world, volume 29. Springer.
- E-values: Calibration, combination and applications. The Annals of Statistics, 49(3):1736–1754.
- Confidence and discoveries with e-values. Statistical Science, 38(2):329–354.
- E-backtesting. arXiv preprint arXiv:2209.00991.
- False discovery rate control with e-values. Journal of the Royal Statistical Society Series B: Statistical Methodology, 84(3):822–852.
- Universal inference. Proceedings of the National Academy of Sciences, 117(29):16880–16890.
- Time-uniform central limit theory and asymptotic confidence sequences. arXiv preprint arXiv:2103.06476.
- Estimating means of bounded random variables by betting. Journal of the Royal Statistical Society Series B: Statistical Methodology, 86(1):1–27.
- Anytime-valid off-policy inference for contextual bandits. ACM/JMS Journal of Data Science.
- A power and prediction analysis for knockoffs with lasso statistics. arXiv preprint arXiv:1712.06465.
- More powerful multiple testing under dependence via randomization. arXiv preprint arXiv:2305.11126.
- Online multiple testing with e-values. arXiv preprint arXiv:2311.06412.
- Post-selection inference for e-value based confidence intervals. arXiv preprint arXiv:2203.12572.
- On the existence of powerful p-values and e-values for composite hypotheses. ArXiv preprint. arXiv:2305.16539.