Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
133 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Multiple testing with anytime-valid Monte-Carlo p-values (2404.15586v3)

Published 24 Apr 2024 in stat.ME

Abstract: In contemporary problems involving genetic or neuroimaging data, thousands of hypotheses need to be tested. Due to their high power, and finite sample guarantees on type-I error under weak assumptions, Monte-Carlo permutation tests are often considered as gold standard for these settings. However, the enormous computational effort required for (thousands of) permutation tests is a major burden. Recently, Fischer and Ramdas (2024) constructed a permutation test for a single hypothesis in which the permutations are drawn sequentially one-by-one and the testing process can be stopped at any point without inflating the type-I error. They showed that the number of permutations can be substantially reduced (under null and alternative) while the power remains similar. We show how their approach can be modified to make it suitable for a broad class of multiple testing procedures and particularly discuss its use with the Benjamini-Hochberg procedure. The resulting method provides valid error rate control and outperforms all existing approaches significantly in terms of power and/or required computational time. We provide fast implementations and illustrate its application on large datasets, both synthetic and real.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (41)
  1. A massive 7T fMRI dataset to bridge cognitive neuroscience and artificial intelligence. Nature Neuroscience, 25(1):116–126, 2022.
  2. Estimation of false discovery rate using sequential permutation p-values. Biometrics, 69(1):1–7, 2013.
  3. Sceptre improves calibration and sensitivity in single-cell crispr screen analysis. Genome biology, 22:1–19, 2021.
  4. Causal inference in genetic trio studies. Proceedings of the National Academy of Sciences, 117(39):24117–24126, 2020.
  5. Controlling the false discovery rate: a practical and powerful approach to multiple testing. Journal of the Royal Statistical Society: Series B (Methodology), 57(1):289–300, 1995.
  6. The control of the false discovery rate in multiple testing under dependency. Annals of Statistics, pages 1165–1188, 2001.
  7. Sequential Monte Carlo p-values. Biometrika, 78(2):301–304, 1991.
  8. A graphical approach to sequentially rejective multiple test procedures. Statistics in medicine, 28(4):586–604, 2009.
  9. Panning for gold: ‘model-X’ knockoffs for high dimensional controlled variable selection. Journal of the Royal Statistical Society Series B: Statistical Methodology, 80(3):551–577, 2018.
  10. Designing Monte Carlo implementations of permutation or bootstrap hypothesis tests. The American Statistician, 56(1):63–70, 2002.
  11. On the false discovery rate and an asymptotically optimal rejection curve. 2009.
  12. Sequential permutation testing by betting. arXiv preprint arXiv:2401.07365, 2024.
  13. MMCTest—a safe algorithm for implementing multiple Monte Carlo tests. Scandinavian Journal of Statistics, 41(4):1083–1101, 2014.
  14. A framework for Monte Carlo based multiple testing. Scandinavian Journal of Statistics, 43(4):1046–1063, 2016.
  15. QuickMMCTest: quick multiple Monte Carlo testing. Statistics and Computing, 27:823–832, 2017.
  16. Exceedance control of the false discovery proportion. Journal of the American Statistical Association, 101(476):1408–1417, 2006.
  17. Multiple testing for exploratory research. Statistical Science, 26(4):584–597, 2011.
  18. Phillip Good. Permutation tests: a practical guide to resampling methods for testing hypotheses. Springer Science & Business Media, 2013.
  19. Safe testing. Journal of the Royal Statistical Society Series B: Statistical Methodology (with discussion), 2024.
  20. Adaptive choice of the number of bootstrap samples in large scale multiple testing. Statistical Applications in Genetics and Molecular Biology, 7(1), 2008.
  21. Efficient permutation testing of variable importance measures by the example of random forests. Computational Statistics & Data Analysis, 181:107689, 2023.
  22. False discovery proportion estimation by permutations: confidence for significance analysis of microarrays. Journal of the Royal Statistical Society Series B: Statistical Methodology, 80(1):137–155, 2018.
  23. A texture statistics encoding model reveals hierarchical feature selectivity across human visual cortex. Journal of Neuroscience, 43(22):4144–4161, 2023.
  24. Yosef Hochberg. A sharper Bonferroni procedure for multiple tests of significance. Biometrika, 75(4):800–802, 1988.
  25. Sture Holm. A simple sequentially rejective multiple test procedure. Scandinavian Journal of Statistics, pages 65–70, 1979.
  26. Gerhard Hommel. A stagewise rejective multiple test procedure based on a modified Bonferroni test. Biometrika, 75(2):383–386, 1988.
  27. Time-uniform, nonparametric, nonasymptotic confidence sequences. Annals of Statistics, 2021.
  28. Statistical properties of an early stopping rule for resampling-based multiple testing. Biometrika, 99(4):973–980, 2012.
  29. On closed testing procedures with special reference to ordered analysis of variance. Biometrika, 63(3):655–660, 1976.
  30. Multiple comparisons in drug clinical trials and preclinical assays: a-priori ordered hypotheses. In Vollmar Joachim, editor, Biometrie in der Chemisch-Pharmazeutischen Industrie, pages 3–18. Fischer Verlag, Stuttgart, 1995.
  31. A parametric texture model based on joint statistics of complex wavelet coefficients. International Journal of Computer Vision, 40:49–70, 2000.
  32. Integrated analysis of pharmacologic, clinical and snp microarray data using projection onto the most interesting statistical evidence with adaptive permutation testing. International Journal of Data Mining and Bioinformatics, 5(2):143–157, 2011.
  33. Sequential Monte Carlo multiple testing. Bioinformatics, 27(23):3235–3241, 2011.
  34. Glenn Shafer. Testing by betting: a strategy for statistical and scientific communication. Journal of the Royal Statistical Society Series A: Statistics in Society (with discussion), 184(2):407–431, 2021.
  35. Power of the sequential Monte Carlo test. Sequential Analysis, 28(2):163–174, 2009.
  36. Random sampling: Practice makes imperfect. arXiv preprint arXiv:1810.10985, 2018.
  37. On weighted Hochberg procedures. Biometrika, 95(2):279–294, 2008.
  38. Significance analysis of microarrays applied to the ionizing radiation response. Proceedings of the National Academy of Sciences, 98(9):5116–5121, 2001.
  39. Permutation-based true discovery guarantee by sum tests. Journal of the Royal Statistical Society Series B: Statistical Methodology, 85(3):664–683, 2023.
  40. Resampling-based multiple testing: examples and methods for p-value adjustment, volume 279. John Wiley & Sons, 1993.
  41. Adaptive Monte Carlo multiple testing via multi-armed bandits. In International Conference on Machine Learning, pages 7512–7522. PMLR, 2019.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com