Papers
Topics
Authors
Recent
Search
2000 character limit reached

A conformal test of linear models via permutation-augmented regressions

Published 11 Sep 2023 in stat.ME, math.ST, and stat.TH | (2309.05482v3)

Abstract: Permutation tests are widely recognized as robust alternatives to tests based on normal theory. Random permutation tests have been frequently employed to assess the significance of variables in linear models. Despite their widespread use, existing random permutation tests lack finite-sample and assumption-free guarantees for controlling type I error in partial correlation tests. To address this ongoing challenge, we have developed a conformal test through permutation-augmented regressions, which we refer to as PALMRT. PALMRT not only achieves power competitive with conventional methods but also provides reliable control of type I errors at no more than $2\alpha$, given any targeted level $\alpha$, for arbitrary fixed designs and error distributions. We have confirmed this through extensive simulations. Compared to the cyclic permutation test (CPT) and residual permutation test (RPT), which also offer theoretical guarantees, PALMRT does not compromise as much on power or set stringent requirements on the sample size, making it suitable for diverse biomedical applications. We further illustrate the differences in a long-Covid study where PALMRT validated key findings previously identified using the t-test after multiple corrections, while both CPT and RPT suffered from a drastic loss of power and failed to identify any discoveries. We endorse PALMRT as a robust and practical hypothesis test in scientific research for its superior error control, power preservation, and simplicity. An R package for PALMRT is available at \url{https://github.com/LeyingGuan/PairedRegression}.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (35)
  1. Permutation tests for linear models. Australian & New Zealand Journal of Statistics 43(1), 75–88.
  2. Barber, R. F. and E. J. Candès (2015). Controlling the false discovery rate via knockoffs. The Annals of Statistics 43(5).
  3. Predictive inference with the jackknife+.
  4. Davison, A. C. and D. V. Hinkley (1997). Bootstrap methods and their application. Number 1. Cambridge university press.
  5. Bootstrap confidence intervals. Statistical science 11(3), 189–228.
  6. Diciccio, T. J. and J. P. Romano (1988). A review of bootstrap confidence intervals. Journal of the Royal Statistical Society Series B: Statistical Methodology 50(3), 338–354.
  7. Draper, N. R. and D. M. Stoneman (1966). Testing for the inclusion of variables in einear regression by a randomisation technique. Technometrics 8(4), 695–699.
  8. Randomization tests. CRC press.
  9. Efron, B. (1987). Better bootstrap confidence intervals. Journal of the American statistical Association 82(397), 171–185.
  10. The automatic construction of bootstrap confidence intervals. Journal of Computational and Graphical Statistics 29(3), 608–619.
  11. Efron, B. and R. J. Tibshirani (1994). An introduction to the bootstrap. CRC press.
  12. Fisher, R. A. (1922). The goodness of fit of regression formulae, and the distribution of regression coefficients. Journal of the Royal Statistical Society 85(4), 597–612.
  13. Fisher, R. A. (1970). Statistical methods for research workers. In Breakthroughs in statistics: Methodology and distribution, pp.  66–70. Springer.
  14. Fisher, R. A. et al. (1924). 036: On a distribution yielding the error functions of several well known statistics.
  15. A nonstochastic interpretation of reported significance levels. Journal of Business & Economic Statistics 1(4), 292–298.
  16. Garthwaite, P. H. (1996). Confidence intervals from randomization tests. Biometrics, 1387–1393.
  17. Nested conformal prediction and quantile out-of-bag ensemble methods. Pattern Recognition 127, 108496.
  18. Hall, P. (1988). Theoretical comparison of bootstrap confidence intervals. The Annals of Statistics, 927–953.
  19. Hall, P. and S. R. Wilson (1991). Two guidelines for bootstrap hypothesis testing. Biometrics, 757–762.
  20. Conformalized semi-supervised random forest for classification and abnormality detection. arXiv preprint arXiv:2302.02237.
  21. Kennedy, F. E. (1995). Randomization tests in econometrics. Journal of Business & Economic Statistics 13(1), 85–94.
  22. Predictive inference is free with the jackknife+-after-bootstrap. Advances in Neural Information Processing Systems 33, 4138–4149.
  23. Distinguishing features of long covid identified through immune profiling. Nature, 1–3.
  24. Lei, L. and P. J. Bickel (2021). An assumption-free exact test for fixed-design linear models with exchangeable errors. Biometrika 108(2), 397–412.
  25. Manly, B. F. (2006). Randomization, bootstrap and Monte Carlo methods in biology, Volume 70. CRC press.
  26. Meinshausen, N. (2015). Group bound: confidence intervals for groups of variables in sparse high dimensional regression without assumptions on the design. Journal of the Royal Statistical Society Series B: Statistical Methodology 77(5), 923–945.
  27. Pitman, E. J. G. (1937). Significance tests which may be applied to samples from any populations. ii. the correlation coefficient test. Supplement to the Journal of the Royal Statistical Society 4(2), 225–232.
  28. Ren, Z. and R. F. Barber (2022). Derandomized knockoffs: leveraging e-values for false discovery rate control. arXiv preprint arXiv:2205.15461.
  29. Ter Braak, C. J. (1992). Permutation versus bootstrap significance tests in multiple regression and anova. In Bootstrapping and Related Techniques: Proceedings of an International Conference, Held in Trier, FRG, June 4–8, 1990, pp.  79–85. Springer.
  30. Cross-conformal predictive distributions. In conformal and probabilistic prediction and applications, pp.  37–51. PMLR.
  31. E-values: Calibration, combination and applications. The Annals of Statistics 49(3), 1736–1754.
  32. False discovery rate control with e-values. Journal of the Royal Statistical Society Series B: Statistical Methodology 84(3), 822–852.
  33. Residual permutation test for high-dimensional regression coefficient testing. arXiv preprint arXiv:2211.16182.
  34. Westfall, P. H. and S. S. Young (1993). Resampling-based multiple testing: Examples and methods for p-value adjustment, Volume 279. John Wiley & Sons.
  35. Permutation inference for the general linear model. Neuroimage 92, 381–397.
Citations (4)

Summary

Paper to Video (Beta)

Whiteboard

No one has generated a whiteboard explanation for this paper yet.

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Continue Learning

We haven't generated follow-up questions for this paper yet.

Authors (1)

Collections

Sign up for free to add this paper to one or more collections.