Doublethink: simultaneous Bayesian-frequentist model-averaged hypothesis testing (2312.17566v3)
Abstract: Establishing the frequentist properties of Bayesian approaches widens their appeal and offers new understanding. In hypothesis testing, Bayesian model averaging addresses the problem that conclusions are sensitive to variable selection. But Bayesian false discovery rate (FDR) guarantees are sensitive to subjective prior assumptions. Here we show that Bayesian model-averaged hypothesis testing is a closed testing procedure that controls the frequentist familywise error rate (FWER) in the strong sense. To quantify the FWER, we use the theory of regular variation and likelihood asymptotics to derive a chi-squared tail approximation for the model-averaged posterior odds. Convergence is pointwise as the sample size grows and, in a simplified setting subject to a minimum effect size assumption, uniform. The 'Doublethink' method computes simultaneous posterior odds and asymptotic p-values for model-averaged hypothesis testing. We explore Doublethink through a Mendelian randomization study and simulations, comparing to approaches like LASSO, stepwise regression, the Benjamini-Hochberg procedure, the harmonic mean p-value and e-values. We consider the limitations of the approach, including finite-sample inflation, and mitigations, like testing groups of correlated variables. We discuss the benefits of Doublethink, including post-hoc variable selection, and its wider implications for the theory and practice of hypothesis testing.
- H. Akaike. A new look at the statistical model identification. IEEE Transactions on Automatic Control, 19(6):716–723, 1974.
- Identifying direct risk factors in UK Biobank with simultaneous Bayesian-frequentist model-averaged hypothesis testing using Doublethink. In preparation, 2024.
- M. S. Bartlett. A comment on D. V. lindley’s statistical paradox. Biometrika, 44(1–2):533–534, 1957.
- D. Cox and D. Hinkley. Theoretical Statistics. Chapman and Hall, London, 1974.
- The limiting distribution of the likelihood ratio statistic under a class of local alternatives. Sankhyā: The Indian Journal of Statistics, Series A, pages 209–224, 1970.
- Limit theory for bilinear processes with heavy-tailed noise. The Annals of Applied Probability, 6(4):1191–1210, 1996.
- A. C. Davison. Statistical models, volume 11. Cambridge university press, 2003.
- L. Held and M. Ott. On p-values and Bayes factors. Annual Review of Statistics and Its Application, 5:393–419, 2018.
- Matrix analysis. Cambridge university press, 2012.
- J. Hu and V. E. Johnson. Bayesian model selection using test statistics. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 71(1):143–158, 2009.
- H. Jeffreys. The Theory of Probability. Clarendon, Oxford, first edition, 1939.
- V. E. Johnson. Properties of bayes factors based on test statistics. Scandinavian Journal of statistics, 35(2):354–368, 2008.
- J. Karamata. Sur un mode de croissance régulière. théorèmes fondamentaux. Bull. Soc. Math. France, 61:55–62, 1933.
- R. E. Kass and L. Wasserman. A reference Bayesian test for nested hypotheses and its relationship to the schwarz criterion. Journal of the American Statistical Association, 90(431):928–934, 1995.
- D. V. Lindley. A statistical paradox. Biometrika, 44(1/2):187–192, 1957.
- On closed testing procedures with special reference to ordered analysis of variance. Biometrika, 63(3):655–660, 1976.
- C. D. Meyer and I. Stewart. Matrix analysis and applied linear algebra. SIAM, 2023.
- T. Mikosch. Regular variation, subexponentiality and their applications in probability theory. Eindhoven University of Technology, Eindhoven, The Netherlands, 1999.
- J. Neyman and E. S. Pearson. Ix. on the problem of the most efficient tests of statistical hypotheses. Philosophical Transactions of the Royal Society of London. Series A, Containing Papers of a Mathematical or Physical Character, 231(694–706):289–337, 1933.
- A. O’Hagan. Fractional Bayes factors for model comparison. Journal of the Royal Statistical Society: Series B (Methodological), 57(1):99–118, 1995.
- A. O’Hagan and J. J. Forster. Kendall’s Advanced Theory of Statistics Volume 2B: Bayesian Inference. Arnold, London, 2004.
- G. Orwell. Nineteen Eighty-Four. Secker & Warburg, London, 1949.
- H. Scheffé. The analysis of variance john wiley & sons. Inc., New York, pages 25–50, 1959.
- G. Schwarz et al. Estimating the dimension of a model. Annals of Statistics, 6(2):461–464, 1978.
- S. Stigler. Fisher and the 5% level. Chance, 21(4):12–12, 2008.
- Kendall’s Advanced Theory of Statistics Volume 2A: Classical Inference and the Linear Model. Hodder Education, London, 1998.
- A. Wald. Tests of statistical hypotheses concerning several parameters when the number of observations is large. Transactions of the American Mathematical Society, 54(3):426–482, 1943.
- S. S. Wilks. The large-sample distribution of the likelihood ratio for testing composite hypotheses. Annals of Mathematical Statistics, 9(1):60–62, 1938.
- D. J. Wilson. The harmonic mean p𝑝pitalic_p-value for combining dependent tests. Proceedings of the National Academy of Sciences, 116(4):1195–1200, 2019.
- A. Zellner. On assessing prior distributions and bayesian regression analysis with g-prior distributions. Bayesian inference and decision techniques, 1986.
- Y. Zhou and Y. Xiao. Tail asymptotics for the extremes of bivariate Gaussian random fields. Bernoulli, 23(3):1566–1598, 2017.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.