Re-evaluating the impact of hormone replacement therapy on heart disease using match-adaptive randomization inference (2403.01330v1)
Abstract: Matching is an appealing way to design observational studies because it mimics the data structure produced by stratified randomized trials, pairing treated individuals with similar controls. After matching, inference is often conducted using methods tailored for stratified randomized trials in which treatments are permuted within matched pairs. However, in observational studies, matched pairs are not predetermined before treatment; instead, they are constructed based on observed treatment status. This introduces a challenge as the permutation distributions used in standard inference methods do not account for the possibility that permuting treatments might lead to a different selection of matched pairs ($Z$-dependence). To address this issue, we propose a novel and computationally efficient algorithm that characterizes and enables sampling from the correct conditional distribution of treatment after an optimal propensity score matching, accounting for $Z$-dependence. We show how this new procedure, called match-adaptive randomization inference, corrects for an anticonservative result in a well-known observational study investigating the impact of hormone replacement theory (HRT) on coronary heart disease and corroborates experimental findings about heterogeneous effects of HRT across different ages of initiation in women. Keywords: matching, causal inference, propensity score, permutation test, Type I error, graphs.
- Abadie, A. and Imbens, G. W. (2006), “Large sample properties of matching estimators for average treatment effects,” Econometrica, 74, 235–267.
- — (2011), “Bias-corrected matching estimators for average treatment effects,” Journal of Business & Economic Statistics, 29, 1–11.
- Anderson, G. L., Limacher, M., Assaf, A. R., Bassford, T., Beresford, S. A., Black, H., Bonds, D., Brunner, R., Brzyski, R., Caan, B., et al. (2004), “Effects of conjugated equine estrogen in postmenopausal women with hysterectomy: the Women’s Health Initiative randomized controlled trial.” Jama, 291, 1701–1712.
- Baiocchi, M. (2011), “Methodologies for observational studies of health care policy,” Ph.D. thesis, University of Pennsylvania.
- Basse, G., Feller, A., and Toulis, P. (2019), “Randomization tests of causal effects under interference,” Biometrika, 106, 487–494.
- Berrett, T. B., Wang, Y., Barber, R. F., and Samworth, R. J. (2020), “The conditional permutation test for independence while controlling for confounders,” Journal of the Royal Statistical Society: Series B (Statistical Methodology), 82, 175–197.
- Branson, Z. and Miratrix, L. W. (2019), “Randomization tests that condition on non-categorical covariate balance,” Journal of Causal Inference, 7.
- Brown Jr, B. W. (1980), “Statistical controversies in the design of clinical trials—some personal views,” Controlled Clinical Trials, 1, 13–27.
- Cagnacci, A. and Venier, M. (2019), “The controversial history of hormone replacement therapy,” Medicina, 55, 602.
- Chang, T.-H. and Stuart, E. A. (2022), “Propensity score methods for observational studies with clustered data: A review,” Statistics in Medicine, 41, 3612–3626.
- for the Women’s Health Initiative Investigators, W. G. (2002), “Risks and benefits of estrogen plus progestin in healthy postmenopausal women: principal results from the Women’s Health Initiative randomized controlled trial,” Jama, 288, 321–333.
- Grady, D., Rubin, S. M., Petitti, D. B., Fox, C. S., Black, D., Ettinger, B., Ernster, V. L., and Cummings, S. R. (1992), “Hormone therapy to prevent disease and prolong life in postmenopausal women,” Annals of internal medicine, 117, 1016–1037.
- Grodstein, F., Stampfer, M. J., Colditz, G. A., Willett, W. C., Manson, J. E., Joffe, M., Rosner, B., Fuchs, C., Hankinson, S. E., Hunter, D. J., et al. (1997), “Postmenopausal hormone therapy and mortality,” New England Journal of Medicine, 336, 1769–1776.
- Guo, K. and Rothenhäusler, D. (2022), “On the statistical role of inexact matching in observational studies,” Biometrika, asac066.
- Hansen, B. B. (2004), “Full matching in an observational study of coaching for the SAT,” Journal of the American Statistical Association, 99, 609–618.
- — (2009), “Propensity score matching to recover latent experiments: diagnostics and asymptotics,” Tech. Rep. 486, University of Michigan.
- — (2023), “Matching calipers and the precision of index estimation,” arXiv preprint arXiv:2301.04109.
- Hansen, B. B., Rosenbaum, P. R., and Small, D. S. (2014), “Clustered treatment assignments and sensitivity to unmeasured biases in observational studies,” Journal of the American Statistical Association, 109, 133–144.
- Herrington, D. M., Reboussin, D. M., Brosnihan, K. B., Sharp, P. C., Shumaker, S. A., Snyder, T. E., Furberg, C. D., Kowalchuk, G. J., Stuckey, T. D., Rogers, W. J., et al. (2000), “Effects of estrogen replacement on the progression of coronary-artery atherosclerosis,” New England Journal of Medicine, 343, 522–529.
- Hodis, H. N. and Mack, W. J. (2008), “Postmenopausal hormone therapy and cardiovascular disease in perspective,” Clinical obstetrics and gynecology, 51, 564.
- Hopcroft, J. and Tarjan, R. (1973), “Algorithm 447: efficient algorithms for graph manipulation,” Communications of the ACM, 16, 372–378.
- Hulley, S., Grady, D., Bush, T., Furberg, C., Herrington, D., Riggs, B., Vittinghoff, E., Heart, Group, E. R. S. H. R., Heart, Group, E. R. S. H. R., et al. (1998), “Randomized trial of estrogen plus progestin for secondary prevention of coronary heart disease in postmenopausal women,” Jama, 280, 605–613.
- Lobo, R. A. (2017), “Hormone-replacement therapy: current thinking,” Nature Reviews Endocrinology, 13, 220–231.
- Luo, X., Dasgupta, T., Xie, M., and Liu, R. Y. (2021), “Leveraging the Fisher randomization test using confidence distributions: Inference, combination and fusion learning,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 83, 777–797.
- Manson, J. E., Chlebowski, R. T., Stefanick, M. L., Aragaki, A. K., Rossouw, J. E., Prentice, R. L., Anderson, G., Howard, B. V., Thomson, C. A., LaCroix, A. Z., et al. (2013), “Menopausal hormone therapy and health outcomes during the intervention and extended poststopping phases of the Women’s Health Initiative randomized trials,” Jama, 310, 1353–1368.
- Ming, K. and Rosenbaum, P. R. (2000), “Substantial gains in bias reduction from matching with a variable number of controls,” Biometrics, 56, 118–124.
- Pashley, N. E., Basse, G. W., and Miratrix, L. W. (2021), “Conditional as-if analyses in randomized experiments,” Journal of Causal Inference, 9, 264–284.
- Pimentel, S. D. and Huang, Y. (2023), “Covariate-adaptive randomization inference in matched designs,” arXiv preprint arXiv:2207.05019.
- Pimentel, S. D., Kelz, R. R., Silber, J. H., and Rosenbaum, P. R. (2015), “Large, sparse optimal matching with refined covariate balance in an observational study of the health outcomes produced by new surgeons,” Journal of the American Statistical Association, 110, 515–527.
- Prentice, R. L., Langer, R., Stefanick, M. L., Howard, B. V., Pettinger, M., Anderson, G., Barad, D., Curb, J. D., Kotchen, J., Kuller, L., et al. (2005), “Combined postmenopausal hormone therapy and cardiovascular disease: toward resolving the discrepancy between observational studies and the Women’s Health Initiative clinical trial,” American journal of epidemiology, 162, 404–414.
- Puelz, D., Basse, G., Feller, A., and Toulis, P. (2022), “A graph-theoretic approach to randomization tests of causal effects under general interference,” Journal of the Royal Statistical Society Series B: Statistical Methodology, 84, 174–204.
- Rosenbaum, P. R. (1987), “Sensitivity analysis for certain permutation inferences in matched observational studies,” Biometrika, 74, 13–26.
- — (1989), “Optimal matching for observational studies,” Journal of the American Statistical Association, 84, 1024–1032.
- — (2002a), “Covariance adjustment in randomized experiments and observational studies,” Statistical Science, 17, 286–327.
- — (2020b), “Modern algorithms for matching in observational studies,” Annual Review of Statistics and Its Application, 7, 143–176.
- Rosenbaum, P. R. and Silber, J. H. (2001), “Matching and thick description in an observational study of mortality after surgery,” Biostatistics, 2, 217–232.
- Rubin, D. B. (1980), “Bias Reduction Using Mahalanobis-Metric Matching,” Biometrics, 36, 293–298.
- — (2007), “The design versus the analysis of observational studies for causal effects: parallels with the design of randomized trials,” Statistics in medicine, 26, 20–36.
- — (2008), “For objective causal inference, design trumps analysis,” The Annals of Applied Statistics, 2, 808 – 840.
- Ruzankin, P. S. (2020), “A fast algorithm for maximal propensity score matching,” Methodology and Computing in Applied Probability, 22, 477–495.
- Salpeter, S. R., Walsh, J. M., Greyber, E., and Salpeter, E. E. (2006), “Brief report: coronary heart disease events associated with hormone therapy in younger and older women: a meta-analysis,” Journal of general internal medicine, 21, 363–366.
- Sävje, F. (2021), “On the inconsistency of matching without replacement,” Biometrika.
- Shaikh, A. M. and Toulis, P. (2021), “Randomization tests in observational studies with staggered adoption of treatment,” Journal of the American Statistical Association, 116, 1835–1848.
- Stampfer, M. J. and Colditz, G. A. (1991), “Estrogen replacement therapy and coronary heart disease: a quantitative assessment of the epidemiologic evidence,” Preventive medicine, 20, 47–63.
- Stuart, E. A. (2010), “Matching methods for causal inference: A review and a look forward,” Statistical Science, 25, 1–21.
- Women’s Health Initiative Study Group (1998), “Design of the Women’s Health Initiative clinical trial and observational study,” Controlled clinical trials, 19, 61–109.
- Yu, R., Silber, J. H., and Rosenbaum, P. R. (2020), “Matching methods for observational studies derived from large administrative databases,” Statistical Science, 35, 338–355.
- Yu, R., Small, D. S., Harding, D., Aveldanes, J., and Rosenbaum, P. R. (2021a), “Optimal matching for observational studies that integrate quantitative and qualitative research,” Statistics and Public Policy, 8, 42–52.
- Yu, R., Small, D. S., and Rosenbaum, P. R. (2021b), “The information in covariate imbalance in studies of hormone replacement therapy,” The Annals of Applied Statistics, 15, 2023–2042.
- Zhang, Y. and Zhao, Q. (2023), “What is a randomization test?” Journal of the American Statistical Association, 1–29.