Benchmarking Observational Studies with Experimental Data under Right-Censoring (2402.15137v1)
Abstract: Drawing causal inferences from observational studies (OS) requires unverifiable validity assumptions; however, one can falsify those assumptions by benchmarking the OS with experimental data from a randomized controlled trial (RCT). A major limitation of existing procedures is not accounting for censoring, despite the abundance of RCTs and OSes that report right-censored time-to-event outcomes. We consider two cases where censoring time (1) is independent of time-to-event and (2) depends on time-to-event the same way in OS and RCT. For the former, we adopt a censoring-doubly-robust signal for the conditional average treatment effect (CATE) to facilitate an equivalence test of CATEs in OS and RCT, which serves as a proxy for testing if the validity assumptions hold. For the latter, we show that the same test can still be used even though unbiased CATE estimation may not be possible. We verify the effectiveness of our censoring-aware tests via semi-synthetic experiments and analyze RCT and OS data from the Women's Health Initiative study.
- Mostly Harmless Econometrics: An Empiricist’s Companion. Princeton University Press, 2009.
- Effects of early intervention on cognitive function of low birth weight preterm infants. The Journal of pediatrics, 120(3):350–359, 1992.
- Enabling counterfactual survival analysis with balanced representations. In Proceedings of the Conference on Health, Inference, and Learning, pages 133–145, 2021.
- Adjusted survival curves with inverse probability weights. Computer Methods and Programs in Biomedicine, 75(1):45–49, 2004.
- Causal inference methods for combining randomized trials and observational studies: a review. arXiv preprint arXiv:2011.08047, 2020.
- David R Cox. Regression models and life-tables. Journal of the Royal Statistical Society: Series B (Methodological), 34(2):187–202, 1972.
- Survite: Learning heterogeneous treatment effects from time-to-event data. Advances in Neural Information Processing Systems, 34:26740–26753, 2021.
- Bnt162b2 mrna covid-19 vaccine in a nationwide mass vaccination setting. New England Journal of Medicine, 2021.
- Generalizing causal inferences from individuals in randomized trials to all trial-eligible individuals. Biometrics, 75(2):685–694, June 2019.
- Extending inferences from a randomized trial to a new target population. Statistics in medicine, 39(14):1999–2014, 2020a.
- Benchmarking observational methods by comparing randomized trials and their emulations. Epidemiology, 31(5):614–619, 2020b.
- Cameron Davidson-Pilon. lifelines: survival analysis in python. Journal of Open Source Software, 4(40):1317, 2019. URL https://lifelines.readthedocs.io/en/latest/fitters/regression/CoxPHFitter.html.
- Hidden yet quantifiable: A lower bound for confounding strength using randomized trials. arXiv preprint arXiv:2312.03871, 2023.
- Testing for the unconfoundedness assumption using an instrumental assumption. Journal of Causal Inference, 2(2):187–199, 2014.
- Benchmarking observational analyses against randomized trials: a review of studies assessing propensity score methods. Journal of general internal medicine, 35:1396–1404, 2020.
- Using observational data for personalized medicine when clinical trial evidence is limited. Fertility and Sterility, 109(6):946–951, 2018.
- Copula-based deep survival models for dependent censoring. In Uncertainty in Artificial Intelligence, pages 669–680. PMLR, 2023.
- Government of Canada. Optimizing the use of real world evidence to inform regulatory decision-making, 2019. URL https://www.canada.ca/en/health-canada/services/drugs-health-products/drug-products/announcements/optimizing-real-world-evidence-regulatory-decisions.html.
- Reporting of observational studies explicitly aiming to emulate randomized trials: A systematic review. JAMA Network Open, 6(9):e2336023–e2336023, 2023.
- From sample average treatment effect to population average treatment effect on the treated: combining experimental with observational studies to estimate population treatment effects. Journal of the Royal Statistical Society. Series A,, 178(3):757–778, June 2015.
- Using big data to emulate a target trial when a randomized trial is not available. American Journal of Epidemiology, 183(8):758–764, 2016.
- Causal Inference. CRC Press, Boca Raton, FL, February 2021.
- A structural approach to selection bias. Epidemiology, pages 615–625, 2004.
- Observational studies analyzed like randomized experiments: an application to postmenopausal hormone therapy and coronary heart disease. Epidemiology (Cambridge, Mass.), 19(6):766, 2008.
- Per-protocol analyses of pragmatic trials. N Engl J Med, 377(14):1391–1398, 2017.
- Falsification before extrapolation in causal effect estimation. arxiv preprint arXiv:2209.13708, 2022.
- Falsification of internal and external validity in observational studies via conditional moment restrictions. In International Conference on Artificial Intelligence and Statistics, pages 5869–5898, 2023.
- Causal inference in statistics, social, and biomedical sciences. Cambridge University Press, 2015.
- The statistical analysis of failure time data. John Wiley & Sons, 2011.
- Detecting hidden confounding in observational data using multiple environments. Advances in Neural Information Processing Systems, 36, 2024.
- Censoring issues in survival analysis. Annual Review of Public Health, 18(1):83–104, 1997.
- Negative controls: a tool for detecting confounding and bias in observational studies. Epidemiology (Cambridge, Mass.), 21(3):383, 2010.
- Effect estimates in randomized trials and observational studies: comparing apples with apples. American Journal of Epidemiology, 188(8):1569–1577, 2019.
- Kernel conditional moment test via maximum moment restriction. In Conference on Uncertainty in Artificial Intelligence, pages 41–50. PMLR, 2020.
- NICE. Nice real-world evidence framework, 2022. URL https://www.nice.org.uk/corporate/ecd9/chapter/overview.
- The Book of Why: The New Science of Cause and Effect. Basic books, 2018.
- Combined postmenopausal hormone therapy and cardiovascular disease: toward resolving the discrepancy between observational studies and the women’s health initiative clinical trial. American journal of epidemiology, 162(5):404–414, 2005.
- Peter M Rothwell. External validity of randomised controlled trials:“to whom do the results of this trial apply?”. The Lancet, 365(9453):82–93, 2005.
- Daniel Rubin and Mark J van der Laan. A doubly robust censoring unbiased transformation. The international journal of biostatistics, 3(1), 2007.
- Debiased machine learning of conditional average treatment effects and other causal functions. The Econometrics Journal, 24(2):264–289, 2021.
- On negative outcome control of unobserved confounding as a generalization of difference-in-differences. Statistical science: a review journal of the Institute of Mathematical Statistics, 31(3):348, 2016.
- SPRINT Research Group. A randomized trial of intensive versus standard blood-pressure control. New England Journal of Medicine, 373(22):2103–2116, 2015.
- The use of propensity scores to assess the generalizability of results from randomized trials. Journal of the Royal Statistical Society: Series A (Statistics in Society), 174(2):369–386, 2011.
- Therapeutic Goods Administration. Real world evidence and patient reported outcomes in the regulatory context, 2023. URL https://www.tga.gov.au/real-world-evidence-rwe-and-patient-reported-outcomes-pros.
- Estimation of the conditional distribution in regression with censored data: a comparative study. Computational Statistics & Data Analysis, 35(4):487–500, 2001.
- Use of historical control data for assessing treatment effects in clinical trials. Pharmaceutical statistics, 13(1):41–54, 2014.
- Estimation and inference of heterogeneous treatment effects using random forests. Journal of the American Statistical Association, 113(523):1228–1242, 2018.
- Emulation of randomized clinical trials with nonrandomized database analyses: results of 32 clinical trials. JAMA, 329(16):1376–1385, 2023.
- Immortal time bias in observational studies. Jama, 325(7):686–687, 2021.
- Elastic integrative analysis of randomised trial and real-world data for treatment heterogeneity estimation. Journal of the Royal Statistical Society Series B: Statistical Methodology, 85(3):575–596, 2023.