Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
144 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Calibrated sensitivity models (2405.08738v3)

Published 14 May 2024 in stat.ME

Abstract: In causal inference, sensitivity models assess how unmeasured confounders could alter causal analyses, but the sensitivity parameter -- which quantifies the degree of unmeasured confounding -- is often difficult to interpret. For this reason, researchers sometimes compare the sensitivity parameter to an estimate of measured confounding. This is known as calibration, or benchmarking. Although it can aid interpretation, calibration is typically conducted post hoc, and uncertainty in the estimate for unmeasured confounding is rarely accounted for. To address these limitations, we propose calibrated sensitivity models, which directly bound the degree of unmeasured confounding by a multiple of measured confounding. The calibrated sensitivity parameter is interpretable as a ratio of unmeasured to measured confounding, and uncertainty due to estimating measured confounding can be incorporated. Incorporating this uncertainty shows causal analyses can be less or more robust to unmeasured confounding than suggested by standard approaches. We develop efficient estimators and inferential methods for bounds on the average treatment effect with three calibrated sensitivity models, establishing parametric efficiency and asymptotic normality under doubly robust style nonparametric conditions. We illustrate our methods with an analysis of the effect of mothers' smoking on infant birthweight.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (94)
  1. Alan Agresti. Foundations of linear and generalized linear models. John Wiley & Sons, 2015.
  2. The costs of low birth weight. The Quarterly Journal of Economics, 120(3):1031–1083, 2005.
  3. Fast learning rates for plug-in classifiers. The Annals of Statistics, pages 608–633, 2007.
  4. Rudolf Beran. Minimum hellinger distance estimates for parametric models. The annals of Statistics, pages 445–463, 1977.
  5. Efficient and Adaptive Estimation for Semiparametric Models. Baltimore: Johns Hopkins University Press, 1993.
  6. Estimation of integral functionals of a density. The Annals of Statistics, 23(1):11–29, 1995.
  7. Sensitivity analysis via the proportion of unmeasured confounding. Journal of the American Statistical Association, 117(539):1540–1550, 2022.
  8. Sensitivity analysis for marginal structural models. arXiv preprint arXiv:2210.04681, 2022.
  9. Sensitivity analyses for unmeasured confounding assuming a marginal structural model for repeated measures. Statistics in medicine, 23(5):749–767, 2004.
  10. Assessing sensitivity to unmeasured confounding using a simulated potential confounder. Journal of Research on Educational Effectiveness, 9(3):395–420, 2016.
  11. Matias D Cattaneo. Efficient semiparametric estimation of multi-valued treatment effects under ignorability. Journal of Econometrics, 155(2):138–154, 2010.
  12. Debiased machine learning without sample-splitting for stable estimators. Advances in Neural Information Processing Systems, 35:3096–3109, 2022.
  13. Double/debiased machine learning for treatment and structural parameters. The Econometrics Journal, 21(1):C1–C68, 2018.
  14. Long story short: Omitted variable bias in causal machine learning. Technical report, National Bureau of Economic Research, 2022.
  15. Making sense of sensitivity: Extending omitted variable bias. Journal of the Royal Statistical Society Series B-Statistical Methodology, 82(1):39–67, 2020.
  16. sensemakr: Sensitivity analysis tools for ols in r and stata. Available at SSRN 3588978, 2020.
  17. Smoking and lung cancer: recent evidence and a discussion of some questions*. International Journal of Epidemiology, 38(5):1175–1191, October 1959. ISSN 1464-3685, 0300-5771.
  18. A non-parametric projection-based estimator for the probability of causation, with application to water sanitation in kenya. Journal of the Royal Statistical Society Series A: Statistics in Society, 183(4):1793–1818, 2020.
  19. Iván Díaz and Mark J van der Laan. Sensitivity analysis for causal inference under unmeasured confounding and measurement error problems. The international journal of biostatistics, 9(2):149–160, 2013.
  20. Targeted learning ensembles for optimal individualized treatment rules with time-to-event outcomes. Biometrika, 105(3):723–738, 2018.
  21. Sensitivity Analysis Without Assumptions. Epidemiology, 27(3):368–377, 2016. ISSN 1044-3983.
  22. Sharp sensitivity analysis for inverse propensity weighting via quantile balancing. Journal of the American Statistical Association, 118(544):2645–2657, 2023.
  23. Doubly-valid/doubly-sharp sensitivity analysis for causal inference with unmeasured confounding. Journal of the American Statistical Association, (just-accepted):1–23, 2024.
  24. Bradley Efron. Bootstrap methods: another look at the jackknife. In Breakthroughs in statistics: Methodology and distribution, pages 569–593. Springer, 1992.
  25. Three-way cross-fitting and pseudo-outcome regression for estimation of conditional effects and other linear functionals. arXiv preprint arXiv:2306.07230, 2023.
  26. Flexible sensitivity analysis for observational studies without observable implications. Journal of the American Statistical Association, 115(532):1730–1746, 2020.
  27. A distribution-free theory of nonparametric regression, volume 1. Springer, 2002.
  28. Statistical learning with sparsity. Monographs on statistics and applied probability, 143(143):8, 2015.
  29. Chad Hazlett. Angry or weary? how violence impacts attitudes toward peace among darfurian refugees. Journal of Conflict Resolution, 64(5):844–870, 2020.
  30. Demystifying statistical learning based on efficient influence functions. The American Statistician, 76(3):292–304, 2022.
  31. Calibrating sensitivity analyses to observed covariates in observational studies. Biometrics, 69(4):803–811, 2013.
  32. Variance-based sensitivity analysis for weighting estimators result in more informative bounds. arXiv preprint arXiv:2208.01691, 2022.
  33. Peter J Huber et al. The behavior of maximum likelihood estimates under nonstandard conditions. In Proceedings of the fifth Berkeley symposium on mathematical statistics and probability, volume 1, pages 221–233. Berkeley, CA: University of California Press, 1967.
  34. Guido W Imbens. Sensitivity to exogeneity assumptions in program evaluation. American Economic Review, 93(2):126–132, 2003.
  35. Sensitivity analysis under the f𝑓fitalic_f-sensitivity models: a distributional robustness perspective. arXiv preprint arXiv:2203.04373, 2022.
  36. Edward H Kennedy. Semiparametric doubly robust targeted double machine learning: a review. arXiv preprint arXiv:2203.06469, 2022.
  37. Edward H. Kennedy. npcausal: Nonparametric causal inference methods, 2023a. R package version 0.1.0.
  38. Edward H Kennedy. Towards optimal doubly robust estimation of heterogeneous causal effects. Electronic Journal of Statistics, 17(2):3008–3049, 2023b.
  39. Sharp instruments for classifying compliers and generalizing causal effects. The Annals of Statistics, 48(4):2008 – 2030, 2020.
  40. Semiparametric counterfactual density estimation. Biometrika, page asad017, 2023.
  41. Béatrice Laurent. Estimation of integral functionals of a density and its derivatives. Bernoulli, pages 181–211, 1997.
  42. Distribution-free predictive inference for regression. Journal of the American Statistical Association, 113(523):1094–1111, 2018.
  43. On estimation of the l r norm of a regression function. Probability theory and related fields, 113:221–253, 1999.
  44. An Introduction to Sensitivity Analysis for Unobserved Confounding in Nonexperimental Prevention Research. Prevention Science, 14(6):570–580, 2013.
  45. Flexible sensitivity analysis for causal inference in observational studies subject to unmeasured confounding. arXiv preprint arXiv:2305.17643, 2023.
  46. The statistics of sensitivity analyses. U.C. Berkeley Division of Biostatistics Working Paper Series, 341, 2015.
  47. Charles F. Manski. Nonparametric bounds on treatment effects. The American Economic Review, 80(2):319–323, 1990.
  48. Identification of treatment effects under conditional partial independence. Econometrica, 86(1):317–351, 2018.
  49. Assessing sensitivity to unconfoundedness: Estimation and inference. Journal of Business & Economic Statistics, pages 1–13, 2023.
  50. Nonparametric estimation of conditional incremental effects. Journal of Causal Inference, 12(1):20230024, 2024.
  51. RV Mises. On the asymptotic distribution of differentiable statistical functions. The annals of mathematical statistics, 18(3):309–348, 1947.
  52. Targeted maximum likelihood estimation for dynamic and static longitudinal marginal structural working models. Journal of causal inference, 2(2):147–185, 2014.
  53. R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2021. URL https://www.R-project.org/.
  54. Nonparametric Bounds and Sensitivity Analysis of Treatment Effects. Statistical Science, 29(4), 2014.
  55. Higher order influence functions and minimax estimation of nonlinear functionals. In Institute of Mathematical Statistics Collections, pages 335–421. Institute of Mathematical Statistics, 2008.
  56. James M Robins. The analysis of randomized and non-randomized aids treatment trials using a new approach to causal inference in longitudinal studies. Health service research methodology: a focus on AIDS, pages 113–159, 1989.
  57. James M Robins. Association, causation, and marginal structural models. Synthese, 121(1/2):151–179, 1999.
  58. Semiparametric efficiency in multivariate regression models with missing data. Journal of the American Statistical Association, 90(429):122–129, 1995.
  59. Estimation of regression coefficients when some regressors are not always observed. Journal of the American statistical Association, 89(427):846–866, 1994.
  60. Analysis of semiparametric regression models for repeated outcomes in the presence of missing data. Journal of the american statistical association, 90(429):106–121, 1995.
  61. Sensitivity analysis for selection bias and unmeasured confounding in missing data and causal inference models. In M. Elizabeth Halloran and Donald Berry, editors, Statistical Models in Epidemiology, the Environment, and Clinical Trials, pages 1–94, New York, NY, 2000. Springer New York.
  62. Paul R. Rosenbaum. Sensitivity to Hidden Bias. Springer New York, New York, NY, 2002.
  63. Assessing sensitivity to an unobserved binary covariate in an observational study with binary outcome. Journal of the Royal Statistical Society: Series B (Methodological), 45(2):212–218, 1983.
  64. Distributionally robust and generalizable inference. Statistical Science, 38(4):527–542, 2023.
  65. The shapley value in machine learning. In The 31st International Joint Conference on Artificial Intelligence and the 25th European Conference on Artificial Intelligence, pages 5572–5579. International Joint Conferences on Artificial Intelligence Organization, 2022.
  66. Heterogeneous interventional indirect effects with multiple mediators: non-parametric and semi-parametric approaches. arXiv preprint arXiv:2210.08272, 2022.
  67. Semiparametric sensitivity analysis: Unmeasured confounding in observational studies. arXiv preprint arXiv:2104.08300, 2021.
  68. Debiased machine learning of conditional average treatment effects and other causal functions. The Econometrics Journal, 24(2):264–289, 2021.
  69. The jackknife and bootstrap. Springer Science & Business Media, 2012.
  70. Lloyd S Shapley. A value for n-person games. 1953.
  71. Interpretable sensitivity analysis for balancing weights. Journal of the Royal Statistical Society Series A: Statistics in Society, 186(4):707–721, 2023.
  72. Zhiqiang Tan. A distributional approach for causal inference using propensity scores. Journal of the American Statistical Association, 101(476):1619–1637, 2006.
  73. Eric J Tchetgen Tchetgen and Tyler J VanderWeele. On causal inference in the presence of interference. Statistical methods in medical research, 21(1):55–75, 2012.
  74. Anastasios A Tsiatis. Semiparametric Theory and Missing Data. New York: Springer, 2006.
  75. Alexandre B Tsybakov. Introduction to Nonparametric Estimation. New York: Springer, 2009.
  76. Mark J Van der Laan and James M Robins. Unified methods for censored longitudinal data and causality, volume 5. Springer, 2003.
  77. Super learner. Statistical Applications in Genetics and Molecular Biology, 6(1), 2007.
  78. Sensitivity analysis. Targeted Learning in Data Science: Causal Inference for Complex Longitudinal Studies, pages 511–522, 2018.
  79. Aad W van der Vaart. Asymptotic Statistics. Cambridge: Cambridge University Press, 2000.
  80. Aad W van der Vaart and Jon A Wellner. Weak Convergence and Empirical Processes. Springer, 1996.
  81. AW van der Vaart. Semiparametric Statistics. Springer, 2002.
  82. Bias formulas for sensitivity analysis of unmeasured confounding for general outcomes, treatments, and confounders. Epidemiology, pages 42–52, 2011.
  83. Sensitivity analysis in observational research: introducing the e-value. Annals of internal medicine, 167(4):268–274, 2017.
  84. Sense and sensitivity analysis: Simple post-hoc analysis of bias due to unobserved confounding. In H. Larochelle, M. Ranzato, R. Hadsell, M.F. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 10999–11009. Curran Associates, Inc., 2020.
  85. Invited commentary: positivity in practice. American journal of epidemiology, 171(6):674–677, 2010.
  86. Halbert White. Using least squares to approximate unknown regression functions. International economic review, pages 149–170, 1980.
  87. Nonparametric variable importance assessment using machine learning techniques. Biometrics, 77(1):9–22, March 2021.
  88. ranger: A fast implementation of random forests for high dimensional data in C++ and R. Journal of Statistical Software, 77(1):1–17, 2017.
  89. Bounds on the conditional and average treatment effect with unobserved confounding factors. The Annals of Statistics, 50(5):2587–2615, 2022.
  90. Bo Zhang and Dylan S Small. A calibrated sensitivity analysis for matched observational studies with application to the effect of second-hand smoke exposure on blood lead levels in children. Journal of the Royal Statistical Society Series C: Applied Statistics, 69(5):1285–1305, 2020.
  91. Bo Zhang and Eric J Tchetgen Tchetgen. A semi-parametric approach to model-based sensitivity analysis in observational studies. Journal of the Royal Statistical Society Series A: Statistics in Society, 185(Supplement_2):S668–S691, 2022.
  92. Bounds and semiparametric inference in l∞superscript𝑙l^{\infty}italic_l start_POSTSUPERSCRIPT ∞ end_POSTSUPERSCRIPT- and l2superscript𝑙2l^{2}italic_l start_POSTSUPERSCRIPT 2 end_POSTSUPERSCRIPT- sensitivity analysis for observational studies. arXiv preprint arXiv:2211.04697, 2022.
  93. Sensitivity analysis for inverse probability weighting estimators via the percentile bootstrap. Journal of the Royal Statistical Society Series B: Statistical Methodology, 81(4):735–761, 2019.
  94. Wenjing Zheng and Mark J van der Laan. Asymptotic theory for cross-validated targeted maximum likelihood estimation. U.C. Berkeley Division of Biostatistics Working Paper Series, 2010.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com