Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
173 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
46 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Seemingly unrelated Bayesian additive regression trees for cost-effectiveness analyses in healthcare (2404.02228v3)

Published 2 Apr 2024 in stat.ME, econ.EM, and stat.AP

Abstract: In recent years, theoretical results and simulation evidence have shown Bayesian additive regression trees to be a highly-effective method for nonparametric regression. Motivated by cost-effectiveness analyses in health economics, where interest lies in jointly modelling the costs of healthcare treatments and the associated health-related quality of life experienced by a patient, we propose a multivariate extension of BART which is applicable in regression analyses with several dependent outcome variables. Our framework allows for continuous or binary outcomes and overcomes some key limitations of existing multivariate BART models by allowing each individual response to be associated with different ensembles of trees, while still handling dependencies between the outcomes. In the case of continuous outcomes, our model is essentially a nonparametric version of seemingly unrelated regression. Likewise, our proposal for binary outcomes is a nonparametric generalisation of the multivariate probit model. We give suggestions for easily interpretable prior distributions, which allow specification of both informative and uninformative priors. We provide detailed discussions of MCMC sampling methods to conduct posterior inference. Our methods are implemented in the R package "subart". We showcase their performance through extensive simulation experiments and an application to an empirical case study from health economics. By also accommodating propensity scores in a manner befitting a causal analysis, we find substantial evidence for a novel trauma care intervention's cost-effectiveness.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (62)
  1. “On the efficiency of some estimators for modeling seemingly unrelated regression with heteroscedastic disturbances.” IOSR Journal of Mathematics, 14(4): 1–13.
  2. “Bayesian inference for a covariance matrix.” In The 26th Annual Conference on Applied Statistics in Agriculture, 71–82. Kansas State University.
  3. Baio, G. (2012). Bayesian Methods in Health Economics. Boca Ration, FL, U.S.A.: Chapman & Hall/CRC Biostatistics Series.
  4. Baldi, P. (2024). Probability: An Introduction Through Theory and Exercises. Universitext. Springer Nature.
  5. “Modeling covariance matrices in terms of standard deviations and correlations, with application to shrinkage.” Statistica Sinica, 1281–1311.
  6. Breiman, L. (1996). “Bagging predictors.” Machine Learning, 24: 123–140.
  7. Classification and Regression Trees. New York, NY, U.S.A.: Chapman and Hall/CRC Press, 1 edition. EBook Published 25 October 2017.
  8. Chakraborty, S. (2016). “Bayesian additive regression tree for seemingly unrelated regression with automatic tree selection.” In Handbook of Statistics, volume 35, 229–251. Elsevier.
  9. “Analysis of multivariate probit models.” Biometrika, 85(2): 347–361.
  10. “Bayesian CART model search.” Journal of the American Statistical Association, 93(443): 935–948.
  11. — (2010). “BART: Bayesian additive regression trees.” The Annals of Applied Statistics, 4(1): 266–298.
  12. “Tail forecasting with multivariate Bayesian additive regression trees.” International Economic Review, 64(3): 979–1022.
  13. “Rationing of total knee replacement: a cost-effectiveness analysis on a large trial data set.” BMJ Open, 2(1): e000332.
  14. dbarts: discrete Bayesian additive regression trees sampler. R package version 0.9-26. URL https://CRAN.R-project.org/package=dbarts
  15. “Automated versus do-it-yourself methods for causal inference: lessons learned from a data analysis competition.” Statistical Science, 34(1): 43–68.
  16. Methods for the Economic Evaluation of Health Care Programmes. Oxford, UK: Oxford University Press, 4th edition.
  17. “A scoping review of statistical methods for trial-based economic evaluations: the current state of play.” Health Economics, 31(12): 2680–2699.
  18. Friedman, J. H. (1991). “Multivariate adaptive regression splines.” The Annals of Statistics, 19(1): 1–67.
  19. — (2001). “Greedy function approximation: a gradient boosting machine.” The Annals of Statistics, 1189–1232.
  20. “Bayesian statistical economic evaluation methods for health technology assessment.” In Hamilton, J. (ed.), Economic Theory and Mathematical Models. Oxford, UK: Oxford Research Encyclopedia of Economics and Finance.
  21. “Strictly proper scoring rules, prediction, and estimation.” Journal of the American Statistical Association, 102(477): 359–378.
  22. “Bayesian regression tree models for causal inference: regularization, confounding, and heterogeneous effects (with discussion).” Bayesian Analysis, 15(3): 965–1056.
  23. “Bayesian backfitting (with comments and a rejoinder by the authors.” Statistical Science, 15(3): 196–223.
  24. Causal Inference: What If. Boca Raton, FL, U.S.A.: Chapman & Hall/CRC Press.
  25. Hill, J. L. (2011). “Bayesian nonparametric modeling for causal inference.” Journal of Computational and Graphical Statistics, 20(1): 217–240.
  26. Hoff, P. D. (2009). A First Course in Bayesian Statistical Methods, volume 580 of Springer Texts in Statistics. New York, NY, U.S.A.: Springer.
  27. ‘‘Simple marginally noninformative prior distributions for covariance matrices.” Bayesian Analysis, 8(2): 439–452.
  28. “Inference in Bayesian additive vector autoregressive tree models.” The Annals of Applied Statistics, 16(1): 104–123.
  29. Causal Inference for Statistics, Social, and Biomedical Sciences: An Introduction. New York, NY, U.S.A.: Cambridge University Press.
  30. “Novel Bayesian additive regression tree methodology for flood susceptibility modeling.” Water Resources Management, 35: 4621–4646.
  31. “bartMachine: machine learning with Bayesian additive regression trees.” Journal of Statistical Software, 70(i04): 1–40.
  32. “Multinomial probit Bayesian additive regression trees.” Stat, 5(1): 119–131.
  33. “The Dutch tariff: results and arguments for an effective design for national EQ-5D valuation studies.” Health Economics, 15(10): 1121–1132.
  34. “Bayesian causal inference: a critical review.” Philosophical Transactions of the Royal Society A, 381(2247): 20220153.
  35. “Bayesian regression tree ensembles that adapt to smoothness and sparsity.” Journal of the Royal Statistical Society Series B: Statistical Methodology, 80(5): 1087–1110.
  36. “Definition, interpretation and calculation of cost-effectiveness acceptability curves.” Health Economics, 9(7): 623–630.
  37. “GP-BART: a novel Bayesian additive regression trees approach using Gaussian processes.” Computational Statistics & Data Analysis, 190: 107858.
  38. “Bayesian causal forests for multivariate outcomes: application to Irish data from an international large scale education assessment.”
  39. “Bayesian non-parametric models for spatially indexed data of mixed type.” Journal of the Royal Statistical Society Series B: Statistical Methodology, 77(5): 973–999.
  40. “Spatial multivariate trees for big data Bayesian regression.” The Journal of Machine Learning Research, 23(1): 747–786.
  41. “Multivariate stochastic process models for correlated responses of mixed type.” Bayesian Analysis, 11(3): 797 – 820.
  42. “Bayesian additive regression trees with model trees.” Statistics and Computing, 31(20): 1–13.
  43. “Heteroscedastic BART via multiplicative regression trees.” Journal of Computational and Graphical Statistics, 29(2): 405–417.
  44. ‘‘Participation in marijuana, cocaine and heroin consumption in Australia: a multivariate probit approach.” Applied Economics, 41(4): 481–496.
  45. Robert, C. P. (1995). “Simulation of truncated normal variables.” Statistics and Computing, 5: 121–125.
  46. Rocková, V. (2020). “On semi-parametric inference for BART.” In International Conference on Machine Learning, 8137–8146. PMLR.
  47. “On theory for BART.” In The 22nd International Conference on Artificial Intelligence and Statistics, 2839–2848. PMLR.
  48. “Posterior concentration for Bayesian regression trees and forests.” Annals of Statistics, 48(4): 2108–2131.
  49. “All models are wrong, but which are useful? Comparing parametric and nonparametric estimation of causal effects in finite samples.” Journal of Causal Inference, 11(1): 20230022.
  50. “Bayesian additive regression trees for genotype by environment interaction models.” The Annals of Applied Statistics, 17(3): 1936–1957.
  51. Stan Development Team (2024a). “RStan: the R interface to Stan.” R package version 2.32.5. URL https://mc-stan.org/
  52. — (2024b). Stan User’s Guide. Stan Development Team. URL https://mc-stan.org/docs/
  53. “Bayesian additive regression trees for multivariate skewed responses.” Statistics in Medicine, 42(3): 246–263.
  54. ‘‘Predictive mean matching imputation of semicontinuous variables.” Statistica Neerlandica, 68(1): 61–90.
  55. “Mean field variational Bayes for elaborate distributions.” Bayesian Analysis, 6(4): 847–900.
  56. “Cost-effectiveness of the transmural trauma care model (TTCM) for the rehabilitation of trauma patients.” International Journal of Technology Assessment in Health Care, 35(4): 307–316.
  57. ‘‘Regression methods for covariate adjustment and subgroup analysis for non-censored cost-effectiveness data.” Health Economics, 13(5): 461–475.
  58. “Evaluating plate discipline in Major League Baseball with Bayesian additive regression trees.” Journal of Quantitative Analysis in Sports. Advanced online publication.
  59. Zellner, A. (1962). “An efficient method of estimating seemingly unrelated regressions and tests for aggregation bias.” Journal of the American Statistical Association, 57(298): 348–368.
  60. Zhang, X. (2020). “Parameter-expanded data augmentation for analyzing correlated binary data using multivariate probit models.” Statistics in Medicine, 39(25): 3637–3652.
  61. “Sampling correlation matrices in Bayesian models with correlated latent variables.” Journal of Computational and Graphical Statistics, 15(4): 880–896.
  62. “A Bayesian method for analyzing combinations of continuous, ordinal, and nominal categorical data with missing values.” Journal of Multivariate Analysis, 135: 43–58.

Summary

We haven't generated a summary for this paper yet.