
Case-Base Neural Networks: survival analysis with time-varying, higher-order interactions (2301.06535v4)

Published 16 Jan 2023 in stat.ML and cs.LG

Abstract: In the context of survival analysis, data-driven neural network-based methods have been developed to model complex covariate effects. While these methods may provide better predictive performance than regression-based approaches, not all can model time-varying interactions and complex baseline hazards. To address this, we propose Case-Base Neural Networks (CBNNs) as a new approach that combines the case-base sampling framework with flexible neural network architectures. Using a novel sampling scheme and data augmentation to naturally account for censoring, we construct a feed-forward neural network that includes time as an input. CBNNs predict the probability of an event occurring at a given moment to estimate the full hazard function. We compare the performance of CBNNs to regression and neural network-based survival methods in a simulation and three case studies using two time-dependent metrics. First, we examine performance on a simulation involving a complex baseline hazard and time-varying interactions to assess all methods, with CBNN outperforming competitors. Then, we apply all methods to three real data applications, with CBNNs outperforming the competing models in two studies and showing similar performance in the third. Our results highlight the benefit of combining case-base sampling with deep learning to provide a simple and flexible framework for data-driven modeling of single event survival outcomes that estimates time-varying effects and a complex baseline hazard by design. An R package is available at https://github.com/Jesse-Islam/cbnn.
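The case-base sampling idea mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation or the API of the `cbnn` package: the function names `case_base_sample` and `hazard_from_probability`, the `ratio` parameter, and the uniform person-moment sampling shown here are assumptions chosen for illustration, following the general case-base framework. Observed events form the case series, randomly sampled person-moments form the base series, and a classifier that takes time as an input is trained to tell them apart. With an offset of log(B/b), where B is the total follow-up time and b is the base series size, the classifier's odds can be converted back into a hazard.

```python
import numpy as np

def case_base_sample(times, events, ratio=10, seed=0):
    """Build a case-base dataset (illustrative sketch, hypothetical helper).

    times  : follow-up time per individual (event or censoring time)
    events : 1 if the event was observed, 0 if censored
    ratio  : number of base-series person-moments sampled per case (assumed)
    """
    rng = np.random.default_rng(seed)
    times = np.asarray(times, dtype=float)
    events = np.asarray(events, dtype=int)

    n_cases = int(events.sum())
    total_time = times.sum()          # B: total person-time at risk
    n_base = ratio * n_cases          # b: size of the base series

    # Base series: pick individuals proportionally to their follow-up time,
    # then pick a uniform moment within that individual's follow-up.
    base_who = rng.choice(len(times), size=n_base, p=times / total_time)
    base_t = rng.uniform(0.0, times[base_who])

    # Case series: every observed event, taken at its event time.
    case_who = np.flatnonzero(events == 1)

    t = np.concatenate([times[case_who], base_t])
    who = np.concatenate([case_who, base_who])   # index into covariate rows
    y = np.concatenate([np.ones(n_cases, dtype=int),
                        np.zeros(n_base, dtype=int)])
    offset = np.log(total_time / n_base)         # log(B / b)
    return t, who, y, offset

def hazard_from_probability(p, offset):
    """Convert a predicted case probability into a hazard: odds * (b / B)."""
    return p / (1.0 - p) * np.exp(-offset)
```

In this sketch, a network would receive the sampled time `t` together with the covariates of individual `who` and predict `y`. Censoring needs no special handling in the loss, because censored individuals contribute only base-series moments. As a sanity check, a model with no inputs predicts `p = y.mean()`, and `hazard_from_probability` then reduces to the familiar events-per-person-time rate.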

