Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 90 tok/s
Gemini 2.5 Pro 29 tok/s Pro
GPT-5 Medium 14 tok/s Pro
GPT-5 High 17 tok/s Pro
GPT-4o 101 tok/s Pro
Kimi K2 195 tok/s Pro
GPT OSS 120B 456 tok/s Pro
Claude Sonnet 4 39 tok/s Pro
2000 character limit reached

On Training Survival Models with Scoring Rules (2403.13150v2)

Published 19 Mar 2024 in cs.LG, cs.AI, stat.CO, and stat.ML

Abstract: Scoring rules are an established way of comparing predictive performances across model classes. In the context of survival analysis, they require adaptation in order to accommodate censoring. This work investigates using scoring rules for model training rather than evaluation. Doing so, we establish a general framework for training survival models that is model agnostic and can learn event time distributions parametrically or non-parametrically. In addition, our framework is not restricted to any specific scoring rule. While we focus on neural network-based implementations, we also provide proof-of-concept implementations using gradient boosting, generalized additive models, and trees. Empirical comparisons on synthetic and real-world data indicate that scoring rules can be successfully incorporated into model training and yield competitive predictive performance with established time-to-event models.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (32)
  1. Ahmed M Alaa and Mihaela van der Schaar. Deep multi-task gaussian processes for survival analysis with competing risks. In Proceedings of the 31st International Conference on Neural Information Processing Systems, pages 2326–2334. Curran Associates Inc., 2017.
  2. Countdown regression: sharp and calibrated survival predictions. In Uncertainty in Artificial Intelligence, pages 145–155. PMLR, 2020.
  3. Penalized estimation of complex, non-linear exposure-lag-response associations. Biostatistics, 20(2):315–331, April 2019.
  4. The c-index is not proper for the evaluation of-year predicted risks. Biostatistics, 20(2):347–357, 2019.
  5. D. R. Cox. Regression Models and Life-Tables. Journal of the Royal Statistical Society. Series B (Methodological), 34(2):187–220, 1972.
  6. The genomic and transcriptomic architecture of 2,000 breast tumours reveals novel subgroups. Nature, 486(7403):346–352, 2012.
  7. Theory and applications of proper scoring rules. Metron, 72(2):169–183, 2014.
  8. Semi-structured subspace inference. In Proceedings of The 27th International Conference on Artificial Intelligence and Statistics, Proceedings of Machine Learning Research. PMLR, 2024. Accepted.
  9. Frequentist uncertainty quantification in semi-structured neural networks. In Proceedings of The 26th International Conference on Artificial Intelligence and Statistics, volume 206 of Proceedings of Machine Learning Research, pages 1924–1941. PMLR, 2023.
  10. A scalable discrete-time survival model for neural networks. PeerJ, 7:e6257, January 2019.
  11. Assessment and comparison of prognostic classification schemes for survival data. Statistics in medicine, 18:2529–2545, 1999.
  12. Pitfalls of the concordance index for survival outcomes. Statistics in Medicine, 2023.
  13. Deepsurv: personalized treatment recommender system using a cox proportional hazards deep neural network. BMC medical research methodology, 18(1):1–12, 2018.
  14. Semi-structured deep piecewise exponential models. In Survival Prediction-Algorithms, Challenges and Applications, pages 40–53. PMLR, 2021.
  15. DeepPAMM: Deep piecewise exponential additive mixed models for complex hazard structures in survival analysis. In Pacific-Asia Conference on Knowledge Discovery and Data Mining, pages 249–261, 2022.
  16. A Long-Term Study of Prognosis in Monoclonal Gammopathy of Undetermined Significance. New England Journal of Medicine, 346(8):564–569, February 2002.
  17. DeepHit: A Deep Learning Approach to Survival Analysis With Competing Risks. In 23nd AAAI Conference on Artificial Intelligence, 2018.
  18. Survival regression with proper scoring rules and monotonic neural networks. In International Conference on Artificial Intelligence and Statistics, pages 1190–1205. PMLR, 2022.
  19. David Rügamer. A new PHO-rmula for improved performance of semi-structured networks. In Proceedings of the 40th International Conference on Machine Learning, volume 202 of Proceedings of Machine Learning Research, pages 29291–29305. PMLR, 23–29 Jul 2023.
  20. Semi-structured distributional regression. The American Statistician, 0(0):1–12, 2023.
  21. Randomized 2 x 2 trial evaluating hormonal treatment and the duration of chemotherapy in node-positive breast cancer patients. german breast cancer study group. Journal of Clinical Oncology, 12(10):2086–2093, 1994.
  22. Raphael Sonabend. A theoretical and methodological framework for machine learning in survival analysis: Enabling transparent and accessible predictive modelling on right-censored time-to-event data. PhD thesis, UCL (University College London), 2021.
  23. Raphael Sonabend. Scoring rules in survival analysis. arXiv preprint arXiv:2212.05260, 2022.
  24. Avoiding C-hacking when evaluating survival distribution predictions with discrimination measures. Bioinformatics, 38(17):4178–4184, 07 2022. ISSN 1367-4803. doi: 10.1093/bioinformatics/btac451.
  25. Identification of biomarker-by-treatment interactions in randomized clinical trials with survival outcomes and high-dimensional spaces. Biometrical Journal, 59(4):685–701, 2017. ISSN 1521-4036. doi: 10.1002/bimj.201500234.
  26. How long does it take to build a nuclear power plant? A non-parametric event history approach with P-splines. Energy Policy, 70:163–171, July 2014. ISSN 0301-4215. doi: 10.1016/j.enpol.2014.03.015. http://www.sciencedirect.com/science/article/pii/S0301421514001621.
  27. Machine Learning for Survival Analysis: A Survey. ACM Computing Surveys (CSUR), 51(6):110:1–110:36, February 2019.
  28. Lee-Jen Wei. The accelerated failure time model: a useful alternative to the cox regression model in survival analysis. Statistics in medicine, 11(14-15):1871–1879, 1992.
  29. Deep learning for survival analysis: a review. Artificial Intelligence Review, 57(3):65, 2024.
  30. Simon N. Wood. Generalized Additive Models: An Introduction with R. Chapman & Hall/Crc Texts in Statistical Science, Boca Raton, 2 rev ed. edition, June 2017.
  31. Hiroki Yanagisawa. Proper scoring rules for survival analysis. arXiv preprint arXiv:2305.00621, 2023.
  32. Purchase behavior prediction in m-commerce with an optimized sampling methods. In 2015 IEEE International Conference on Data Mining Workshop (ICDMW), pages 1085–1092. IEEE, 2015.

Summary

We haven't generated a summary for this paper yet.

Lightbulb On Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets

This paper has been mentioned in 3 posts and received 4 likes.

Don't miss out on important new AI/ML research

See which papers are being discussed right now on X, Reddit, and more:

“Emergent Mind helps me see which AI papers have caught fire online.”

Philip

Philip

Creator, AI Explained on YouTube