Proper Scoring Rules for Survival Analysis (2305.00621v3)
Abstract: Survival analysis is the problem of estimating probability distributions for future event times, which can be seen as a problem in uncertainty quantification. Although there are fundamental theories on strictly proper scoring rules for uncertainty quantification, little is known about those for survival analysis. In this paper, we investigate extensions of four major strictly proper scoring rules for survival analysis and we prove that these extensions are proper under certain conditions, which arise from the discretization of the estimation of probability distributions. We also compare the estimation performances of these extended scoring rules by using real datasets, and the extensions of the logarithmic score and the Brier score performed the best.
- A time-dependent discrimination index for survival data. Statistics in Medicine, 24(24):3927–3944, 2005.
- Countdown regression: Sharp and calibrated survival predictions. In Proceedings of UAI 2019, pp. 145–155, 2019.
- Benedetti, R. Scoring rules for forecast verification. American Meteorological Society, 138(1):203–211, 2010.
- Pitfalls of epistemic uncertainty quantification through loss minimisation. In Proceedings of NeurIPS 2022, 2022.
- The c-index is not proper for the evaluation of t𝑡titalic_t-year predicted risks. Biostatistics, 20(2):347–357, 2018.
- Brier, G. W. Verification of forecasts expressed in terms of probability. Monthly Weather Review, 78(1):1–3, 1950.
- Calibration and uncertainty in neural time-to-event modeling. IEEE Transactions on Neural Networks and Learning Systems, 34(4):1666–1680, 2020.
- Cox, D. R. Regression models and life-tables. Journal of the Royal Statistical Society, Series B, 34(2):187–220, 1972.
- Time to default in credit scoring using survival analysis: a benchmark study. Journal of the Operational Research Society, 68(6):652–665, 2017.
- Use of nonclonal serum immunoglobulin free light chains to predict overall survival in the general population. Mayo Clinic Proceedings, 87(6):517–523, 2012.
- Strictly proper scoring rules, prediction, and estimation. Journal of the American Statistical Association, 102(477):359–378, 2007.
- X-CAL: Explicit calibration for survival analysis. In Proceedings of NeurIPS 2020, pp. 18296–18307, 2020.
- Good, I. J. Rational decisions. Journal of the Royal Statistical Society. Series B (Methodological), 14(1):107–114, 1952.
- Assessment and comparison of prognostic classification schemes for survival data. Statistics in Medicine, 18(17–18):2529–2545, 1999.
- Effective ways to build and evaluate individual survival distributions. Journal of Machine Learning Research, 21(85):1–63, 2020.
- Inverse-weighted survival games. In Proceedings of NeurIPS 2021, pp. 2160–2172, 2021.
- Evaluating the yield of medical tests. Journal of the American Medical Association, 247(18):2543–2546, 1982.
- Estimating calibrated individualized survival curves with deep learning. In Proceedings of AAAI 2021, pp. 240–248, 2021.
- Nonparametric estimation from incomplete observations. Journal of the American Statistical Association, 53(282):457–481, 1958.
- Adam: A method for stochastic optimization. In Proceedings of ICLR 2015, 2015.
- The SUPPORT prognostic model. Objective estimates of survival for seriously ill hospitalized adults. Study to understand prognoses and preferences for outcomes and risks of treatments. Annals of Internal Medicine, 122(3):191–203, 1995.
- Regression quantiles. Econometrica, 46(1):33–50, 1978.
- Quantile regression. Journal of economic perspectives, 15(4):143–156, 2001.
- Time-to-event prediction with neural networks and Cox regression. Journal of Machine Learning Research, 20(129):1–30, 2019.
- DeepHit: A deep learning approach to survival analysis with competing risks. In Proceedings of AAAI-18, pp. 2314–2321, 2018.
- Outcomes of localized prostate cancer following conservative management. Journal of the American Medical Association, 302(11):1202–1209, 2009.
- Philosophical Lectures on Probability: collected, edited, and annotated by Alberto Mura. Synthese Library. Springer Netherlands, 2008.
- Correction to censored regression quantiles by S. Portnoy, 98 (2003), 1001–1012. Journal of the American Statistical Association, 101(474):860–861, 2006.
- Decision Theory: Principles and Approaches. Wiley Series in Probability and Statistics. Wiley, 2009.
- Censored quantile regression neural networks. In Proceedings of NeurIPS 2022, 2022.
- Peng, L. Quantile regression for survival data. Annual Review of Statistics and Its Application, 8:413–437, 2021.
- Portnoy, S. Censored regression quantiles. Journal of the American Statistical Association, 98(464):1001–1012, 2003.
- R Core Team. R: A Language and Environment for Statistical Computing. R Foundation for Statistical Computing, Vienna, Austria, 2016. URL https://www.R-project.org/.
- Deep recurrent survival analysis. In Proceedings of AAAI-19, pp. 4798–4805, 2019.
- Survival regression with proper scoring rules and monotonic neural networks. In Proceedings of AISTATS 2022, 2022.
- A penny for your thoughts: a survey of methods for eliciting beliefs. Experimental Economics, 18:457–490, 2015.
- Avoiding c-hacking when evaluating survival distribution predictions with discrimination measures. Bioinformatics, 38(17):4178–4184, 2022.
- A hierarchical approach to multi-event survival analysis. In Proceedings of AAAI 2021, pp. 591–599, 2021.
- On the C-statistics for evaluating overall adequacy of risk prediction procedures with censored survival data. Statistics in Medicine, 30(10):1105–1117, 2011.
- Machine learning for survival analysis: A survey. ACM Computing Surveys, 51(6):1–36, 2019.
- SAFE: A neural survival analysis model for fraud early detection. In Proceedings of AAAI-19, pp. 1278–1285, 2019.
- Deep extended hazard models for survival analysis. In Proceedings of NeurIPS 2021, 2021.