Efficient Normalized Conformal Prediction and Uncertainty Quantification for Anti-Cancer Drug Sensitivity Prediction with Deep Regression Forests (2402.14080v1)
Abstract: Deep learning models are being adopted and applied on various critical decision-making tasks, yet they are trained to provide point predictions without providing degrees of confidence. The trustworthiness of deep learning models can be increased if paired with uncertainty estimations. Conformal Prediction has emerged as a promising method to pair machine learning models with prediction intervals, allowing for a view of the model's uncertainty. However, popular uncertainty estimation methods for conformal prediction fail to provide heteroskedastic intervals that are equally accurate for all samples. In this paper, we propose a method to estimate the uncertainty of each sample by calculating the variance obtained from a Deep Regression Forest. We show that the deep regression forest variance improves the efficiency and coverage of normalized inductive conformal prediction on a drug response prediction task.
- Artificial intelligence in us health care delivery. New England Journal of Medicine, 389(4):348–358, 2023.
- Randomized clinical trials of machine learning interventions in health care: a systematic review. JAMA Network Open, 5(9):e2233946–e2233946, 2022.
- Machine learning–based prediction models for different clinical risks in different hospitals: evaluation of live performance. Journal of Medical Internet Research, 24(6):e34295, 2022.
- Steps to avoid overuse and misuse of machine learning in clinical research. Nature Medicine, 28(10):1996–1999, 2022.
- Algorithmic learning in a random world, volume 29. Springer, 2005.
- Inductive confidence machines for regression. In Machine Learning: ECML 2002: 13th European Conference on Machine Learning Helsinki, Finland, August 19–23, 2002 Proceedings 13, pages 345–356. Springer, 2002.
- Conformal prediction of small-molecule drug resistance in cancer cell lines. In Conformal and Probabilistic Prediction with Applications, pages 92–108. PMLR, 2022.
- Reliable prediction errors for deep neural networks using test-time dropout. Journal of chemical information and modeling, 59(7):3330–3339, 2019.
- Conformal prediction under covariate shift. Advances in neural information processing systems, 32, 2019.
- Conformal prediction beyond exchangeability. The Annals of Statistics, 51(2):816–845, 2023.
- Concepts and applications of conformal prediction in computational drug discovery. arXiv preprint arXiv:1908.03569, 2019.
- Fair conformal predictors for applications in medical imaging. In Proceedings of the AAAI Conference on Artificial Intelligence, volume 36, pages 12008–12016, 2022.
- Deep regression forests for age estimation. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition, pages 2304–2313, 2018.
- Conformalized quantile regression. Advances in neural information processing systems, 32, 2019.
- Mondrian conformal regressors. In Conformal and Probabilistic Prediction and Applications, pages 114–133. PMLR, 2020.
- A gentle introduction to conformal prediction and distribution-free uncertainty quantification. arXiv preprint arXiv:2107.07511, 2021.
- A survey of neural trees. arXiv preprint arXiv:2209.03415, 2022.
- Learning a deep regression forest for head pose estimation from a single depth image. Journal of Circuits, Systems and Computers, 30(08):2150139, 2021.
- A hybrid model of convolutional neural networks and deep regression forests for crowd counting. Applied Intelligence, 50:2818–2832, 2020.
- Deep neural decision forests. In Proceedings of the IEEE international conference on computer vision, pages 1467–1475, 2015.
- Convolutional ordinal regression forest for image ordinal estimation. IEEE Transactions on Neural Networks and Learning Systems, 33(8):4084–4095, 2021.
- Meta ordinal regression forest for medical image classification with ordinal labels. IEEE/CAA Journal of Automatica Sinica, 9(7):1233–1247, 2022.
- Meta ordinal regression forest for learning with unsure lung nodules. In 2020 IEEE International Conference on Bioinformatics and Biomedicine (BIBM), pages 442–445. IEEE, 2020.
- Self-paced deep regression forests with consideration on underrepresented examples. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part XXX 16, pages 271–287. Springer, 2020.
- Vladimir Vovk. Conditional validity of inductive conformal predictors. In Asian conference on machine learning, pages 475–490. PMLR, 2012.
- Distribution-free prediction bands for non-parametric regression. Journal of the Royal Statistical Society Series B: Statistical Methodology, 76(1):71–96, 2014.
- Conformal prediction: a unified review of theory and new challenges. Bernoulli, 29(1):1–23, 2023.
- Distribution-free predictive inference for regression. Journal of the American Statistical Association, 113(523):1094–1111, 2018.
- Federated learning framework integrating refined cnn and deep regression forests. Bioinformatics Advances, 3(1):vbad036, 2023.
- The cancer cell line encyclopedia enables predictive modelling of anticancer drug sensitivity. Nature, 483(7391):603–607, 2012.
- Cancer Cell Line Encyclopedia Consortium and Genomics of Drug Sensitivity in Cancer Consortium. Pharmacogenomic agreement between two cancer cell line data sets. Nature, 528(7580):84, 2015.
- Next-generation characterization of the cancer cell line encyclopedia. Nature, 569(7757):503–508, 2019.
- Evaluating the consistency of large-scale pharmacogenomic studies. Briefings in Bioinformatics, 20(5):1734–1753, 2019.
- Chun Wei Yap. Padel-descriptor: An open source software to calculate molecular descriptors and fingerprints. Journal of computational chemistry, 32(7):1466–1474, 2011.
- Daniel Nolte (1 paper)
- Souparno Ghosh (11 papers)
- Ranadip Pal (6 papers)