Valid prediction intervals for regression problems (2107.00363v4)
Abstract: Over the last few decades, various methods have been proposed for estimating prediction intervals in regression settings, including Bayesian methods, ensemble methods, direct interval estimation methods and conformal prediction methods. An important issue is the calibration of these methods: the generated prediction intervals should have a predefined coverage level, without being overly conservative. In this work, we review the above four classes of methods from a conceptual and experimental point of view. Results on benchmark data sets from various domains highlight large fluctuations in performance from one data set to another. These observations can be attributed to the violation of certain assumptions that are inherent to some classes of methods. We illustrate how conformal prediction can be used as a general calibration procedure for methods that deliver poor results without a calibration step.
- DOI 10.7910/DVN/SIWH9F
- The Annals of Statistics 49(1), 486–507 (2021)
- Journal of the American statistical Association 112(518), 859–877 (2017)
- In: International Conference on Machine Learning, pp. 1613–1622. PMLR (2015)
- Annals of Mathematics and Artificial Intelligence 81(1), 125–144 (2017)
- Breiman, L.: Bagging predictors. Machine Learning 24(2), 123–140 (1996)
- Breiman, L.: Random forests. Machine Learning 45(1), 5–32 (2001)
- Buza, K.: Feedback prediction for blogs. In: Data Analysis, Machine Learning and Knowledge Discovery, pp. 145–152. Springer International Publishing (2014)
- IEEE Access 9, 23357–23384 (2021)
- In: 2017 IEEE Intelligent Vehicles Symposium (IV), pp. 1856–1860. IEEE (2017)
- In: Conference On Learning Theory, pp. 732–749. PMLR (2018)
- Proceedings of the Institution of Mechanical Engineers, Part M: Journal of Engineering for the Maritime Environment 230(1), 136–153 (2016)
- Corke, P.I.: A robotics toolbox for MATLAB. IEEE Robotics & Automation Magazine 3(1), 24–32 (1996)
- http://archive.ics.uci.edu/ml (2017). Data set repository
- Efron, B.: Jackknife-after-bootstrap standard errors and influence functions. Journal of the Royal Statistical Society: Series B (Methodological) 54(1), 83–111 (1992)
- PLOS ONE 13(10) (2018)
- Faulkenberry, D.G.: A method of obtaining prediction intervals. Journal of the American Statistical Association 68(342), 433–435 (1973)
- In 10th Brazilian Congress on Computational Intelligence (CBIC) pp. 1–7 (2011)
- Fink, D.: A compendium of conjugate priors. Tech. rep., Montana State Univeristy (1997)
- URL \url{https://arxiv.org/avs/1912.02757}
- Annals of Mathematical Statistics 27(1), 162–179 (1956)
- Gal, Y.: Uncertainty in deep learning. University of Cambridge 1(3) (2016)
- In: Proceedings of The 33rd International Conference on Machine Learning, Proceedings of Machine Learning Research, vol. 48, pp. 1050–1059. PMLR (2016)
- In: Proceedings of the 14th Conference on Uncertainty in Artificial Intelligence, UAI’98, pp. 148–155. Morgan Kaufmann Publishers Inc. (1998)
- In: Advances in Neural Information Processing Systems (2018)
- IEEE Transactions on Pattern Analysis and Machine Intelligence pp. 1–1 (2020)
- Gentle, J.E.: Monte Carlo methods for statistical inference. In: Computational Statistics, pp. 417–433. Springer (2009)
- Journal of the American statistical Association 102(477), 359–378 (2007)
- In: Case Studies in Applied Bayesian Data Science, pp. 45–87. Springer (2020)
- In: 3rd International Conference on Learning Representations, ICLR 2015 (2015)
- In: International Conference on Machine Learning, pp. 1321–1330. PMLR (2017)
- Havlicek, O.: GitHub Issue: Very slow QRF prediction. https://github.com/scikit-garden/scikit-garden/issues/74
- In: Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, pp. 41–50 (2019)
- In: Artificial Intelligence and Statistics, pp. 351–360. PMLR (2015)
- Heskes, T.: Practical confidence and prediction intervals. In: Proceedings of the 9th International Conference on Neural Information Processing Systems, NIPS’96, pp. 176–182. MIT Press (1996)
- In: Proceedings of the Sixth Annual Conference on Computational Learning Theory, COLT ’93, pp. 5–13. Association for Computing Machinery (1993)
- Journal of the American Medical Informatics Association 19(2), 263–274 (2012)
- Machine Learning 97(1-2), 155–176 (2014)
- In: Proceedings of the International Conference on Emerging Trends in Computer and Electronics Engineering (ICETCEE), pp. 13–18 (2012)
- In: Advances in Neural Information Processing Systems, vol. 30, pp. 5580–5590. Curran Associates (2017)
- IEEE Access 6, 54033–54041 (2018)
- IEEE Transactions on Neural Networks 22(9), 1341–1356 (2011). DOI 10.1109/TNN.2011.2162110
- IEEE Transactions on Neural Networks 22(3), 337–346 (2011)
- IEEE Transactions on Neural Networks and Learning Systems 26(8), 1810–1815 (2015)
- In: International Conference on Artificial Intelligence and Statistics, pp. 4346–4356. PMLR (2020)
- Journal of Economic Perspectives 15(4), 143–156 (2001)
- In: Proceedings of the 35th International Conference on Machine Learning, Proceedings of Machine Learning Research, vol. 80, pp. 2796–2804. PMLR (2018)
- In: Proceedings of the 20th International Conference on Artificial Intelligence and Statistics, Proceedings of Machine Learning Research, vol. 54, pp. 623–631. PMLR (2017)
- In: Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, pp. 6405–6416. Curran Associates (2017)
- CoRR abs/1511.06314 (2015). URL http://arxiv.org/abs/1511.06314
- Journal of the Royal Statistical Society: Series B: Statistical Methodology pp. 71–96 (2014)
- In: Advances in Neural Information Processing Systems, vol. 32. Curran Associates (2019)
- MacKay, D.J.: A practical bayesian framework for backpropagation networks. Neural Computation 4(3), 448–472 (1992)
- In: Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS’18, pp. 7047–7058. Curran Associates (2018)
- Meinshausen, N.: Quantile regression forests. Journal of Machine Learning Research 7(Jun), 983–999 (2006)
- In: 2020 IEEE International Conference on Robotics and Automation (ICRA), pp. 7344–7350 (2020)
- Miller, R.G.: The jackknife-a review. Biometrika 61(1), 1–15 (1974)
- In: Proceedings of the AAAI Conference on Artificial Intelligence, AAAI’15, vol. 29, pp. 2901–2907. AAAI Press (2015)
- Neal, R.M.: Bayesian Learning for Neural Networks. Springer-Verlag (1996)
- In: Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN’94), vol. 1, pp. 55–60. IEEE (1994)
- In: International Conference on Machine Learning (ICML), pp. 3956–3965. PMLR (2018)
- In: Proceedings of the IASTED International Conference on Artificial Intelligence and Applications (AIA 2008), pp. 64–69 (2008)
- In: European Conference on Machine Learning, pp. 345–356. Springer (2002)
- In: 19th IEEE International Conference on Tools with Artificial Intelligence (ICTAI 2007), vol. 2, pp. 388–395 (2007)
- Journal of Artificial Intelligence Research 40, 815–840 (2011)
- In: Advances in Neural Information Processing Systems 32, pp. 8024–8035. Curran Associates (2019)
- Ph.D. Thesis, University of Cambridge (2020)
- In: International Conference on Machine Learning, pp. 4075–4084. PMLR (2018)
- Journal of Machine Learning Research 12, 2825–2830 (2011)
- Journal of Construction Engineering and Management 142(2) (2016)
- European Journal of Operational Research 141(3), 660–678 (2002)
- Reynolds, D.A.: Gaussian mixture models. Encyclopedia of Biometrics 741, 659–663 (2009)
- In: Advances in Neural Information Processing Systems, pp. 3543–3553 (2019)
- In: In Proceedings of the International Joint Conference on Artificial Intelligence, pp. 722–726 (1999)
- Cambridge University Press (2016)
- Stat 9(1) (2020)
- Journal of Statistical Planning and Inference 195, 126–140 (2018)
- In: IEEE UKSim-AMSS 17th International Conference on Computer Modelling and Simulation (UKSim2015) (2015)
- In: 2017 International Conference on Computer, Communications and Electronics (Comptelix), pp. 162–167. IEEE (2017)
- In: Proceedings of the 8th International Conference on Neural Information Processing Systems, NIPS’95, pp. 190–196. MIT Press (1995)
- In: Proceedings of the 36th International Conference on Machine Learning, Proceedings of Machine Learning Research, vol. 97, pp. 5897–5906. PMLR (2019)
- Journal of Machine Learning Research 15(56), 1929–1958 (2014)
- In: International Conference on Machine Learning, pp. 4907–4916. PMLR (2018)
- arXiv preprint arXiv:2002.12860 (2020)
- In: International Conference on Machine Learning, pp. 9690–9700. PMLR (2020)
- Vovk, V.: Conditional validity of inductive conformal predictors. In: Proceedings of the Asian Conference on Machine Learning, Proceedings of Machine Learning Research, vol. 25, pp. 475–490. PMLR (2012)
- Vovk, V.: Cross-conformal predictors. Annals of Mathematics and Artificial Intelligence 74(1), 9–28 (2015)
- In: Proceedings of the Sixteenth International Conference on Machine Learning, ICML ’99, pp. 444–453. Morgan Kaufmann Publishers Inc. (1999)
- Springer Science & Business Media (2005)
- In: Conformal and Probabilistic Prediction and Applications, pp. 82–102. PMLR (2017)
- The Journal of Machine Learning Research 15(1), 1625–1651 (2014)
- In: Advances in Neural Information Processing Systems, pp. 514–520 (1996)
- In: International Conference on Machine Learning, pp. 1775–1784. PMLR (2015)
- Yeh, I.C.: Modeling of strength of high-performance concrete using artificial neural networks. Cement and Concrete Research 28(12), 1797–1808 (1998)
- The American Statistician 74(4), 392–406 (2020)