From Conformal Predictions to Confidence Regions (2405.18601v1)
Abstract: Conformal prediction methodologies have significantly advanced the quantification of uncertainty in predictive models. Yet constructing confidence regions for model parameters remains a notable challenge, often requiring stringent assumptions on the data distribution or providing only asymptotic guarantees. We introduce a novel approach termed CCR, which combines conformal prediction intervals for the model outputs to establish confidence regions for the model parameters. We present coverage guarantees that hold under minimal assumptions on the noise and are valid in the finite-sample regime. Our approach is applicable to both split conformal predictions and black-box methodologies, including full or cross-conformal approaches. In the specific case of linear models, the derived confidence region is the feasible set of a Mixed-Integer Linear Program (MILP), which facilitates the deduction of confidence intervals for individual parameters and enables robust optimization. We empirically compare CCR to recent advancements in challenging settings, such as heteroskedastic and non-Gaussian noise.
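To make the idea of turning prediction intervals into a statement about parameters more concrete, here is a minimal sketch in Python. It is not the paper's CCR construction. It builds standard split conformal intervals from absolute residuals, then measures how many of those intervals contain the outputs of a candidate linear parameter. The helper names (`split_conformal_intervals`, `fraction_covered`), the synthetic data, and the idea of thresholding the covered fraction are assumptions made for illustration; the abstract does not specify the combination rule.

```python
import numpy as np


def split_conformal_intervals(predict, x_cal, y_cal, x_test, alpha=0.1):
    """Split conformal intervals using absolute residuals on a calibration set."""
    scores = np.sort(np.abs(y_cal - predict(x_cal)))
    n = len(scores)
    # Finite-sample rank: the ceil((n + 1)(1 - alpha))-th smallest score.
    k = int(np.ceil((n + 1) * (1 - alpha)))
    q = scores[k - 1] if k <= n else np.inf
    center = predict(x_test)
    return center - q, center + q


def fraction_covered(theta, x, lower, upper):
    """Share of intervals that contain the linear output x_i^T theta."""
    out = x @ theta
    return float(np.mean((lower <= out) & (out <= upper)))


# Synthetic linear data with heavy-tailed noise (illustrative only).
rng = np.random.default_rng(0)
theta_true = np.array([1.0, -2.0, 0.5])


def make_data(n):
    x = np.column_stack([np.ones(n), rng.normal(size=(n, 2))])
    y = x @ theta_true + rng.standard_t(df=3, size=n)
    return x, y


x_train, y_train = make_data(100)
x_cal, y_cal = make_data(100)
x_test, _ = make_data(50)

# Any predictor could be used here; least squares keeps the sketch short.
theta_hat, *_ = np.linalg.lstsq(x_train, y_train, rcond=None)
lower, upper = split_conformal_intervals(
    lambda x: x @ theta_hat, x_cal, y_cal, x_test, alpha=0.1
)

# A candidate far from the fit: its outputs miss every interval.
shifted = theta_hat + np.array([1000.0, 0.0, 0.0])
print(fraction_covered(theta_hat, x_test, lower, upper))
print(fraction_covered(shifted, x_test, lower, upper))
```

In this sketch, a candidate parameter would be kept when its covered fraction exceeds a chosen threshold; that threshold is a hypothetical placeholder here. For a linear model, each containment check is a pair of linear inequalities in the parameter, and counting how many hold is the kind of constraint that binary indicator variables can express, which is in line with the MILP formulation the abstract mentions.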