Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
167 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Post-selection Inference for Conformal Prediction: Trading off Coverage for Precision (2304.06158v3)

Published 12 Apr 2023 in stat.ME and stat.ML

Abstract: Conformal inference has played a pivotal role in providing uncertainty quantification for black-box ML prediction algorithms with finite sample guarantees. Traditionally, conformal prediction inference requires a data-independent specification of miscoverage level. In practical applications, one might want to update the miscoverage level after computing the prediction set. For example, in the context of binary classification, the analyst might start with a 95$\%$ prediction sets and see that most prediction sets contain all outcome classes. Prediction sets with both classes being undesirable, the analyst might desire to consider, say 80$\%$ prediction set. Construction of prediction sets that guarantee coverage with data-dependent miscoverage level can be considered as a post-selection inference problem. In this work, we develop simultaneous conformal inference to account for data-dependent miscoverage levels. Under the assumption of independent and identically distributed observations, our proposed methods have a finite sample simultaneous guarantee over all miscoverage levels. This allows practitioners to trade freely coverage probability for the quality of the prediction set by any criterion of their choice (say size of prediction set) while maintaining the finite sample guarantees similar to traditional conformal inference.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (53)
  1. Asymptotic theory of certain” goodness of fit” criteria based on stochastic processes. The annals of mathematical statistics, pages 193–212.
  2. The limits of distribution-free conditional predictive inference. Information and Inference: A Journal of the IMA, 10(2):455–482.
  3. Conformal prediction beyond exchangeability. arXiv preprint arXiv:2202.13415.
  4. Testing for outliers with conformal p-values. The Annals of Statistics, 51(1):149–178.
  5. Valid post-selection inference. The Annals of Statistics, pages 802–837.
  6. Goodness-of-fit test statistics that dominate the kolmogorov statistics. Zeitschrift für Wahrscheinlichkeitstheorie und verwandte Gebiete, 47(1):47–59.
  7. Improved online conformal prediction via strongly adaptive online learning. arXiv preprint arXiv:2302.07869.
  8. Training-conditional coverage for distribution-free predictive inference. arXiv preprint arXiv:2205.03647.
  9. Breiman, L. (2001). Random forests. Machine learning, 45:5–32.
  10. Airfoil self-noise and prediction. Technical report.
  11. Cam, L. L. (1960). An approximation theorem for the Poisson binomial distribution. Pacific Journal of Mathematics, 10(4):1181 – 1197.
  12. Conformalized survival analysis. Journal of the Royal Statistical Society Series B: Statistical Methodology, 85(1):24–45.
  13. Knowing what you know: valid and validated confidence sets in multiclass and multilabel prediction. arXiv preprint arXiv:2004.10181.
  14. Distributional conformal prediction. Proceedings of the National Academy of Sciences, 118(48):e2107794118.
  15. Selecting the number of principal components: Estimation of the true rank of a noisy matrix. The Annals of Statistics, pages 2590–2617.
  16. A new approach to tests and confidence bands for distribution functions. The Annals of Statistics, 51(1):260–289.
  17. Asymptotic minimax character of the sample distribution function and of the classical multinomial estimator. The Annals of Mathematical Statistics, pages 642–669.
  18. Eicker, F. (1979). The asymptotic distribution of the suprema of the standardized empirical processes. The Annals of Statistics, 7(1):116–138.
  19. Adaptive conformal inference under distribution shift. Advances in Neural Information Processing Systems, 34:1660–1672.
  20. Conformal inference for online prediction with arbitrary distribution shifts. arXiv preprint arXiv:2208.08401.
  21. Conformalized survival analysis with adaptive cutoffs. arXiv preprint arXiv:2211.01227.
  22. Guttman, I. (1967). Statistical tolerance regions. Classical and Bayesian.
  23. Some new inequalities for beta distributions. Statistics & Probability Letters, page 109783.
  24. Exact post-selection inference for changepoint detection and other generalized lasso problems. arXiv preprint arXiv:1606.03552.
  25. Goodness-of-fit tests via phi-divergences. The Annals of Statistics, 35(5).
  26. Selection by prediction with conformal p-values. arXiv preprint arXiv:2210.01408.
  27. Uniform convergence rate of the kernel density estimator adaptive to intrinsic volume dimension. In International Conference on Machine Learning, pages 3398–3407. PMLR.
  28. Adaptive, distribution-free prediction intervals for deep networks. In International Conference on Artificial Intelligence and Statistics, pages 4346–4356. PMLR.
  29. Statistical tolerance regions: theory, applications, and computation. John Wiley & Sons.
  30. Nested conformal prediction sets for classification with applications to probation data. The Annals of Applied Statistics, 17(1):761–785.
  31. Distribution-free predictive inference for regression. Journal of the American Statistical Association, 113(523):1094–1111.
  32. Distribution-free prediction sets. Journal of the American Statistical Association, 108(501):278–287.
  33. Conformal inference of counterfactuals and individual treatment effects. Journal of the Royal Statistical Society Series B: Statistical Methodology, 83(5):911–938.
  34. The essential histogram. Biometrika, 107(2):347–364.
  35. Conformal prediction for network-assisted regression. arXiv preprint arXiv:2302.10095.
  36. Massart, P. (1990). The tight constant in the dvoretzky-kiefer-wolfowitz inequality. The annals of Probability, pages 1269–1283.
  37. Quantile regression forests. Journal of machine learning research, 7(6).
  38. Olshen, R. A. (1973). The conditional level of the f—test. Journal of the American Statistical Association, 68(343):692–698.
  39. Owen, A. B. (1995). Nonparametric likelihood confidence bands for a distribution function. Journal of the American Statistical Association, 90(430):516–521.
  40. Prokhorov, Y. V. (1953). Asymptotic behavior of the binomial distribution. Uspekhi Matematicheskikh Nauk, 8(3):135–142.
  41. Optimal detection of a jump in the intensity of a poisson process or in a density with likelihood ratio statistics. Scandinavian Journal of Statistics, 40(4):752–769.
  42. Conformalized quantile regression. Advances in neural information processing systems, 32.
  43. Classification with valid and adaptive coverage. Advances in Neural Information Processing Systems, 33:3581–3591.
  44. Sarkar, S. K. (2008). Generalizing simes’ test and hochberg’s stepup procedure. The Annals of Statistics, 36(1):337–363.
  45. A comparison of some conformal quantile regression methods. Stat, 9(1):e261.
  46. Conformal prediction under covariate shift. Advances in neural information processing systems, 32.
  47. Valiant, L. G. (1984). A theory of the learnable. Communications of the ACM, 27(11):1134–1142.
  48. Vovk, V. (2012). Conditional validity of inductive conformal predictors. In Asian conference on machine learning, pages 475–490. PMLR.
  49. Machine-learning applications of algorithmic randomness. In Proceedings of the Sixteenth International Conference on Machine Learning, ICML ’99, page 444–453, San Francisco, CA, USA. Morgan Kaufmann Publishers Inc.
  50. Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms. arXiv preprint arXiv:1708.07747.
  51. Conformal prediction interval for dynamic time-series. In International Conference on Machine Learning, pages 11559–11569. PMLR.
  52. Doubly robust calibration of prediction sets under covariate shift. arXiv preprint arXiv:2203.01761.
  53. Adaptive conformal predictions for time series. In International Conference on Machine Learning, pages 25834–25866. PMLR.
Citations (3)

Summary

We haven't generated a summary for this paper yet.