Efficient Conformal Prediction under Data Heterogeneity (2312.15799v2)

Published 25 Dec 2023 in stat.ML and cs.LG

Abstract: Conformal Prediction (CP) stands out as a robust framework for uncertainty quantification, which is crucial for ensuring the reliability of predictions. However, common CP methods heavily rely on data exchangeability, a condition often violated in practice. Existing approaches for tackling non-exchangeability lead to methods that are not computable beyond the simplest examples. This work introduces a new efficient approach to CP that produces provably valid confidence sets for fairly general non-exchangeable data distributions. We illustrate the general theory with applications to the challenging setting of federated learning under data heterogeneity between agents. Our method allows constructing provably valid personalized prediction sets for agents in a fully federated way. The effectiveness of the proposed method is demonstrated in a series of experiments on real-world datasets.
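For readers new to the framework, the following is a minimal sketch of standard split conformal prediction for regression. It is not the method proposed in this paper: it assumes exchangeable data, which is exactly the condition the abstract says is often violated. The function names `conformal_quantile` and `prediction_interval`, and the choice of absolute residuals as the nonconformity score, are illustrative assumptions.

```python
import math


def conformal_quantile(scores, alpha):
    """Return the ceil((n + 1) * (1 - alpha))-th smallest calibration score.

    `scores` are nonconformity scores on a held-out calibration set
    (assumed here to be absolute residuals |y - prediction|).
    If the required rank exceeds n, the threshold is infinite, which
    corresponds to a trivial prediction set covering everything.
    """
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))
    if k > n:
        return math.inf
    return sorted(scores)[k - 1]


def prediction_interval(predict, x, q):
    """Symmetric interval around the point prediction with half-width q."""
    y_hat = predict(x)
    return (y_hat - q, y_hat + q)


if __name__ == "__main__":
    # Hypothetical calibration residuals and a toy model, for illustration only.
    calibration_scores = [0.3, 1.2, 0.7, 0.1, 0.9, 0.4, 1.5, 0.6, 0.2, 0.8]
    q = conformal_quantile(calibration_scores, alpha=0.1)
    print(prediction_interval(lambda x: 2.0 * x, 3.0, q))
```

Under exchangeability, the interval built this way covers a new response with probability at least 1 - alpha, marginally over the calibration and test data. When the data are heterogeneous, for instance across agents in federated learning, this guarantee need not hold, which is the gap the paper addresses.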

