Conformal Prediction for Ensembles: Improving Efficiency via Score-Based Aggregation (2405.16246v3)
Abstract: Distribution-free uncertainty estimation for ensemble methods is increasingly desirable due to the widening deployment of multi-modal black-box predictive models. Conformal prediction is one approach that avoids such distributional assumptions. Methods for conformal aggregation have in turn been proposed for ensembled prediction, where the prediction regions of individual models are merged so as to retain coverage guarantees while minimizing conservatism. Merging the prediction regions directly, however, discards structure present in the conformal scores that could further reduce conservatism. We therefore propose a novel framework that extends the standard scalar formulation of a score function to a multivariate score, which produces more efficient prediction regions. We then demonstrate that this framework can be leveraged efficiently in downstream classification and predict-then-optimize regression settings, and we show empirically that it outperforms alternative conformal aggregation methods.
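To make the idea of aggregating at the score level concrete, the following is a minimal sketch of split conformal classification in which the scores of K models are stacked into a vector per example and then reduced to a scalar before calibration. It is not the framework proposed in the paper: the fixed weighted sum, the score 1 - p_k(y | x), and the names `conformal_quantile`, `stacked_scores`, and `prediction_sets` are all assumptions made for illustration.

```python
import math
import numpy as np

def conformal_quantile(scores, alpha):
    """Finite-sample-corrected (1 - alpha) quantile of calibration scores."""
    n = len(scores)
    k = math.ceil((n + 1) * (1 - alpha))  # rank of the order statistic to use
    if k > n:
        return float("inf")  # too few calibration points: keep every label
    return float(np.sort(scores)[k - 1])

def stacked_scores(probs, labels=None):
    """Stack per-model scores 1 - p_k(y | x) into vectors (assumed score).

    probs has shape (K, n, C): class probabilities from K models.
    Returns shape (n, K) when labels are given, else (n, C, K),
    i.e. one score vector per candidate label.
    """
    s = 1.0 - np.transpose(probs, (1, 2, 0))  # (n, C, K)
    if labels is None:
        return s
    return s[np.arange(len(labels)), labels]  # (n, K)

def prediction_sets(cal_probs, cal_labels, test_probs, weights, alpha=0.1):
    """Boolean mask (n_test, C): True where a label is in the prediction set.

    The score vector is reduced with a fixed weight vector (hypothetical
    choice); the weights must not depend on the calibration data.
    """
    cal = stacked_scores(cal_probs, cal_labels) @ weights  # (n_cal,)
    q = conformal_quantile(cal, alpha)
    test = stacked_scores(test_probs) @ weights  # (n_test, C)
    return test <= q
```

Because the weights are fixed before calibration, the reduced score is an ordinary scalar conformal score and the usual marginal coverage guarantee applies under exchangeability. Choosing the weights on a separate data split would preserve this property; choosing them on the calibration set itself would not.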