SymmPI: Predictive Inference for Data with Group Symmetries (2312.16160v3)
Abstract: Quantifying the uncertainty of predictions is a core problem in modern statistics. Methods for predictive inference have been developed under a variety of assumptions, often -- for instance, in standard conformal prediction -- relying on the invariance of the distribution of the data under special groups of transformations such as permutation groups. Moreover, many existing methods for predictive inference aim to predict unobserved outcomes in sequences of feature-outcome observations. Meanwhile, there is interest in predictive inference under more general observation models (e.g., for partially observed features) and for data satisfying more general distributional symmetries (e.g., rotationally invariant or coordinate-independent observations in physics). Here we propose SymmPI, a methodology for predictive inference when data distributions have general group symmetries in arbitrary observation models. Our methods leverage the novel notion of distributional equivariant transformations, which process the data while preserving their distributional invariances. We show that SymmPI has valid coverage under distributional invariance and characterize its performance under distribution shift, recovering recent results as special cases. We apply SymmPI to predict unobserved values associated to vertices in a network, where the distribution is unchanged under relabelings that keep the network structure unchanged. In several simulations in a two-layer hierarchical model, and in an empirical data analysis example, SymmPI performs favorably compared to existing methods.
- M. J. Anderson and J. Robinson. Permutation tests for linear models. Australian & New Zealand Journal of Statistics, 43(1):75–88, 2001.
- Conformal prediction: A gentle introduction. Foundations and Trends® in Machine Learning, 16(4):494–591, 2023.
- M. Artin. Algebra. Pearson, 2018.
- L. Babai. Graph isomorphism in quasipolynomial time. In Proceedings of the forty-eighth annual ACM symposium on Theory of Computing, pages 684–697, 2016.
- Effects of sleep schedules on commercial motor vehicle driver performance. Technical report, United States. Department of Transportation. Federal Motor Carrier Safety Safety Administration, 2000.
- Conformal prediction beyond exchangeability. The Annals of Statistics, 51(2):816–845, 2023.
- Distribution-free, risk-controlling prediction sets. Journal of the ACM (JACM), 68(6):1–34, 2021.
- Testing for outliers with conformal p-values. The Annals of Statistics, 51(1):149–178, 2023.
- Patterns of performance degradation and restoration during sleep restriction and subsequent recovery: A sleep dose-response study. Journal of sleep research, 12(1):1–12, 2003.
- B. Blum-Smith and S. Villar. Equivariant maps from invariant functions. arXiv preprint arXiv:2209.14991, 2022.
- Conformalized survival analysis. Journal of the Royal Statistical Society Series B: Statistical Methodology, 85(1):24–45, 2023.
- Learning augmentation distributions using transformed risk minimization. Transactions on Machine Learning Research, 2023. URL https://openreview.net/forum?id=LRYtNj8Xw0.
- A group-theoretic framework for data augmentation. Journal of Machine Learning Research, 21(245):1–71, 2020.
- Exact and Robust Conformal Inference Methods for Predictive Machine Learning With Dependent Data. In Proceedings of the 31st Conference On Learning Theory, 2018.
- T. Cohen and M. Welling. Group equivariant convolutional networks. In International Conference on Machine Learning, 2016.
- D. R. Cox. Principles of statistical inference. Cambridge University Press, 2006.
- A. Dean and J. Verducci. Linear transformations that preserve majorization, schur concavity, and exchangeability. Linear Algebra and its Applications, 127:121–138, 1990.
- P. Diaconis. Group Representations in Probability and Statistics. Institute of Mathematical Statistics, 1988.
- J. Diestel and A. Spalsbury. The joys of Haar measure. American Mathematical Society, 2014.
- E. Dobriban. Consistency of invariance-based randomization tests. The Annals of Statistics, 50(4):2443 – 2466, 2022.
- E. Dobriban and Z. Lin. Joint coverage regions: Simultaneous confidence and prediction sets. arXiv preprint arXiv:2303.00203, 2023.
- Distribution-free prediction sets for two-layer hierarchical models. Journal of the American Statistical Association, pages 1–12, 2022.
- M. L. Eaton. Group invariance applications in statistics. In Regional conference series in Probability and Statistics, 1989.
- T. Eden and F. Yates. On the validity of Fisher’s z test when applied to an actual example of non-normal data. The Journal of Agricultural Science, 23(1):6–17, 1933.
- Training uncertainty-aware classifiers with conformalized deep learning. Advances in Neural Information Processing Systems, 2022.
- M. D. Ernst. Permutation methods: a basis for exact inference. Statistical Science, 19(4):676–685, 2004.
- Generalizing convolutional neural networks for equivariance to lie groups on arbitrary continuous data. In Proceedings of the 37th International Conference on Machine Learning, 2020.
- Few-shot conformal prediction with auxiliary tasks. In International Conference on Machine Learning, 2021.
- R. A. Fisher. The design of experiments. Oliver and Boyd, 1935.
- G. B. Folland. A course in abstract harmonic analysis. CRC Press, 2016.
- K. Fukushima. Neocognitron: A self-organizing neural network model for a mechanism of pattern recognition unaffected by shift in position. Biological cybernetics, 36(4):193–202, 1980.
- W. Fulton and J. Harris. Representation theory: a first course, volume 129. Springer Science & Business Media, 2013.
- Learning by transduction. In Proceedings of the Fourteenth conference on Uncertainty in artificial intelligence, 1998.
- S. Geisser. Predictive inference: an introduction. Chapman and Hall/CRC, 2017.
- Neural message passing for quantum chemistry. In International Conference on Machine Learning, 2017.
- N. C. Giri. Group invariance in statistical inference. World Scientific, 1996.
- P. I. Good. Permutation, parametric, and bootstrap tests of hypotheses. Springer Science & Business Media, 2006.
- D. J. Gross. The role of symmetry in fundamental physics. Proceedings of the National Academy of Sciences, 93(25):14256–14259, 1996.
- L. Guan. A conformal test of linear models via permutation-augmented regressions. arXiv preprint arXiv:2309.05482, 2023a.
- L. Guan. Localized conformal prediction: A generalized inference framework for conformal prediction. Biometrika, 110(1):33–50, 2023b.
- L. Guan and R. Tibshirani. Prediction and outlier detection in classification problems. Journal of the Royal Statistical Society Series B: Statistical Methodology, 84(2):524–546, 2022.
- Conformalized survival analysis with adaptive cutoffs. Biometrica, (to appear), arXiv preprint arXiv:2211.01227, 2022.
- J. Hemerik and J. Goeman. Exact testing with random permutations. Test, 27(4):811–825, 2018.
- W. Hoeffding. The large-sample power of tests based on permutations of observations. The Annals of Mathematical Statistics, pages 169–192, 1952.
- idecode: In-distribution equivariance for conformal out-of-distribution detection. In Proceedings of the AAAI Conference on Artificial Intelligence, 2022.
- F. E. Kennedy. Randomization tests in econometrics. Journal of Business & Economic Statistics, 13(1):85–94, 1995.
- A. K. Kuchibhotla. Exchangeability, conformal prediction, and rank tests. arXiv preprint arXiv:2005.06095, 2020.
- Backpropagation applied to handwritten zip code recognition. Neural computation, 1(4):541–551, 1989.
- Distribution-free inference with hierarchical data. arXiv preprint arXiv:2306.06342, 2023.
- E. L. Lehmann and C. Stein. On the theory of some non-parametric hypotheses. The Annals of Mathematical Statistics, 20(1):28–45, 1949.
- J. Lei. Classification with confidence. Biometrika, 101(4):755–769, 2014.
- J. Lei and L. Wasserman. Distribution-free prediction bands for non-parametric regression. Journal of the Royal Statistical Society: Series B (Statistical Methodology), 76(1):71–96, 2014.
- Distribution-free prediction sets. Journal of the American Statistical Association, 108(501):278–287, 2013.
- A conformal prediction approach to explore functional data. Annals of Mathematics and Artificial Intelligence, 74(1):29–43, 2015.
- Distribution-free predictive inference for regression. Journal of the American Statistical Association, 113(523):1094–1111, 2018.
- Pac-wrap: Semi-supervised pac anomaly detection. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, 2022.
- Integrative conformal p-values for powerful out-of-distribution testing with labeled outliers. arXiv preprint arXiv:2208.11111, 2022.
- Conformal inference is (almost) free for neural networks trained with early stopping. In International Conference on Machine Learning, 2023.
- Conformal prediction for network-assisted regression. arXiv preprint arXiv:2302.10095, 2023.
- An analysis of the effect of invariance on generalization in neural networks. In International Conference on Machine Learning Workshop on Understanding and Improving Generalization in Deep Learning, 2019.
- J. R. Munkres. Topology. Pearson Education, 2019.
- L. Nachbin. The Haar Integral. R. E. Krieger Publishing Company, 1976.
- Inductive confidence machines for regression. In European Conference on Machine Learning. Springer, 2002.
- PAC confidence sets for deep neural networks via calibrated prediction. In International Conference on Learning Representations, 2020.
- PAC prediction sets under covariate shift. In International Conference on Learning Representations, 2021.
- PAC prediction sets for meta-learning. In Advances in Neural Information Processing Systems, 2022.
- F. Pesarin. Multivariate permutation tests: with applications in biostatistics. Wiley, 2001.
- F. Pesarin and L. Salmaso. Permutation tests for complex data: theory, applications and software. John Wiley & Sons, 2010.
- F. Pesarin and L. Salmaso. A review and some new results on permutation testing for multivariate problems. Statistics and Computing, 22(2):639–646, 2012.
- E. J. Pitman. Significance tests which may be applied to samples from any populations. Supplement to the Journal of the Royal Statistical Society, 4(1):119–130, 1937.
- Prediction sets adaptive to unknown covariate shift. Journal of the Royal Statistical Society: Series B (to appear), arXiv preprint arXiv:2203.06126, 2022.
- M. Robinson. Symmetry and the standard model. Springer, 2011.
- Conformalized quantile regression. In Advances in Neural Information Processing Systems, 2019.
- Classification with valid and adaptive coverage. Advances in Neural Information Processing Systems, 2020.
- Least Ambiguous Set-Valued Classifiers With Bounded Error Levels. Journal of the American Statistical Association, 114(525):223–234, 2019.
- Transduction with confidence and credibility. In IJCAI, 1999.
- H. Scheffe and J. W. Tukey. Non-parametric estimation. I. Validation of order statistics. The Annals of Mathematical Statistics, 16(2):187–192, 1945.
- J. Schwichtenberg. Physics from symmetry. Springer, 2018.
- Conformal frequency estimation using discrete sketched data with coverage for distinct queries. Journal of Machine Learning Research, 24(348):1–80, 2023.
- PAC prediction sets under label shift. arXiv preprint arXiv:2310.12964, 2023.
- Conformal prediction under covariate shift. Advances in Neural Information Processing Systems, 32, 2019.
- S. Tornier. Haar measures. arXiv preprint arXiv:2006.10956, 2020.
- J. W. Tukey. Non-parametric estimation II. Statistically equivalent blocks and tolerance regions–the continuous case. The Annals of Mathematical Statistics, 18(4):529–539, 1947.
- J. W. Tukey. Nonparametric estimation, III. Statistically equivalent blocks and multivariate tolerance regions–the discontinuous case. The Annals of Mathematical Statistics, 19(1):30–39, 1948.
- Scalars are universal: Equivariant machine learning, structured like classical physics. Advances in Neural Information Processing Systems, 34:28848–28863, 2021.
- Dimensionless machine learning: Imposing exact units equivariance. Journal of Machine Learning Research, 24(109):1–32, 2023.
- V. Vovk. Conditional validity of inductive conformal predictors. In Asian Conference on Machine Learning, 2013.
- Algorithmic learning in a random world. Springer Science & Business Media, 2005.
- Machine-learning applications of algorithmic randomness. In International Conference on Machine Learning, 1999.
- A. Wald. An Extension of Wilks’ Method for Setting Tolerance Limits. The Annals of Mathematical Statistics, 14(1):45–55, 1943.
- Learning steerable filters for rotation equivariant cnns. In 2018 IEEE Conference on Computer Vision and Pattern Recognition, pages 849–858. IEEE Computer Society, 2018.
- R. A. Wijsman. Invariant measures on groups and their use in statistics. IMS, 1990.
- S. S. Wilks. Determination of Sample Sizes for Setting Tolerance Limits. The Annals of Mathematical Statistics, 12(1):91–96, 1941.
- Representation learning on graphs with jumping knowledge networks. In International Conference on Machine Learning, 2018.