Papers
Topics
Authors
Recent
Assistant
AI Research Assistant
Well-researched responses based on relevant abstracts and paper content.
Custom Instructions Pro
Preferences or requirements that you'd like Emergent Mind to consider when generating responses.
Gemini 2.5 Flash
Gemini 2.5 Flash 134 tok/s
Gemini 2.5 Pro 41 tok/s Pro
GPT-5 Medium 31 tok/s Pro
GPT-5 High 25 tok/s Pro
GPT-4o 57 tok/s Pro
Kimi K2 190 tok/s Pro
GPT OSS 120B 435 tok/s Pro
Claude Sonnet 4.5 37 tok/s Pro
2000 character limit reached

Intrinsic Sliced Wasserstein Distances for Comparing Collections of Probability Distributions on Manifolds and Graphs (2010.15285v3)

Published 28 Oct 2020 in stat.ME and stat.ML

Abstract: Collections of probability distributions arise in a variety of applications ranging from user activity pattern analysis to brain connectomics. In practice these distributions can be defined over diverse domain types including finite intervals, circles, cylinders, spheres, other manifolds, and graphs. This paper introduces an approach for detecting differences between two collections of distributions over such general domains. To this end, we propose the intrinsic slicing construction that yields a novel class of Wasserstein distances on manifolds and graphs. These distances are Hilbert embeddable, allowing us to reduce the distribution collection comparison problem to a more familiar mean testing problem in a Hilbert space. We provide two testing procedures one based on resampling and another on combining p-values from coordinate-wise tests. Our experiments in various synthetic and real data settings show that the resulting tests are powerful and the p-values are well-calibrated.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (46)
  1. Multimodal neuroimaging in schizophrenia: Description and dissemination. Neuroinformatics, 15(4):343–364, Oct 2017. ISSN 1559-0089.
  2. Network classification with applications to brain connectomics. Ann. Appl. Stat., 13(3):1648–1677, 09 2019.
  3. Bakhvalov, N. S. On the approximate calculation of multiple integrals. J. Complexity, 31:502–516, 2015. English translation; the original appeared in Vestnik MGU, Ser. Math. Mech. Astron. Phys. Chem, 4, 3–18, 1959.
  4. Controlling the false discovery rate: A practical and powerful approach to multiple testing. Journal of the Royal Statistical Society. Series B (Methodological), 57(1):289–300, 1995. ISSN 00359246.
  5. Berger, M. A Panoramic View of Riemannian Geometry. Springer-Verlag Berlin Heidelberg, 2003.
  6. Bigot, J. Statistical data analysis in the wasserstein space. ESAIM: ProcS, 68:1–19, 2020.
  7. Essential Mathematics for the Physical Sciences, volume Volume I: Homogeneous boundary value problems, Fourier methods, and special functions, chapter Spherical harmonics and friends. Morgan & Claypool Publishers, 2017. Pages 6–1 to 6–26.
  8. On the Wasserstein distance between classical sequences and the Lebesgue measure. Trans. Amer. Math. Soc., in press, 2020. https://arxiv.org/abs/1909.09046.
  9. City of Chicago. Chicago data portal: Crimes - 2018, 2022. URL https://data.cityofchicago.org/Public-Safety/Crimes-2018/3i3m-jwuy.
  10. Diffusion maps. Applied and Computational Harmonic Analysis, 21(1):5 – 30, 2006. ISSN 1063-5203. Special Issue: Diffusion Maps and Wavelets.
  11. Asymptotic distribution of two-sample empirical U-quantiles with applications to robust tests for shifts in location. Journal of Multivariate Analysis, 105(1):124–140, 2012.
  12. Max-sliced wasserstein distance and its use for gans. In 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR), pp.  10640–10648, 2019.
  13. Frćhet analysis of variance for random objects. Biometrika, 106(4):803–821, 10 2019. ISSN 0006-3444.
  14. Large deviations and gradient flows for the Brownian one-dimensional hard-rod system. https://arxiv.org/abs/1909.02054, 2019.
  15. Good, I. J. Significance tests in parallel and in series. Journal of the American Statistical Association, 53(284):799–813, 1958. ISSN 01621459.
  16. A Comparison of Tests for the One-Way ANOVA Problem for Functional Data. Computational Statistics, 30:987–1010, 2015.
  17. A kernel two-sample test. J. Mach. Learn. Res., 13:723–773, March 2012. ISSN 1532-4435.
  18. The gradient estimate of a neumann eigenfunction on a compact manifold with boundary. Chinese Annals of Mathematics, Series B, 36(6):991–1000, Nov 2015. ISSN 1860-6261.
  19. To permute or not to permute. Bioinformatics, 22(18):2244–2248, 07 2006. ISSN 1367-4803.
  20. Generalized sliced wasserstein distances. In Wallach, H., Larochelle, H., Beygelzimer, A., Alché-Buc, F., Fox, E., and Garnett, R. (eds.), Advances in Neural Information Processing Systems 32, pp.  261–272. Curran Associates, Inc., 2019a.
  21. Sliced wasserstein auto-encoders. In 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA, May 6-9, 2019. OpenReview.net, 2019b. URL https://openreview.net/forum?id=H1xaJn05FQ.
  22. Multiscale Nonrigid Point Cloud Registration Using Rotation-Invariant Sliced-Wasserstein Distance via Laplace–Beltrami Eigenmap. In SIAM J. Imaging Sci., volume 10 of 2, pp. 449–483, 2017.
  23. Tree-sliced variants of wasserstein distances. In Advances in Neural Information Processing Systems, volume 32, 2019.
  24. Sobolev transport: A scalable metric for probability measures with graph metrics. In Proceedings of The 25th International Conference on Artificial Intelligence and Statistics, volume 151 of Proceedings of Machine Learning Research, pp.  9844–9868. PMLR, 2022.
  25. Biharmonic distance. ACM Trans. Graph., 29(3), July 2010. ISSN 0730-0301.
  26. Cauchy combination test: A powerful test with analytic p-value calculation under arbitrary dependency structures. Journal of the American Statistical Association, 115(529):393–402, 2020.
  27. A Model for Cylindrical Variables with Applications. Journal of the Royal Statistical Society: Series B (Methodological), 40(2):229–233, 1978.
  28. Universal kernels. J. Mach. Learn. Res., 7:2651–2667, December 2006. ISSN 1532-4435.
  29. Differential Geometry and Statistics, chapter The definition of a statistical manifold (Chapter 3.2). Chapman & Hall, 1993.
  30. NHANES. 2005-2006 Data Documentation, Codebook, and Frequencies, 2008. URL {https://wwwn.cdc.gov/Nchs/Nhanes/2005-2006/PAXRAW_D.htm}.
  31. Statistical aspects of wasserstein distances. Annual Review of Statistics and Its Application, 6(1):405–431, 2019.
  32. Functional data analysis for density functions by transformation to a hilbert space. Ann. Statist., 44(1):183–218, 02 2016.
  33. Wasserstein covariance for multiple random densities. Biometrika, 106(2):339–351, 04 2019. ISSN 0006-3444. doi: 10.1093/biomet/asz005. URL https://doi.org/10.1093/biomet/asz005.
  34. Computational optimal transport: With applications to data science. Foundations and Trends in Machine Learning, 11(5-6):355–607, 2019. ISSN 1935-8237.
  35. Functional network organization of the human brain. Neuron, 72(4):665–678, Nov 2011. ISSN 1097-4199. 22099467[pmid].
  36. Methods of modern mathematical physics. Vol. 1: Functional Analysis. Academic Press, 1080.
  37. Laplace-Beltrami eigenvalues and topological features of eigenfunctions for statistical shape analysis. Computer-Aided Design, 41(10):739–755, 2009.
  38. DISCO analysis: A nonparametric extension of analysis of variance. Ann. Appl. Stat., 4:1034–1055, 2010.
  39. The earth mover’s distance as a metric for image retrieval. International Journal of Computer Vision, 40(2):99–121, 2000. doi: 10.1023/A:1026543900054.
  40. Kernel mean embedding based hypothesis tests for comparing spatial point patterns. Spatial Statistics, 38:100459, 2020. ISSN 2211-6753.
  41. Serfling, R. J. Approximation theorems of mathematical statistics. John Wiley & Sons, 2009.
  42. Wasserstein propagation for semi-supervised learning. In Proceedings of the 31th International Conference on Machine Learning, ICML 2014, Beijing, China, 21-26 June 2014, volume 32 of JMLR Workshop and Conference Proceedings, pp.  306–314. JMLR.org, 2014.
  43. Villani, C. Topics in optimal transportation. Graduate studies in mathematics. American mathematical society, Providence, Rhode Island, 2003. ISBN 0-8218-3312-X.
  44. Wilson, D. J. The harmonic mean p-value for combining dependent tests. Proceedings of the National Academy of Sciences, 116(4):1195–1200, 2019. ISSN 0027-8424.
  45. Zhang, J.-T. Analysis of Variance for Functional Data. Chapman and Hall/CRC, first edition, 2013.
  46. A new test for functional one-way ANOVA with applications to ischemic heart screening. Computational Statistics & Data Analysis, 132:3–17, 2019.
Citations (12)

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Open Problems

We haven't generated a list of open problems mentioned in this paper yet.

Lightbulb Streamline Icon: https://streamlinehq.com

Continue Learning

We haven't generated follow-up questions for this paper yet.

List To Do Tasks Checklist Streamline Icon: https://streamlinehq.com

Collections

Sign up for free to add this paper to one or more collections.