Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
162 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

CA-PCA: Manifold Dimension Estimation, Adapted for Curvature (2309.13478v2)

Published 23 Sep 2023 in stat.ML and cs.LG

Abstract: The success of algorithms in the analysis of high-dimensional data is often attributed to the manifold hypothesis, which supposes that this data lie on or near a manifold of much lower dimension. It is often useful to determine or estimate the dimension of this manifold before performing dimension reduction, for instance. Existing methods for dimension estimation are calibrated using a flat unit ball. In this paper, we develop CA-PCA, a version of local PCA based instead on a calibration of a quadratic embedding, acknowledging the curvature of the underlying manifold. Numerous careful experiments show that this adaptation improves the estimator in a wide range of settings.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (29)
  1. Nonasymptotic rates for manifold, tangent space and curvature estimation. The Annals of Statistics, 47(1):177–204, 2019.
  2. Non-parametric estimation of manifolds from noisy data. arXiv preprint arXiv:2105.04754, 2021.
  3. Estimating the dimensionality of the manifold underlying multi-electrode neural recordings. PLoS computational biology, 17(11):e1008591, 2021.
  4. Random projections of smooth manifolds. Foundations of Computational Mathematics, 9(1):51–77, 2009.
  5. Density estimation on manifolds with boundary. Computational Statistics & Data Analysis, 107:1–17, 2017.
  6. Intrinsic dimension estimation using wasserstein distance. Journal of Machine Learning Research, 23(313):1–37, 2022.
  7. Intrinsic dimension estimation by maximum likelihood in isotropic probabilistic PCA. Pattern Recognition Letters, 32(14):1706–1713, 2011.
  8. Intrinsic dimension estimation: Advances and open problems. Information Sciences, 328:26–41, 2016.
  9. Intrinsic dimension estimation: relevant techniques and a benchmark framework. Math. Probl. Eng., pages Art. ID 759567, 21, 2015.
  10. De-biasing for intrinsic dimension estimation. In 2007 IEEE/SP 14th Workshop on Statistical Signal Processing, pages 601–605. IEEE, 2007.
  11. Estimating differential quantities using polynomial fitting of osculating jets. Computer Aided Geometric Design, 22(2):121–146, 2005.
  12. Multidimensional scaling. Handbook of data visualization, pages 315–347, 2008.
  13. Kenneth L Clarkson. Tighter bounds for random projections of manifolds. In Proceedings of the twenty-fourth annual symposium on Computational geometry, pages 39–48, 2008.
  14. Distributional results for model-based intrinsic dimension estimators. arXiv preprint arXiv:2104.13832, 2021.
  15. Gerald B. Folland. How to integrate a polynomial over a sphere. Amer. Math. Monthly, 108(5):446–448, 2001.
  16. Two-way multidimensional scaling: A review. IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews), 41(5):644–661, 2010.
  17. An algorithm for finding intrinsic dimensionality of data. IEEE Transactions on Computers, 100(2):176–183, 1971.
  18. Effective estimation of the dimensions of a manifold from random samples. arXiv preprint arXiv:2209.01839, 2022.
  19. Sparse probabilistic principal component analysis. In Artificial Intelligence and Statistics, pages 185–192. PMLR, 2009.
  20. Maximum likelihood estimation of intrinsic dimension. Advances in neural information processing systems, 17, 2004.
  21. Efficient manifold approximation with spherelets. J. R. Stat. Soc. Ser. B. Stat. Methodol., 84(4):1129–1149, 2022.
  22. Tangent space and dimension estimation with the Wasserstein distance. arXiv preprint arXiv:2110.06357, 2021.
  23. Riemannian manifold learning. IEEE transactions on pattern analysis and machine intelligence, 30(5):796–809, 2008.
  24. Estimation of intrinsic dimensionality of samples from noisy low-dimensional manifolds in high dimensions with multiscale SVD. In 2009 IEEE/SP 15th Workshop on Statistical Signal Processing, pages 85–88. IEEE, 2009.
  25. The role of intrinsic dimension in high-resolution player tracking data—insights in basketball. The Annals of Applied Statistics, 16(1):326–348, 2022.
  26. A global geometric framework for nonlinear dimensionality reduction. science, 290(5500):2319–2323, 2000.
  27. Tangent space estimation for smooth embeddings of Riemannian manifolds. Information and Inference: A Journal of the IMA, 2(1):69–114, 2013.
  28. Laurens Van der Maaten and Geoffrey Hinton. Visualizing data using t-sne. Journal of machine learning research, 9(11), 2008.
  29. Peter J. Verveer and Robert P. W. Duin. An evaluation of intrinsic dimensionality estimators. IEEE Transactions on pattern analysis and machine intelligence, 17(1):81–86, 1995.

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets