Manifold Regularization Classification Model Based On Improved Diffusion Map (2403.16059v1)
Abstract: Manifold regularization model is a semi-supervised learning model that leverages the geometric structure of a dataset, comprising a small number of labeled samples and a large number of unlabeled samples, to generate classifiers. However, the original manifold norm limits the performance of models to local regions. To address this limitation, this paper proposes an approach to improve manifold regularization based on a label propagation model. We initially enhance the probability transition matrix of the diffusion map algorithm, which can be used to estimate the Neumann heat kernel, enabling it to accurately depict the label propagation process on the manifold. Using this matrix, we establish a label propagation function on the dataset to describe the distribution of labels at different time steps. Subsequently, we extend the label propagation function to the entire data manifold. We prove that the extended label propagation function converges to a stable distribution after a sufficiently long time and can be considered as a classifier. Building upon this concept, we propose a viable improvement to the manifold regularization model and validate its superiority through experiments.
- Semi-supervised support vector machines. Advances in Neural Information processing systems, 11, 1998.
- David Yarowsky. Unsupervised word sense disambiguation rivaling supervised methods. In 33rd annual meeting of the association for computational linguistics, pages 189–196, 1995.
- On discriminative vs. generative classifiers: A comparison of logistic regression and naive bayes. Advances in neural information processing systems, 14, 2001.
- Learning from labeled and unlabeled data with label propagation. ProQuest number: information to all users, 2002.
- Laplacian eigenmaps and spectral techniques for embedding and clustering. Advances in neural information processing systems, 14, 2001.
- Manifold regularization: A geometric framework for learning from labeled and unlabeled examples. Journal of machine learning research, 7(11), 2006.
- Regularization and semi-supervised learning on large graphs. In Learning Theory: 17th Annual Conference on Learning Theory, COLT 2004, Banff, Canada, July 1-4, 2004. Proceedings 17, pages 624–638. Springer, 2004.
- The Feynman lectures on physics, Vol. I: The new millennium edition: mainly mechanics, radiation, and heat, volume 1. Basic books, 2011.
- Introduction to heat transfer. John Wiley & Sons, 2011.
- Diffusion maps. Applied and computational harmonic analysis, 21(1):5–30, 2006.
- Geodesic distance estimation with spherelets. arXiv preprint arXiv:1907.00296, 2019.
- Introduction to algorithms, volume 3. MIT press Cambridge, MA, USA, 1994.
- Kôsaku Yosida. Functional analysis, volume 123. Springer Science & Business Media, 2012.
- E. B. Davies. Heat Kernels and Spectral Theory. Cambridge Tracts in Mathematics. Cambridge University Press, 1989.
- Manfredo Perdigao Do Carmo and J Flaherty Francis. Riemannian geometry, volume 6. Springer, 1992.
- Lawrence C Evans. Partial differential equations, volume 19. American Mathematical Society, 2022.
- W. Rudin. Real and Complex Analysis. Higher Mathematics Series. McGraw-Hill, 1974.
- Thierry Aubin. Some nonlinear problems in Riemannian geometry. Springer Science & Business Media, 2013.
- On the parabolic kernel of the schrödinger operator. Acta Mathematica, 156:153–201, 1986.