The Umeyama algorithm for matching correlated Gaussian geometric models in the low-dimensional regime (2402.15095v1)
Abstract: Motivated by the problem of matching two correlated random geometric graphs, we study the problem of matching two Gaussian geometric models correlated through a latent node permutation. Specifically, given an unknown permutation $\pi*$ on ${1,\ldots,n}$ and given $n$ i.i.d. pairs of correlated Gaussian vectors ${X_{\pi*(i)},Y_i}$ in $\mathbb{R}d$ with noise parameter $\sigma$, we consider two types of (correlated) weighted complete graphs with edge weights given by $A_{i,j}=\langle X_i,X_j \rangle$, $B_{i,j}=\langle Y_i,Y_j \rangle$. The goal is to recover the hidden vertex correspondence $\pi*$ based on the observed matrices $A$ and $B$. For the low-dimensional regime where $d=O(\log n)$, Wang, Wu, Xu, and Yolou [WWXY22+] established the information thresholds for exact and almost exact recovery in matching correlated Gaussian geometric models. They also conducted numerical experiments for the classical Umeyama algorithm. In our work, we prove that this algorithm achieves exact recovery of $\pi*$ when the noise parameter $\sigma=o(d{-3}n{-2/d})$, and almost exact recovery when $\sigma=o(d{-3}n{-1/d})$. Our results approach the information thresholds up to a $\operatorname{poly}(d)$ factor in the low-dimensional regime.
- A. Armiti and M. Gertz. Geometric graph matching and similarity: A probabilistic approach. In Proceedings of the 26th International Conference on Scientific and Statistical Database Management, pages 1–12, 2014.
- (Nearly) efficient algorithms for the graph matching problem on correlated random graphs. In Advances in Neural Information Processing Systems (NIPS), volume 32. Curran Associates, Inc., 2019.
- Shape matching and object recognition using low distortion correspondences. In 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR), volume 1, pages 26–33, 2005.
- S. N. Bernstein. On a modification of Chebyshev’s inequality and of the error formula of Laplace. In Ann. Sci. Inst. Sav. Ukraine, Sect. Math, 1(4):38–49, 1924.
- Seedless graph matching via tail of degree distribution for correlated Erdős-Rényi graphs. Preprint, arXiv:1907.06334.
- Balanced graph matching. In Advances in Neural Information Processing Systems (NIPS), volume 19. MIT Press, 2006.
- D. Cullina and N. Kiyavash. Exact alignment recovery for correlated Erdős-Rényi graphs. Preprint, arXiv:1711.06783.
- D. Cullina and N. Kiyavash. Improved achievability and converse bounds for Erdős-Rényi graph matching. In Proceedings of the 2016 ACM International Conference on Measurement and Modeling of Computer Science (SIGMETRICS), pages 63–72, New York, NY, USA, 2016. Association for Computing Machinery.
- Partial recovery of Erdős-Rényi graph alignment via k𝑘kitalic_k-core alignment. In Proceedings of the 2016 ACM International Conference on Measurement and Modeling of Computer Science (SIGMETRICS), pages 99–100, New York, NY, USA, 2020. Association for Computing Machinery.
- Database alignment with Gaussian features. In The 22nd International Conference on Artificial Intelligence and Statistics, pages 3225–3233. PMLR, 2019.
- Achievability of nearly-exact alignment for correlated Gaussian databases. In 2020 IEEE International Symposium on Information Theory (ISIT), pages 1230–1235. IEEE, 2020.
- Analysis of a canonical labeling algorithm for the alignment of correlated Erdős-Rényi graphs. In Proceedings of the ACM on Measurement and Analysis of Computing Systems, 3(2), jun 2019.
- C. Davis and W. Kahan. The rotation of eigenvectors by a perturbation. In SIAM Journal on Numerical Analysis, 7:1–46, 1970.
- J. Ding and H. Du. Detection threshold for correlated Erdős-Rényi graphs via densest subgraph. In IEEE Transactions on Information Theory, 69(8):5289–5298, 2023.
- J. Ding and H. Du. Matching recovery threshold for correlated random graphs. In Annals of Statistics, 51(4): 1718-1743, 2023.
- J. Ding, H. Du and S. Gong. A polynomial-time approximation scheme for the maximal overlap of two independent Erdős-Rényi graphs. To appear in Random Structures and Algorithms.
- J. Ding, H. Du and Z. Li. Low-Degree Hardness of Detection for Correlated Erdős-Rényi Graphs. Preprint, arXiv: 2311.15931.
- J. Ding, Y. Fei and Y. Wang. Efficiently matching random inhomogeneous graphs via degree profiles. Preprint, arXiv: 2310.10441.
- J. Ding and Z. Li. A polynomial time iterative algorithm for matching Gaussian matrices with non-vanishing correlation. Preprint, arXiv:2212.13677.
- J. Ding and Z. Li. A polynomial-time iterative algorithm for random graph matching with non-vanishing correlation. Preprint, arXiv:2306.00266.
- Efficient random graph matching via degree profiles. In Probability Theory and Related Fields, 179(1-2):29–115, 2021.
- H. Du, S. Gong and R. Huang. The Algorithmic Phase Transition of Random Graph Alignment Problem. Preprint, arXiv:2307.06590.
- Spectral graph matching and regularized quadratic relaxations I: Algorithm and theory. In Foundations of Computational Mathematics, 23(5):1511-1565, 2023.
- Spectral graph matching and regularized quadratic relaxations II: Erdős-Rényi graphs and universality. In Foundations of Computational Mathematics, 23(5):1567-1617, 2023.
- Spectral alignment of networks. Preprint, arXiv:1602.04181.
- R. Feng, G. Tian and D. Wei. Small gaps of GOE. In Geometric and Functional Analysis, 29(6):1794–1827, 2019.
- L. Ganassali and L. Massoulié. From tree matching to sparse graph alignment. In Proceedings of Thirty Third Conference on Learning Theory (COLT), volume 125 of Proceedings of Machine Learning Research, pages 1633–1665, 2020.
- Impossibility of Partial Recovery in the Graph Alignment Problem. In Proceedings of Thirty Fourth Conference on Learning Theory (COLT), volume 134 of Proceedings of Machine Learning Research, pages 2080–2102, 2021.
- Correlation detection in trees for planted graph alignment. To appear in Annals of Applied Probability.
- Statistical limits of correlation detection in trees. Preprint, arXiv:2209.13723.
- Robust textual inference via graph matching. In Proceedings of Human Language Technology Conference and Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 387–394, Vancouver, British Columbia, Canada, Oct 2005.
- G. Hall and L. Massoulié. Partial recovery in the graph alignment problem. In Operations Research, 71(1):259–272, 2023.
- Growing a graph matching from a handful of seeds. In Proc. VLDB Endow., 8(10):1010–1021, jun 2015.
- H. W. Kuhn. The Hungarian method for the assignment problem. In Naval research logistics quarterly, 2(1-2):83–97, 1955.
- D. Kunisky and J. N. Weed. Strong recovery of geometric planted matchings. In In Proceedings of the 2022 Annual ACM-SIAM Symposium on Discrete Algorithms (SODA), pages 834–876. SIAM, 2022.
- Seeded graph matching for correlated Erdős-Rényi graphs. In Journal of Machine Learning Research, 15:3513–3540, 2014.
- Random Graph Matching with Improved Noise Robustness. In Proceedings of Thirty Fourth Conference on Learning Theory (COLT), volume 134 of Proceedings of Machine Learning Research, pages 3296–3329, 2021.
- Exact matching of random graphs with constant correlation. In Probability Theory and Related Fields, 186(2):327–389, 2023.
- Testing network correlation efficiently via counting trees. To appear in Annals of Statistics.
- Random graph matching at Otter’s threshold via counting chandeliers. In Proceedings of the 55th Annual ACM Symposium on Theory of Computing (STOC), pages 1345–1356, 2023.
- E. Mossel and J. Xu. Seeded graph matching via large neighborhood statistics. In Random Structures and Algorithms, 57(3):570–611, 2020.
- Multivariate normal approximation using Stein’s method and Malliavin calculus. In Annales de l’IHP Probabilités et statistiques, 46(1):45–58, 2010.
- A. Narayanan and V. Shmatikov. Robust de-anonymization of large sparse datasets. In 2008 IEEE Symposium on Security and Privacy (ISSP), pages 111–125, 2008.
- A. Narayanan and V. Shmatikov. De-anonymizing social networks. In 2009 30th IEEE Symposium on Security and Privacy (ISSP), pages 173–187, 2009.
- P. Pedarsani and M. Grossglauser. On the privacy of anonymized networks. In Proceedings of the 17th ACM SIGKDD International Conference on Knowledge Discovery and Data Mining, pages 1235–1243, New York, NY, USA, 2011. Association for Computing Machinery.
- M. Z. Racz and A. Sridhar. Correlated randomly growing graphs. In Annals of Applied Probability, 32(2):1058–1111, 2022.
- M. Z. Racz and A. Sridhar. Correlated stochastic block models: Exact graph matching with applications to recovering communities. In Advances in Neural Information Processing Systems (NIPS), volume 34. Curran Associates, Inc., 2021.
- Seeded graph matching: Efficient algorithms and theoretical guarantees. In 2017 51st Asilomar Conference on Signals, Systems, and Computers, pages 253–257, 2017.
- Global alignment of multiple protein interaction networks with application to functional orthology detection. In Proceedings of the National Academy of Sciences of the United States of America, 105:12763–12768, 10 2008.
- S. Umeyama. An eigen-decomposition approach to weighted graph matching problems. In IEEE Transactions on Pattern Analysis and Machine Intelligence, 10(5):695–703, 1988.
- G. W. Stewart. The Efficient Generation of Random Orthogonal Matrices with an Application to Condition Estimators. In SIAM Journal on Numerical Analysis, 17(3):403–409, 1980.
- R. Vershynin. High-dimensional probability: An introduction with applications in data science. Cambridge university press, 2018.
- Fast approximate quadratic programming for graph matching. In PLOS ONE, 10(4):1–17, 2015.
- Random graph matching in geometric models: the case of complete graphs. In Proceedings of 35th Conference on Learning Theory (COLT), volume 178 of Proceedings of Machine Learning Research, pages 3441–3488, 2022.
- Y. Wu, J. Xu and S. H. Yu. Testing correlation of unlabeled random graphs. In Annals of Applied Probability, 33(4): 2519–2558, 2023.
- Y. Wu, J. Xu and S. H. Yu. Settling the sharp reconstruction thresholds of random graph matching. In IEEE Transactions on Information Theory, 68(8):5391–5417, 2022.
- L. Yartseva and M. Grossglauser. On the performance of percolation graph matching. In Proceedings of the First ACM Conference on Online Social Networks (COSN), pages 119–130, New York, NY, USA, 2013. Association for Computing Machinery.
- A useful variant of the Davis-Kahan theorem for statisticians. In Biometrika, 102(2):315–323, 2015.