A Wasserstein Graph Distance Based on Distributions of Probabilistic Node Embeddings (2401.03913v2)
Abstract: Distance measures between graphs are important primitives for a variety of learning tasks. In this work, we describe an unsupervised, optimal transport based approach to define a distance between graphs. Our idea is to derive representations of graphs as Gaussian mixture models, fitted to distributions of sampled node embeddings over the same space. The Wasserstein distance between these Gaussian mixture distributions then yields an interpretable and easily computable distance measure, which can further be tailored for the comparison at hand by choosing appropriate embeddings. We propose two embeddings for this framework and show that under certain assumptions about the shape of the resulting Gaussian mixture components, further computational improvements of this Wasserstein distance can be achieved. An empirical validation of our findings on synthetic data and real-world Functional Brain Connectivity networks shows promising performance compared to existing embedding methods.
- Steven H Strogatz “Exploring complex networks” In Nature 410.6825 Nature Publishing Group UK London, 2001, pp. 268–276
- “STRING v10: protein–protein interaction networks, integrated over the tree of life” In Nucleic acids research 43.D1 Oxford University Press, 2015, pp. D447–D452
- “Complex network measures of brain connectivity: uses and interpretations” In Neuroimage 52.3 Elsevier, 2010, pp. 1059–1069
- “Network analysis in the social sciences” In Science 323.5916 American Association for the Advancement of Science, 2009, pp. 892–895
- “GOT: an optimal transport framework for graph comparison” In Neurips 32, 2019
- “Graph diffusion wasserstein distances” In ECML PKDD, 2020, pp. 577–592 Springer
- “FGOT: Graph distances based on filters and optimal transport” In AAAI 36.7, 2022
- “Deep graph similarity learning for brain data analysis” In Proceedings of the 28th ACM CIKM, 2019
- Rita T Sousa, Sara Silva and Catia Pesquita “Evolving knowledge graph similarity for supervised learning in complex biomedical domains” In BMC bioinformatics 21 Springer, 2020, pp. 1–19
- Somesh Mohapatra, Joyce An and Rafael Gómez-Bombarelli “Chemistry-informed macromolecule graph representation for similarity computation, unsupervised and supervised learning” In Machine Learning: Science and Technology 3.1 IOP Publishing, 2022, pp. 015028
- “A survey of graph edit distance” In Pattern Analysis and applications 13 Springer, 2010, pp. 113–129
- “Simple unsupervised graph representation learning” In AAAI 36, 2022, pp. 7797–7805
- “Learning graph edit distance by graph neural networks” In Pattern Recognition 120 Elsevier, 2021, pp. 108132
- Facundo Mémoli “Gromov–Wasserstein distances and the metric approach to object matching” In Foundations of computational mathematics 11 Springer, 2011, pp. 417–487
- Gabriel Peyré, Marco Cuturi and Justin Solomon “Gromov-wasserstein averaging of kernel and distance matrices” In ICML, 2016 PMLR
- “Optimal transport for structured data with application on graphs” In ICML, 2019 PMLR
- “A Wasserstein-type distance in the space of Gaussian mixture models” In SIAM Journal on Imaging Sciences 13.2 SIAM, 2020, pp. 936–970
- Antoine Salmona, Julie Delon and Agnès Desolneux “Gromov-Wasserstein distances between Gaussian distributions” In arXiv:2104.07970, 2021
- “Role-based similarity in directed networks” In arXiv:1012.2726, 2010
- Michael Scholkemper and Michael T Schaub “Local, global and scale-dependent node roles” In 2021 IEEE ICAS, 2021, pp. 1–5 IEEE
- “A look at Gaussian mixture reduction algorithms” In FUSION, 2011, pp. 1–8 IEEE
- Akbar Assa and Konstantinos N Plataniotis “Wasserstein-distance-based Gaussian mixture reduction” In IEEE Signal Processing Letters 25.10 IEEE, 2018, pp. 1465–1469
- “graph2vec: Learning distributed representations of graphs”, 2017
- “node2vec: Scalable feature learning for networks” In ACM SIGKDD, 2016, pp. 855–864
- “role2vec: Role-based network embeddings” In Proc. DLG KDD, 2019, pp. 1–7
- “On the evolution of random graphs” In Publ. math. inst. hung. acad. sci 5.1, 1960, pp. 17–60
- Duncan J Watts and Steven H Strogatz “Collective dynamics of ’small-world’networks” In Nature 393.6684 Nature Publishing Group, 1998
- “Emergence of scaling in random networks” In Science 286.5439 American Association for the Advancement of Science, 1999, pp. 509–512
- Mark EJ Newman, Steven H Strogatz and Duncan J Watts “Random graphs with arbitrary degree distributions and their applications” In Physical review E 64.2 APS, 2001, pp. 026118
- “Functional connectivity in the motor cortex of resting human brain using echo-planar MRI” In Magnetic resonance in medicine 34.4 Wiley Online Library, 1995, pp. 537–541
- Karl J Friston “Functional and effective connectivity: a review” In Brain connectivity 1.1, 2011, pp. 13–36
- “Allen Brain Atlas: an integrated spatio-temporal portal for exploring the central nervous system” In Nucleic acids research 41.D1 Oxford University Press, 2012
- “Wide-field and two-photon imaging of brain activity with voltage and calcium-sensitive dyes” In Dynamic Brain Imaging: Multi-Modal Methods and In Vivo Applications Springer, 2009, pp. 43–79
- “Imaging cortical dynamics in GCaMP transgenic rats with a head-mounted widefield macroscope” In Neuron 100.5 Elsevier, 2018, pp. 1045–1058
- “In vivo widefield calcium imaging of the mouse cortex for analysis of network connectivity in health and brain disease” In Neuroimage 199 Elsevier, 2019
- “Large-scale imaging of cortical dynamics during sensory perception and behavior” In Journal of neurophysiology 115.6 American Physiological Society Bethesda, MD, 2016, pp. 2852–2866
- “Two-photon calcium imaging in mice navigating a virtual reality environment” In JoVE, 2014, pp. e50885
- “An accumulation-of-evidence task using visual pulses for mice navigating in virtual reality” In Frontiers in behavioral neuroscience 12 Frontiers Media SA, 2018, pp. 36
- “Novel two-alternative forced choice paradigm for bilateral vibrotactile whisker frequency discrimination in head-fixed mice and rats” In Journal of neurophysiology 109.1 American Physiological Society Bethesda, MD, 2013, pp. 273–284
- “Sources of noise during accumulation of evidence in unrestrained and voluntarily head-restrained rats” In Elife 4 eLife Sciences Publications, Ltd, 2015, pp. e11308
- Leland McInnes, John Healy and James Melville “Umap: Uniform manifold approximation and projection for dimension reduction”, 2018
- “Nearest neighbor pattern classification” In IEEE transactions on information theory 13.1 IEEE, 1967, pp. 21–27
- Roman Vershynin “High-dimensional probability” In University of California, Irvine, 2020
- Joel A Tropp “An introduction to matrix concentration inequalities” In Foundations and Trends® in Machine Learning 8.1-2 Now Publishers, Inc., 2015, pp. 1–230