Mitigating Over-Smoothing and Over-Squashing using Augmentations of Forman-Ricci Curvature (2309.09384v3)
Abstract: While Graph Neural Networks (GNNs) have been successfully leveraged for learning on graph-structured data across domains, several potential pitfalls have been described recently. Those include the inability to accurately leverage information encoded in long-range connections (over-squashing), as well as difficulties distinguishing the learned representations of nearby nodes with growing network depth (over-smoothing). An effective way to characterize both effects is discrete curvature: long-range connections that underlie over-squashing effects have low curvature, whereas edges that contribute to over-smoothing have high curvature. This observation has given rise to rewiring techniques, which add or remove edges to mitigate over-smoothing and over-squashing. Several rewiring approaches utilizing graph characteristics, such as curvature or the spectrum of the graph Laplacian, have been proposed. However, existing methods, especially those based on curvature, often require expensive subroutines and careful hyperparameter tuning, which limits their applicability to large-scale graphs. Here we propose a rewiring technique based on Augmented Forman-Ricci curvature (AFRC), a scalable curvature notion that can be computed in linear time. We prove that AFRC effectively characterizes over-smoothing and over-squashing effects in message-passing GNNs. We complement our theoretical results with experiments, which demonstrate that the proposed approach achieves state-of-the-art performance while significantly reducing the computational cost in comparison with other methods. Utilizing fundamental properties of discrete curvature, we propose effective heuristics for hyperparameters in curvature-based rewiring, which avoid expensive hyperparameter searches and further improve the scalability of the proposed approach.
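To make the curvature picture in the abstract concrete, the following is a minimal sketch of an edge-level curvature computation. It assumes the triangle-augmented form of Forman-Ricci curvature for an unweighted, undirected graph, 4 - deg(u) - deg(v) + 3 * (number of triangles containing the edge); the abstract itself does not state this formula, and the function names `build_adjacency` and `augmented_forman_curvature` are hypothetical. It is an illustration of the idea, not the authors' implementation.

```python
from collections import defaultdict


def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)}, skipping self-loops."""
    adj = defaultdict(set)
    for u, v in edges:
        if u == v:
            continue
        adj[u].add(v)
        adj[v].add(u)
    return adj


def augmented_forman_curvature(edges):
    """Return {(u, v): curvature} for each input edge.

    Assumed formula (not given in the abstract):
        4 - deg(u) - deg(v) + 3 * #triangles containing (u, v)
    """
    adj = build_adjacency(edges)
    curvature = {}
    for u, v in edges:
        if u == v:
            continue
        # Common neighbors of u and v correspond to triangles through the edge.
        triangles = len(adj[u] & adj[v])
        curvature[(u, v)] = 4 - len(adj[u]) - len(adj[v]) + 3 * triangles
    return curvature


# Toy graph: two triangles joined by a single bridge edge (2, 3).
example_edges = [(0, 1), (1, 2), (0, 2), (2, 3), (3, 4), (4, 5), (3, 5)]
example_curvature = augmented_forman_curvature(example_edges)
for edge, value in sorted(example_curvature.items(), key=lambda item: item[1]):
    print(edge, value)
```

On this toy graph the bridge (2, 3) receives the lowest value (-2), while edges inside the triangles receive positive values (2 or 3). This matches the abstract's description: low-curvature edges are candidates for over-squashing bottlenecks, and high-curvature edges sit in densely connected regions associated with over-smoothing.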