Optimizing Long-tailed Link Prediction in Graph Neural Networks through Structure Representation Enhancement (2407.20499v1)
Abstract: Link prediction, a fundamental task for graph neural networks (GNNs), has seen significant progress in varied domains. Its success typically depends on the expressive power of node representations, but recent work reveals that low-degree nodes perform worse because of their sparse neighbor connections, a phenomenon known as the degree-based long-tailed problem. Does the degree-based long-tailed distribution similarly constrain the efficacy of GNNs on link prediction? Unexpectedly, our study reveals that only a mild correlation exists between node degree and predictive accuracy; more importantly, the number of common neighbors between node pairs exhibits a strong correlation with accuracy. Because node pairs with fewer common neighbors, i.e., tail node pairs, make up a substantial fraction of the dataset but achieve worse performance, we propose that link prediction also faces a long-tailed problem, and that link prediction with GNNs is greatly hindered by tail node pairs. Given this weakness, a natural question is how to eliminate the negative effects of the skewed long-tailed distribution of common neighbors so as to improve link prediction performance. To this end, we introduce our long-tailed framework (LTLP), which is designed to enhance the performance of tail node pairs on link prediction by increasing their common neighbors. The two key modules in LTLP respectively supplement high-quality edges for tail node pairs and enforce representational alignment between head and tail node pairs within the same category, thereby improving the performance of tail node pairs.
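To make the notion of head and tail node pairs concrete, the following minimal sketch counts common neighbors for candidate node pairs and splits them by a cutoff. It is only an illustration under stated assumptions, not the LTLP implementation: the helper names (`build_adjacency`, `common_neighbor_count`, `split_head_tail`), the toy graph, and the `threshold` value are all hypothetical, and the abstract does not specify the cutoff the authors use.

```python
def build_adjacency(edges):
    """Build an undirected adjacency map {node: set(neighbors)} from an edge list."""
    adj = {}
    for u, v in edges:
        adj.setdefault(u, set()).add(v)
        adj.setdefault(v, set()).add(u)
    return adj


def common_neighbor_count(adj, u, v):
    """Number of nodes adjacent to both u and v."""
    return len(adj.get(u, set()) & adj.get(v, set()))


def split_head_tail(adj, pairs, threshold=1):
    """Split node pairs into head (>= threshold common neighbors) and tail (< threshold).

    The threshold is an assumed, illustrative cutoff, not a value from the paper.
    """
    head, tail = [], []
    for u, v in pairs:
        if common_neighbor_count(adj, u, v) < threshold:
            tail.append((u, v))
        else:
            head.append((u, v))
    return head, tail


# Toy undirected graph (hypothetical data).
edges = [(0, 1), (0, 2), (1, 2), (1, 3), (2, 3), (3, 4)]
adj = build_adjacency(edges)

# Candidate node pairs whose link status we might want to predict.
pairs = [(0, 3), (0, 4), (1, 4), (2, 4)]
head, tail = split_head_tail(adj, pairs, threshold=1)
print("head pairs:", head)
print("tail pairs:", tail)
```

In this toy graph the pair (0, 4) shares no neighbors and lands in the tail group, while the other pairs share at least one neighbor. On a real dataset one would typically compare link prediction accuracy across such groups to examine the correlation the abstract describes.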