Theoretical and Empirical Insights into the Origins of Degree Bias in Graph Neural Networks (2404.03139v2)
Abstract: Graph Neural Networks (GNNs) often perform better for high-degree nodes than low-degree nodes on node classification tasks. This degree bias can reinforce social marginalization by, e.g., privileging celebrities and other high-degree actors in social networks during social and content recommendation. While researchers have proposed numerous hypotheses for why GNN degree bias occurs, we find via a survey of 38 degree bias papers that these hypotheses are often not rigorously validated, and can even be contradictory. Thus, we provide an analysis of the origins of degree bias in message-passing GNNs with different graph filters. We prove that high-degree test nodes tend to have a lower probability of misclassification regardless of how GNNs are trained. Moreover, we show that degree bias arises from a variety of factors that are associated with a node's degree (e.g., homophily of neighbors, diversity of neighbors). Furthermore, we show that during training, some GNNs may adjust their loss on low-degree nodes more slowly than on high-degree nodes; however, with sufficiently many epochs of training, message-passing GNNs can achieve their maximum possible training accuracy, which is not significantly limited by their expressive power. Throughout our analysis, we connect our findings to previously-proposed hypotheses for the origins of degree bias, supporting and unifying some while drawing doubt to others. We validate our theoretical findings on 8 common real-world networks, and based on our theoretical and empirical insights, describe a roadmap to alleviate degree bias.
- Towards a unified framework for fair and stable graph representation learning. 2021.
- Albert-László Barabási. Network science. Philosophical Transactions of the Royal Society A: Mathematical, Physical and Engineering Sciences, 371, 2016.
- Deep gaussian embedding of graphs: Unsupervised inductive learning via ranking. In International Conference on Learning Representations, 2018.
- Implications of topological imbalance for representation learning on biomedical knowledge graphs. Briefings in Bioinformatics, 23(5):bbac279, 07 2022.
- How graph convolutions amplify popularity bias for recommendation? Frontiers of Computer Science, 18(5), December 2023.
- Simple and deep graph convolutional networks. In Hal Daumé III and Aarti Singh, editors, Proceedings of the 37th International Conference on Machine Learning, volume 119 of Proceedings of Machine Learning Research, pages 1725–1735. PMLR, 13–18 Jul 2020.
- Anchor-enhanced geographical entity representation learning. IEEE Transactions on Neural Networks and Learning Systems, pages 1–15, 2023.
- Ba-gnn: On learning bias-aware graph neural network. In 2022 IEEE 38th International Conference on Data Engineering (ICDE), pages 3012–3024, 2022.
- Contextual stochastic block models. In S. Bengio, H. Wallach, H. Larochelle, K. Grauman, N. Cesa-Bianchi, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 31. Curran Associates, Inc., 2018.
- Inductive dummy-based homogeneous neighborhood augmentation for graph collaborative filtering. In 2023 International Joint Conference on Neural Networks (IJCNN), pages 1–8, 2023.
- Studying the effect of GNN spatial convolutions on the embedding space’s geometry. In Robin J. Evans and Ilya Shpitser, editors, Proceedings of the Thirty-Ninth Conference on Uncertainty in Artificial Intelligence, volume 216 of Proceedings of Machine Learning Research, pages 539–548. PMLR, 31 Jul–04 Aug 2023.
- Brian Everitt. The Cambridge dictionary of statistics. Cambridge University Press, Cambridge, UK; New York, 2002.
- Should graph convolution trust neighbors? a simple causal inference method. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’21, page 1208–1218, New York, NY, USA, 2021. Association for Computing Machinery.
- Fast graph representation learning with PyTorch Geometric. In ICLR Workshop on Representation Learning on Graphs and Manifolds, 2019.
- Understanding convolution on graphs via energies. Transactions on Machine Learning Research, 2023.
- Inductive representation learning on large graphs. In Proceedings of the 31st International Conference on Neural Information Processing Systems, NIPS’17, page 1025–1035, Red Hook, NY, USA, 2017. Curran Associates Inc.
- Towards label position bias in graph neural networks. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
- Marginal nodes matter: Towards structure fairness in graphs, 2023.
- Mitigating degree biases in message passing mechanism by utilizing community structures, 2023.
- Graphpatcher: Mitigating degree bias for graph neural networks via test-time augmentation. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
- Rawlsgcn: Towards rawlsian difference principle on graph convolutional network. In Proceedings of the ACM Web Conference 2022, WWW ’22, page 1214–1225, New York, NY, USA, 2022. Association for Computing Machinery.
- Adam: A method for stochastic optimization. CoRR, abs/1412.6980, 2014.
- Semi-supervised classification with graph convolutional networks. In International Conference on Learning Representations, 2017.
- On dyadic fairness: Exploring and mitigating bias in graph connections. In International Conference on Learning Representations, 2021.
- A metadata-driven approach to understand graph neural networks. In Thirty-seventh Conference on Neural Information Processing Systems, 2023.
- An explicitly weighted gcn aggregator based on temporal and popularity features for recommendation. ACM Trans. Recomm. Syst., 1(2), apr 2023.
- Tackling long-tailed distribution issue in graph neural networks via normalization. IEEE Transactions on Knowledge and Data Engineering, pages 1–11, 2023.
- Sailor: Structural augmentation based tail node representation learning. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, CIKM ’23, page 1389–1399, New York, NY, USA, 2023. Association for Computing Machinery.
- Local augmentation for graph neural networks. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, and Sivan Sabato, editors, Proceedings of the 39th International Conference on Machine Learning, volume 162 of Proceedings of Machine Learning Research, pages 14054–14072. PMLR, 17–23 Jul 2022.
- Trade less accuracy for fairness and trade-off explanation for gnn. In 2022 IEEE International Conference on Big Data (Big Data), pages 4681–4690, 2022.
- Locality-aware tail node embeddings on homogeneous and heterogeneous networks. IEEE Transactions on Knowledge and Data Engineering, pages 1–16, 2023.
- Tail-gnn: Tail-node graph neural networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, KDD ’21, page 1109–1119, New York, NY, USA, 2021. Association for Computing Machinery.
- On generalized degree fairness in graph neural networks. In Proceedings of the Thirty-Seventh AAAI Conference on Artificial Intelligence and Thirty-Fifth Conference on Innovative Applications of Artificial Intelligence and Thirteenth Symposium on Educational Advances in Artificial Intelligence, AAAI’23/IAAI’23/EAAI’23. AAAI Press, 2023.
- Is homophily a necessity for graph neural networks? In International Conference on Learning Representations, 2022.
- PyTorch: An Imperative Style, High-Performance Deep Learning Library. Curran Associates Inc., Red Hook, NY, USA, 2019.
- Multi-Scale attributed node embedding. Journal of Complex Networks, 9(2):cnab014, 05 2021.
- Pitfalls of graph neural network evaluation. ArXiv, abs/1811.05868, 2018.
- Toward degree bias in embedding-based knowledge graph completion. In Proceedings of the ACM Web Conference 2023, WWW ’23, page 705–715, New York, NY, USA, 2023. Association for Computing Machinery.
- Networked inequality: Preferential attachment bias in graph neural network link prediction. In NeurIPS 2023 Workshop: New Frontiers in Graph Learning, 2023.
- Investigating and mitigating degree-related biases in graph convoltuional networks. In Proceedings of the 29th ACM International Conference on Information & Knowledge Management, CIKM ’20, page 1435–1444, New York, NY, USA, 2020. Association for Computing Machinery.
- Graph attention networks. In International Conference on Learning Representations, 2018.
- Blade: Biased neighborhood sampling based graph neural network for directed graphs. In Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining, WSDM ’23, page 42–50, New York, NY, USA, 2023. Association for Computing Machinery.
- Uncovering the structural fairness in graph contrastive learning. In Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho, editors, Advances in Neural Information Processing Systems, 2022.
- Meta graph learning for long-tail recommendation. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD ’23, page 2512–2522, New York, NY, USA, 2023. Association for Computing Machinery.
- Simplifying graph convolutional networks, 2019.
- Self-supervised graph learning for recommendation. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval, SIGIR ’21, page 726–735, New York, NY, USA, 2021. Association for Computing Machinery.
- Demo-net: Degree-specific graph neural networks for node and graph classification. In Proceedings of the 25th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, KDD ’19, page 406–415, New York, NY, USA, 2019. Association for Computing Machinery.
- Learning how to propagate messages in graph neural networks. In Proceedings of the 27th ACM SIGKDD Conference on Knowledge Discovery & Data Mining, KDD ’21, page 1894–1903, New York, NY, USA, 2021. Association for Computing Machinery.
- Grace: Graph self-distillation and completion to mitigate degree-related biases. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining, KDD ’23, page 2813–2824, New York, NY, USA, 2023. Association for Computing Machinery.
- Hyncf: A hybrid normalization strategy via feature statistics for collaborative filtering. Expert Systems with Applications, 238:121875, 2024.
- How powerful are graph neural networks? In International Conference on Learning Representations, 2019.
- Representation learning on graphs with jumping knowledge networks. In Jennifer Dy and Andreas Krause, editors, Proceedings of the 35th International Conference on Machine Learning, volume 80 of Proceedings of Machine Learning Research, pages 5453–5462. PMLR, 10–15 Jul 2018.
- Lte4g: Long-tail experts for graph neural networks. In Proceedings of the 31st ACM International Conference on Information & Knowledge Management, CIKM ’22, page 2434–2443, New York, NY, USA, 2022. Association for Computing Machinery.
- Incorporating bias-aware margins into contrastive loss for collaborative filtering. Advances in Neural Information Processing Systems, 35:7866–7878, 2022.
- Learning fair representations via rebalancing graph structure. Information Processing & Management, 61(1):103570, 2024.
- Rumour detection on social media with long-tail strategy. In 2022 International Joint Conference on Neural Networks (IJCNN), pages 1–8, 2022.
- Bayesian graph local extrema convolution with long-tail strategy for misinformation detection. ACM Trans. Knowl. Discov. Data, jan 2024. Just Accepted.
- Bilateral filtering graph convolutional network for multi-relational social recommendation in the power-law networks. ACM Trans. Inf. Syst., 40(2), sep 2021.
- Dahgn: Degree-aware heterogeneous graph neural network. Knowledge-Based Systems, 285:111355, 2024.
- Self-supervised graph attention collaborative filtering for recommendation. Electronics, 12(4), 2023.
- Markus Zopf. 1-wl expressiveness is (almost) all you need. In 2022 International Joint Conference on Neural Networks (IJCNN), pages 1–8. IEEE, 2022.
- Arjun Subramonian (22 papers)
- Jian Kang (142 papers)
- Yizhou Sun (149 papers)