LiGNN: Graph Neural Networks at LinkedIn (2402.11139v1)
Abstract: In this paper, we present LiGNN, a deployed large-scale Graph Neural Network (GNN) framework. We share our insights on developing and deploying GNNs at large scale at LinkedIn. We present a set of algorithmic improvements to the quality of GNN representation learning, including temporal graph architectures with long-term losses and effective cold-start solutions via graph densification, ID embeddings, and multi-hop neighbor sampling. We explain how we built our large-scale training on LinkedIn graphs and sped it up by 7x with adaptive sampling of neighbors, grouping and slicing of training data batches, a specialized shared-memory queue, and local gradient optimization. We summarize the deployment lessons learned from A/B test experiments. The techniques presented in this work have contributed to approximate relative improvements of 1% in job application hearing-back rate, a 2% Ads CTR lift, 0.5% in Feed engaged daily active users, a 0.2% session lift, and a 0.1% weekly active user lift from people recommendation. We believe that this work can provide practical solutions and insights for engineers who are interested in applying graph neural networks at large scale.
Authors:
- Fedor Borisyuk
- Shihai He
- Yunbo Ouyang
- Morteza Ramezani
- Peng Du
- Xiaochen Hou
- Chengming Jiang
- Nitin Pasumarthy
- Priya Bannur
- Birjodh Tiwana
- Ping Liu
- Siddharth Dangi
- Daqi Sun
- Zhoutao Pei
- Xiao Shi
- Sirou Zhu
- Qianqi Shen
- Kuang-Hsuan Lee
- David Stein
- Baolei Li