AliGraph: A Comprehensive Graph Neural Network Platform (1902.08730v1)

Published 23 Feb 2019 in cs.DC

Abstract: An increasing number of machine learning tasks require dealing with large graph datasets, which capture rich and complex relationship among potentially billions of elements. Graph Neural Network (GNN) becomes an effective way to address the graph learning problem by converting the graph data into a low dimensional space while keeping both the structural and property information to the maximum extent and constructing a neural network for training and referencing. However, it is challenging to provide an efficient graph storage and computation capabilities to facilitate GNN training and enable development of new GNN algorithms. In this paper, we present a comprehensive graph neural network system, namely AliGraph, which consists of distributed graph storage, optimized sampling operators and runtime to efficiently support not only existing popular GNNs but also a series of in-house developed ones for different scenarios. The system is currently deployed at Alibaba to support a variety of business scenarios, including product recommendation and personalized search at Alibaba's E-Commerce platform. By conducting extensive experiments on a real-world dataset with 492.90 million vertices, 6.82 billion edges and rich attributes, AliGraph performs an order of magnitude faster in terms of graph building (5 minutes vs hours reported from the state-of-the-art PowerGraph platform). At training, AliGraph runs 40%-50% faster with the novel caching strategy and demonstrates around 12 times speed up with the improved runtime. In addition, our in-house developed GNN models all showcase their statistically significant superiorities in terms of both effectiveness and efficiency (e.g., 4.12%-17.19% lift by F1 scores).

Summary

The paper presents AliGraph, a GNN platform for large-scale graphs whose caching strategy makes training 40%-50% faster.
Its main components are distributed graph storage, optimized sampling operators, and a runtime, which together support both popular GNNs and Alibaba's in-house models.
Experiments on a real-world dataset with 492.90 million vertices and 6.82 billion edges show F1 lifts of 4.12%-17.19% for the in-house models and a speedup of around 12 times from the improved runtime.

Overview of AliGraph: A Comprehensive Graph Neural Network Platform

The paper presents AliGraph, a Graph Neural Network (GNN) platform built for large-scale graph datasets. Existing systems struggle to provide the efficient graph storage and computation that GNN training needs, so AliGraph is designed to supply both and to support the development of new GNN algorithms. The platform is deployed at Alibaba for business scenarios such as product recommendation and personalized search.

AliGraph combines distributed graph storage, optimized sampling operators, and a runtime built for GNN workloads. On the paper's real-world dataset, it builds the graph in 5 minutes, versus the hours reported for the state-of-the-art PowerGraph platform. For training, the novel caching strategy makes runs 40%-50% faster, and the improved runtime gives a speedup of around 12 times.
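
To make the idea of caching during GNN training concrete, here is a minimal sketch that memoizes per-vertex aggregation results so that vertices shared by several neighborhoods are computed only once. It is an illustration under stated assumptions, not AliGraph's caching strategy or API: the class `CachedAggregator`, the mean aggregator, and the averaging combine step are all hypothetical choices made for this example.

```python
from typing import Dict, List, Tuple

Graph = Dict[int, List[int]]          # toy adjacency list (assumption)
Features = Dict[int, List[float]]     # toy per-vertex feature vectors (assumption)


def mean(vectors: List[List[float]]) -> List[float]:
    """Element-wise mean of equally sized vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


class CachedAggregator:
    """Hypothetical helper: k-hop embeddings with memoized intermediate results."""

    def __init__(self, graph: Graph, features: Features) -> None:
        self.graph = graph
        self.features = features
        self.cache: Dict[Tuple[int, int], List[float]] = {}
        self.computed = 0  # counts aggregations that were actually computed

    def embed(self, v: int, hops: int) -> List[float]:
        if hops == 0:
            return self.features[v]
        key = (v, hops)
        if key in self.cache:          # reuse an intermediate result
            return self.cache[key]
        self.computed += 1
        # AGGREGATE: mean of the neighbors' embeddings from the previous hop.
        neighbor_vecs = [self.embed(u, hops - 1) for u in self.graph[v]]
        own = self.embed(v, hops - 1)
        aggregated = mean(neighbor_vecs) if neighbor_vecs else [0.0] * len(own)
        # COMBINE: average the vertex's own vector with the aggregate
        # (a stand-in for a learned combine function).
        result = [(a + b) / 2 for a, b in zip(own, aggregated)]
        self.cache[key] = result
        return result


if __name__ == "__main__":
    graph = {0: [1, 2], 1: [0, 2], 2: [0, 1]}
    features = {0: [1.0], 1: [2.0], 2: [3.0]}
    agg = CachedAggregator(graph, features)
    for vertex in graph:
        print(vertex, agg.embed(vertex, hops=2))
    print("aggregations computed:", agg.computed)
```

In this toy, the cache is keyed by `(vertex, hop)` and is only valid while features and model parameters stay fixed, so a real training loop would clear it between mini-batches.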

Key Contributions of AliGraph

Distributed Graph Storage: AliGraph utilizes a partitioned storage mechanism to handle massive graphs efficiently. This approach leverages structural and attribute-specific storage methods, facilitating rapid data access even in distributed environments.
Advanced Sampling Mechanisms: The platform introduces three types of samplers—traverse, neighborhood, and negative—crucial in enhancing the scalability and accuracy of GNNs. Implementing lock-free methods ensures efficient sampling in a distributed context.
Optimized Operators: By introducing advanced strategies for caching intermediate results during aggregation and combination operations, AliGraph achieves substantial reductions in computational costs. These optimizations are pivotal in the platform's superior training efficiencies.
Algorithmic Flexibility: AliGraph supports a wide range of GNN algorithms, enabling the easy integration of existing methods and the development of novel in-house algorithms. The flexibility in designing GNN algorithms highlights the platform’s adaptability to varied practical requirements.
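
The three sampler types named above can be sketched on a toy in-memory graph. This is a minimal, single-process illustration and not AliGraph's API: the function names, signatures, and the uniform sampling policy are assumptions made for this example, and the lock-free distributed implementation described in the paper is not modeled.

```python
import random
from typing import Dict, List

Graph = Dict[int, List[int]]  # toy adjacency list (assumption)


def traverse_sample(graph: Graph, batch_size: int, rng: random.Random) -> List[int]:
    """TRAVERSE: draw a batch of seed vertices from the graph."""
    vertices = sorted(graph)
    return rng.sample(vertices, min(batch_size, len(vertices)))


def neighborhood_sample(
    graph: Graph, seeds: List[int], fanouts: List[int], rng: random.Random
) -> List[List[int]]:
    """NEIGHBORHOOD: per hop, sample up to `fanout` neighbors of each frontier vertex."""
    hops: List[List[int]] = []
    frontier = seeds
    for fanout in fanouts:
        next_frontier: List[int] = []
        for v in frontier:
            neighbors = graph.get(v, [])
            next_frontier.extend(rng.sample(neighbors, min(fanout, len(neighbors))))
        hops.append(next_frontier)
        frontier = next_frontier
    return hops


def negative_sample(graph: Graph, v: int, num_neg: int, rng: random.Random) -> List[int]:
    """NEGATIVE: sample vertices that are neither v nor neighbors of v."""
    excluded = set(graph.get(v, [])) | {v}
    candidates = [u for u in sorted(graph) if u not in excluded]
    return rng.sample(candidates, min(num_neg, len(candidates)))


if __name__ == "__main__":
    rng = random.Random(0)
    graph = {0: [1, 2], 1: [0, 2], 2: [0, 1, 3], 3: [2], 4: []}
    seeds = traverse_sample(graph, batch_size=2, rng=rng)
    print("seeds:", seeds)
    print("2-hop context:", neighborhood_sample(graph, seeds, fanouts=[2, 1], rng=rng))
    print("negatives for first seed:", negative_sample(graph, seeds[0], num_neg=2, rng=rng))
```

A training step would typically chain the three: seeds from the traverse sampler, context from the neighborhood sampler, and contrast examples from the negative sampler.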

Experimental Evaluation

Experiments on a large-scale real-world dataset from Taobao, with 492.90 million vertices, 6.82 billion edges, and rich attributes, show large gains in graph building time and training efficiency. AliGraph's in-house GNN models lift F1 scores by 4.12% to 17.19% compared to state-of-the-art methods, a gain the authors report as statistically significant.

Implications and Future Directions

The introduction of AliGraph has significant implications for the field of AI, particularly in domains requiring the extraction of intricate insights from large and complex graph data. Its deployment within Alibaba suggests substantial practical benefits in commercial applications, which can be extended to varied industrial contexts.

Theoretically, AliGraph opens avenues for exploring edge-specific and subgraph-level embeddings, potentially advancing the understanding and application of GNNs in dynamic and heterogeneous data environments. Future developments in AliGraph could explore additional execution optimizations, auto-ML for algorithm selection, and early-stop mechanisms to streamline training processes further.

In summary, AliGraph signifies a major step forward in the application of GNNs to tackle complex real-world problems, offering both academic researchers and industry professionals a powerful tool to leverage the full potential of graph-based data analysis.
