Embedding Compression with Hashing for Efficient Representation Learning in Large-Scale Graph (2208.05648v1)

Published 11 Aug 2022 in cs.LG, cs.AI, and cs.DB

Abstract: Graph neural networks (GNNs) are deep learning models designed specifically for graph data, and they typically rely on node features as the input to the first layer. When applying such a network to a graph without node features, one can either extract simple graph-based node features (e.g., node degree) or learn the input node representations (i.e., embeddings) while training the network. While the latter approach, training node embeddings, is more likely to lead to better performance, the number of parameters associated with the embeddings grows linearly with the number of nodes. It is therefore impractical to train the input node embeddings together with GNNs within graphics processing unit (GPU) memory in an end-to-end fashion when dealing with industrial-scale graph data. Inspired by the embedding compression methods developed for NLP tasks, we develop a node embedding compression method in which each node is compactly represented by a bit vector instead of a floating-point vector. The parameters used in the compression method can be trained together with the GNN. We show that the proposed node embedding compression method achieves superior performance compared to the alternatives.
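
The abstract's core idea, replacing each node's floating-point embedding with a compact, trainable bit vector, can be sketched as follows. This is a minimal illustrative sketch in PyTorch, not the paper's exact formulation: it assumes a straight-through estimator for binarization and a small learned projection from bits to the dense vector the GNN consumes. The class name, bit width, and projection layer are hypothetical choices made for the example.

```python
import torch
import torch.nn as nn

class BinaryNodeEmbedding(nn.Module):
    """Sketch of compressed node embeddings: each node keeps a k-bit
    code instead of a d-dim float vector, and a shared learned
    projection maps the bits to the dense embedding fed to a GNN.
    The binarization is trained end-to-end with a straight-through
    estimator (an illustrative choice, not necessarily the paper's)."""

    def __init__(self, num_nodes: int, num_bits: int, embed_dim: int):
        super().__init__()
        # Real-valued logits are kept during training; at inference
        # only their signs matter, so storage shrinks from
        # num_nodes * embed_dim floats to num_nodes * num_bits bits.
        self.logits = nn.Parameter(torch.randn(num_nodes, num_bits) * 0.1)
        self.project = nn.Linear(num_bits, embed_dim)

    def forward(self, node_ids: torch.Tensor) -> torch.Tensor:
        z = self.logits[node_ids]
        hard = (z > 0).float()          # hard {0, 1} bit codes
        soft = torch.sigmoid(z)
        # Straight-through estimator: forward pass uses the hard bits,
        # backward pass routes gradients through the soft relaxation.
        bits = soft + (hard - soft).detach()
        return self.project(bits)       # dense input for the GNN

# Usage: drop-in replacement for a plain nn.Embedding input layer.
emb = BinaryNodeEmbedding(num_nodes=100_000, num_bits=64, embed_dim=128)
x = emb(torch.tensor([0, 42, 99_999]))  # -> shape (3, 128)
```

Under these assumptions, per-node storage drops from embed_dim 32-bit floats to num_bits bits, with only the small shared projection kept in float precision, which is what makes training input embeddings alongside the GNN feasible for industrial-scale graphs.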

Authors (9)
  1. Chin-Chia Michael Yeh (43 papers)
  2. Mengting Gu (4 papers)
  3. Yan Zheng (102 papers)
  4. Huiyuan Chen (43 papers)
  5. Javid Ebrahimi (7 papers)
  6. Zhongfang Zhuang (32 papers)
  7. Junpeng Wang (53 papers)
  8. Liang Wang (512 papers)
  9. Wei Zhang (1489 papers)
Citations (16)