Graph Condensation: A Survey (2401.11720v2)
Abstract: The rapid growth of graph data poses significant challenges in storage, transmission, and particularly the training of graph neural networks (GNNs). To address these challenges, graph condensation (GC) has emerged as an innovative solution. GC focuses on synthesizing a compact yet highly representative graph, enabling GNNs trained on it to achieve performance comparable to those trained on the original large graph. The notable efficacy of GC and its broad prospects have garnered significant attention and spurred extensive research. This survey paper provides an up-to-date and systematic overview of GC, organizing existing research into five categories aligned with critical GC evaluation criteria: effectiveness, generalization, efficiency, fairness, and robustness. To facilitate an in-depth and comprehensive understanding of GC, this paper examines various methods under each category and thoroughly discusses two essential components within GC: optimization strategies and condensed graph generation. We also empirically compare and analyze representative GC methods with diverse optimization strategies based on the five proposed GC evaluation criteria. Finally, we explore the applications of GC in various fields, outline the related open-source libraries, and highlight the present challenges and novel insights, with the aim of promoting advancements in future research. The related resources can be found at https://github.com/XYGaoG/Graph-Condensation-Papers.
- Graph based anomaly detection and description: a survey. Data mining and knowledge discovery, 29:626–688, 2015.
- Anonymous. CTRL: Graph condensation via crafting rational trajectory matching. In Openview, 2023.
- Digraphs: theory, algorithms and applications. Springer Science & Business Media, 2008.
- A survey on spectral graph neural networks. arXiv preprint arXiv:2302.05631, 2023.
- An entity event knowledge graph for human resources management in public administration: the case of education personnel. In 2023 IEEE 25th Conference on Business Informatics (CBI), pages 1–8. IEEE, 2023.
- A unifying framework for spectrum-preserving graph sparsification and coarsening. Advances in Neural Information Processing Systems, 32, 2019.
- Dataset distillation by matching training trajectories. In CVPR, pages 4750–4759, 2022.
- Faster hyperparameter search for GNNs via calibrated dataset condensation. arXiv, 2022.
- Graph neural tangent kernel: Fusing graph neural networks with graph kernels. In NeurIPS, 2019.
- Graph lifelong learning: A survey. IEEE Computational Intelligence Magazine, 18(1):32–51, 2023.
- Fair graph distillation. In NeurIPS, 2023.
- Federated graph machine learning: A survey of concepts, techniques, and applications. ACM SIGKDD Explorations Newsletter, 24(2):32–47, 2022.
- Multiple sparse graphs condensation. Knowledge-Based Systems, 2023.
- Graph neural architecture search. In International joint conference on artificial intelligence. International Joint Conference on Artificial Intelligence, 2021.
- Semantic-aware node synthesis for imbalanced heterogeneous information networks. In Proceedings of the 32nd ACM International Conference on Information and Knowledge Management, pages 545–555, 2023.
- Accelerating scalable graph neural network inference with node-adaptive propagation. arXiv preprint arXiv:2310.10998, 2023.
- Graph condensation for inductive node representation learning. In ICDE, 2024.
- A survey on dataset distillation: Approaches, applications and future directions. In IJCAI, 2023.
- A kernel two-sample test. JMLR, 2012.
- Graph-based molecular representation learning. arXiv preprint arXiv:2207.04869, 2022.
- Inductive representation learning on large graphs. In Advances in Neural Information Processing Systems 30: Annual Conference on Neural Information Processing Systems 2017, December 4-9, 2017, Long Beach, CA, USA, pages 1024–1034, 2017.
- Condensing graphs via one-step gradient matching. In SIGKDD, 2022.
- Graph condensation for graph neural networks. In ICLR, 2022.
- Irie: Scalable and robust influence maximization in social networks. In 2012 IEEE 12th International Conference on Data Mining, pages 918–923. IEEE, 2012.
- Representation learning for dynamic graphs: A survey. The Journal of Machine Learning Research, 21(1):2648–2720, 2020.
- Link prediction techniques, applications, and performance: A survey. Physica A: Statistical Mechanics and its Applications, 553:124289, 2020.
- A comprehensive survey of dataset distillation. TPAMI, 2024.
- Knowledge mining and social dangerousness assessment in criminal justice: metaheuristic integration of machine learning and graph-based inference. Artificial Intelligence and Law, 31(4):653–702, 2023.
- Influence maximization on social graphs: A survey. IEEE Transactions on Knowledge and Data Engineering, 30(10):1852–1872, 2018.
- Attend who is weak: Enhancing graph condensation via cross-free adversarial training. arXiv, 2023.
- Graph condensation via receptive field distribution matching. arXiv, 2022.
- Graph condensation via eigenbasis matching. arXiv, 2023.
- CaT: Balanced continual graph learning with graph condensation. In ICDM, 2023.
- PUMA: Efficient continual graph learning with graph condensation. arXiv, 2023.
- Spectrally approximating large graphs with smaller graphs. In Proceedings of the 35th International Conference on Machine Learning, ICML 2018, 2018.
- The concrete distribution: A continuous relaxation of discrete random variables. In ICLR, 2016.
- GCARe: Mitigating subgroup unfairness in graph condensation through adversarial regularization. Applied Sciences, 2023.
- A survey on bias and fairness in machine learning. ACM computing surveys (CSUR), 2021.
- Dataset meta-learning from kernel ridge-regression. In ICLR, 2020.
- Graph neural architecture search: A survey. Tsinghua Science and Technology, 27(4):692–708, 2021.
- FedGKD: Unleashing the power of collaboration in federated graph neural networks. arXiv, 2023.
- Graph neural networks for intelligent transportation systems: A survey. IEEE Transactions on Intelligent Transportation Systems, 2023.
- Data distillation: A survey. TMLR, 2023.
- Infinite recommendation networks: A data-centric approach. Advances in Neural Information Processing Systems, 35:31292–31305, 2022.
- A comprehensive survey on graph summarization with graph neural networks. IEEE Transactions on Artificial Intelligence, 2024.
- Graph sparsification by effective resistances. In Proceedings of the fortieth annual ACM symposium on Theory of computing, pages 563–568, 2008.
- Adversarial attack and defense on graph data: A survey. IEEE Transactions on Knowledge and Data Engineering, 2022.
- Recent advances on graph analytics and its applications in healthcare. In Proceedings of the 26th ACM SIGKDD International Conference on Knowledge Discovery & Data Mining, pages 3545–3546, 2020.
- A review on graph neural network methods in financial applications. arXiv preprint arXiv:2111.15367, 2021.
- Fast graph condensation with structure-based neural tangent kernel. arXiv, 2023.
- Simplifying graph convolutional networks. In ICML, 2019.
- Graph neural networks in recommender systems: a survey. ACM Computing Surveys, 55(5):1–37, 2022.
- Graph learning: A survey. IEEE Transactions on Artificial Intelligence, 2(2):109–127, 2021.
- Kernel ridge regression-based graph dataset distillation. In SIGKDD, 2023.
- Does graph distillation see like vision dataset counterpart? In NeurIPS, 2023.
- Data-centric graph learning: A survey. arXiv, 2023.
- Self-supervised learning for recommender systems: A survey. IEEE Transactions on Knowledge and Data Engineering, 2023.
- Dataset distillation: A comprehensive review. TPAMI, 2023.
- Explainability in graph neural networks: A taxonomic survey. IEEE transactions on pattern analysis and machine intelligence, 45(5):5782–5799, 2022.
- A survey on graph neural network acceleration: Algorithms, systems, and customized hardware. arXiv, 2023.
- Bo Zhao and Hakan Bilen. Dataset condensation with distribution matching. In WACV, 2023.
- Dataset condensation with gradient matching. In ICLR, 2020.
- Graph neural networks for graphs with heterophily: A survey. arXiv preprint arXiv:2202.07082, 2022.
- Automl for deep recommender systems: A survey. ACM Transactions on Information Systems, 41(4):1–38, 2023.
- Towards data-centric graph machine learning: Review and outlook. arXiv, 2023.
- Structure-free graph condensation: From large-scale graphs to condensed graph-free data. In NeurIPS, 2023.
- Interpreting and unifying graph neural networks with an optimization framework. In Proceedings of the Web Conference 2021, pages 1215–1226, 2021.
- A survey on deep graph generation: Methods and applications. In Learning on Graphs Conference, pages 47–1. PMLR, 2022.