Graph Pre-Training Models Are Strong Anomaly Detectors (2410.18487v1)
Abstract: Graph Anomaly Detection (GAD) is a challenging and practical research topic where Graph Neural Networks (GNNs) have recently shown promising results. The effectiveness of existing GNNs in GAD has been mainly attributed to the simultaneous learning of node representations and the classifier in an end-to-end manner. Meanwhile, graph pre-training, the two-stage learning paradigm such as DGI and GraphMAE, has shown potential in leveraging unlabeled graph data to enhance downstream tasks, yet its impact on GAD remains under-explored. In this work, we show that graph pre-training models are strong graph anomaly detectors. Specifically, we demonstrate that pre-training is highly competitive, markedly outperforming the state-of-the-art end-to-end training models when faced with limited supervision. To understand this phenomenon, we further uncover pre-training enhances the detection of distant, under-represented, unlabeled anomalies that go beyond 2-hop neighborhoods of known anomalies, shedding light on its superior performance against end-to-end models. Moreover, we extend our examination to the potential of pre-training in graph-level anomaly detection. We envision this work to stimulate a re-evaluation of pre-training's role in GAD and offer valuable insights for future research.
- Charu C Aggarwal. 2013. Outlier ensembles: position paper. ACM SIGKDD Explorations Newsletter 14, 2 (2013), 49–58.
- Fake news, disinformation and misinformation in social media: a review. Social Network Analysis and Mining 13, 1 (2023), 30.
- Oddball: Spotting anomalies in weighted graphs. In Advances in Knowledge Discovery and Data Mining: 14th Pacific-Asia Conference, PAKDD 2010, Hyderabad, India, June 21-24, 2010. Proceedings. Part II 14. Springer, 410–421.
- Anomaly detection in online social network: A survey. In 2017 International Conference on Inventive Communication and Computational Technologies (ICICCT). IEEE, 456–459.
- Alessandro Bondielli and Francesco Marcelloni. 2019. A survey on fake news and rumour detection techniques. Information Sciences 497 (2019), 38–55.
- Protein function prediction via graph kernels. Bioinformatics 21, suppl_1 (2005), i47–i56.
- Chen Cai and Yusu Wang. 2018. A simple yet effective baseline for non-attributed graph classification. arXiv preprint arXiv:1811.03508 (2018).
- Wiener Graph Deconvolutional Network Improves Graph Self-Supervised Learning. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 7131–7139.
- Ailin Deng and Bryan Hooi. 2021. Graph neural network-based anomaly detection in multivariate time series. In Proceedings of the AAAI conference on artificial intelligence. 4027–4035.
- Deep anomaly detection on attributed networks. In Proceedings of the SDM. SIAM, 594–602.
- Enhancing graph neural network-based fraud detectors against camouflaged fraudsters. In CIKM. 315–324.
- Graph Anomaly Detection via Multi-Scale Contrastive Learning Networks with Augmented View. arXiv preprint arXiv:2212.00535 (2022).
- SLAPS: Self-supervision improves structure learning for graph neural networks, Vol. 34. 22667–22681.
- Detecting sensor faults, anomalies and outliers in the internet of things: A survey on the challenges and solutions. Electronics 9, 3 (2020), 511.
- Addressing Heterophily in Graph Anomaly Detection: A Perspective of Graph Spectrum. In Proceedings of the ACM Web Conference.
- Inductive representation learning on large graphs. In NeurIPS.
- Data Mining: Concepts and Techniques, 3rd edition. Morgan Kaufmann.
- ADBench: Anomaly detection benchmark. In Advances in Neural Information Processing Systems (NeurIPS).
- Kaveh Hassani and Amir Hosein Khasahmadi. 2020. Contrastive multi-view representation learning on graphs. In International conference on machine learning. PMLR, 4116–4126.
- Bayesian anomaly detection methods for social networks. The Annals of Applied Statistics 4, 2 (2010), 645–662.
- Financial fraud: a review of anomaly detection techniques and recent advances. Expert systems With applications 193 (2022), 116429.
- Geoffrey E Hinton and Richard Zemel. 1993. Autoencoders, minimum description length and Helmholtz free energy. Advances in neural information processing systems 6 (1993).
- GraphMAE2: A Decoding-Enhanced Masked Self-Supervised Graph Learner. In Proceedings of the ACM Web Conference 2023. 737–746.
- Graphmae: Self-supervised masked graph autoencoders. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 594–604.
- Dgraph: A large-scale financial dataset for graph anomaly detection. In Thirty-sixth Conference on Neural Information Processing Systems Datasets and Benchmarks Track.
- Derivation and validation of toxicophores for mutagenicity prediction. Journal of medicinal chemistry 48, 1 (2005), 312–320.
- Thomas N Kipf and Max Welling. 2016. Variational Graph Auto-Encoders. In NeurIPS Workshop on Bayesian Deep Learning.
- Thomas N. Kipf and Max Welling. 2017. Semi-Supervised Classification with Graph Convolutional Networks. In ICLR.
- Predicting dynamic embedding trajectory in temporal interaction networks. In Proceedings of the 25th ACM SIGKDD international conference on knowledge discovery & data mining. 1269–1278.
- DAGAD: Data Augmentation for Graph Anomaly Detection. arXiv preprint arXiv:2210.09766 (2022).
- BOND: Benchmarking Unsupervised Outlier Node Detection on Static Attributed Graphs. In Advances in Neural Information Processing Systems, Vol. 35.
- Pick and Choose: A GNN-based Imbalanced Learning Approach for Fraud Detection. In Proceedings of the Web Conference 2021.
- Graph self-supervised learning: A survey. IEEE Transactions on Knowledge and Data Engineering (2022).
- Mul-GAD: a semi-supervised graph anomaly detection framework via aggregating multi-view information. arXiv preprint arXiv:2212.05478 (2022).
- Revisiting Graph Contrastive Learning for Anomaly Detection. arXiv preprint arXiv:2305.02496 (2023).
- Geniepath: Graph neural networks with adaptive receptive paths. In Proceedings of the AAAI Conference on Artificial Intelligence. 4424–4431.
- Alleviating the inconsistency problem of applying graph neural network to fraud detection. In SIGIR. 1569–1572.
- Deep graph-level anomaly detection by glocal knowledge distillation. In Proceedings of the Fifteenth ACM International Conference on Web Search and Data Mining. 704–714.
- A comprehensive survey on graph anomaly detection with deep learning. IEEE Transactions on Knowledge and Data Engineering (2021).
- Towards graph-level anomaly detection via deep evolutionary mapping. In Proceedings of the 29th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 1631–1642.
- Julian John McAuley and Jure Leskovec. 2013. From amateurs to connoisseurs: modeling the evolution of user expertise through online reviews. In WWW.
- TUDataset: A collection of benchmark datasets for learning with graphs. In Proceedings of the ICML Workshop on Graph Representation Learning and Beyond. 1–11.
- Caleb C Noble and Diane J Cook. 2003. Graph-based anomaly detection. In KDD.
- Adversarially regularized graph autoencoder for graph embedding. In IJCAI.
- A critical look at the evaluation of GNNs under heterophily: are we really making progress?. In ICLR.
- Raising the Bar in Graph-level Anomaly Detection. https://arxiv.org/abs/2205.13845
- Shebuti Rayana and Leman Akoglu. 2015. Collective opinion spam detection: Bridging review networks and metadata. In Proceedings of the 21th acm sigkdd international conference on knowledge discovery and data mining. 985–994.
- Kaspar Riesen and Horst Bunke. 2008. IAM graph database repository for graph based pattern recognition and machine learning. In Structural, Syntactic, and Statistical Pattern Recognition: Joint IAPR International Workshop, SSPR & SPR 2008, Orlando, USA, December 4-6, 2008. Proceedings. Springer, 287–297.
- Frank Rosenblatt. 1958. The perceptron: a probabilistic model for information storage and organization in the brain. Psychological review 65, 6 (1958), 386.
- Weisfeiler-lehman graph kernels. Journal of Machine Learning Research 12, 9 (2011).
- S2GAE: Self-Supervised Graph Autoencoders are Generalizable Learners with Graph Masking. In Proceedings of the Sixteenth ACM International Conference on Web Search and Data Mining. 787–795.
- GADBench: Revisiting and Benchmarking Supervised Graph Anomaly Detection. In Thirty-seventh Conference on Neural Information Processing Systems Datasets and Benchmarks Track.
- Rethinking Graph Neural Networks for Anomaly Detection. In International Conference on Machine Learning.
- Large-Scale Representation Learning on Graphs via Bootstrapping. In ICLR.
- Graph attention networks. In ICLR.
- Deep graph infomax. In ICLR.
- A semi-supervised graph attentive network for financial fraud detection. In ICDM. IEEE, 598–607.
- Decoupling representation learning and classification for gnn-based anomaly detection. In Proceedings of the 44th International ACM SIGIR Conference on Research and Development in Information Retrieval. 1239–1248.
- Anti-money laundering in bitcoin: Experimenting with graph convolutional networks for financial forensics. arXiv preprint arXiv:1908.02591 (2019).
- Self-supervised learning of graph neural networks: A unified review. IEEE Transactions on Pattern Analysis and Machine Intelligence (2022).
- How powerful are graph neural networks? ICLR (2019).
- Pinar Yanardag and SVN Vishwanathan. 2015. Deep graph kernels. In Proceedings of the 21th ACM SIGKDD international conference on knowledge discovery and data mining. 1365–1374.
- FRAUDRE: Fraud Detection Dual-Resistant to Graph Inconsistency and Imbalance. In 2021 IEEE International Conference on Data Mining (ICDM). IEEE, 867–876.
- Dual-discriminative graph neural network for imbalanced graph-level anomaly detection. Advances in Neural Information Processing Systems 35 (2022), 24144–24157.
- From canonical correlation analysis to self-supervised graph neural networks. In NeurIPS.
- COSTA: covariance-preserving feature augmentation for graph contrastive learning. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 2524–2534.
- Spectral feature augmentation for graph contrastive learning and beyond. In Proceedings of the AAAI Conference on Artificial Intelligence, Vol. 37. 11289–11297.
- Lingxiao Zhao and Leman Akoglu. 2021. On using classification datasets to evaluate graph outlier detection: Peculiar observations and new insights. Big Data (2021).
- Error-Bounded Graph Anomaly Loss for GNNs. In CIKM.
- AddGraph: Anomaly Detection in Dynamic Graph Using Attention-based Temporal GCN.. In IJCAI. 4419–4425.
- Beyond homophily in graph neural networks: Current limitations and effective designs. Advances in Neural Information Processing Systems 33 (2020).
- Deep Graph Contrastive Representation Learning. In ICML Workshop on Graph Representation Learning and Beyond.