AutoGraphAD: Unsupervised Graph NIDS
- AutoGraphAD is an unsupervised anomaly detection framework that leverages a heterogeneous variational graph autoencoder to process NetFlow data as temporal, bipartite graphs.
- It constructs graphs by linking connection-flow nodes with IP nodes, enabling localized anomaly scoring in fixed time windows while bypassing the need for labeled data.
- The framework is trained by minimizing reconstruction and KL-divergence losses, with loss weights and the detection threshold tuned via grid search, and achieves high throughput together with robust detection performance.
AutoGraphAD is an unsupervised anomaly detection framework for network intrusion detection systems (NIDS), leveraging a Heterogeneous Variational Graph Autoencoder (VGAE) architecture to process NetFlow data as temporal heterogeneous graphs. Unlike supervised approaches, AutoGraphAD is designed to obviate the need for labeled datasets and downstream anomaly detectors by producing end-to-end anomaly scores based on reconstruction and latent space regularization (Anyfantis et al., 21 Nov 2025).
1. Graph Construction from Network Flows
AutoGraphAD operates on graphs constructed from raw NetFlow records within fixed time windows of length $\Delta t$ seconds. For each time window $w$, a heterogeneous graph $G_w$ is built with:
- Node types:
- $C$: connection-flow nodes, where each node represents a single network flow and has features $x_c \in \mathbb{R}^{d_C}$ (e.g., bytes, packets, durations, flags).
- $IP$: IP-address nodes, one for each distinct IP seen in the window, typically with placeholder or low-dimensional features (e.g., one-hot encoding for IP version).
- Edges:
- Each connection node is linked to its source and destination $IP$ nodes, with no direct $C$–$C$ or $IP$–$IP$ connections.
- Mathematical representation:
- Flow node features: $X_C \in \mathbb{R}^{|C| \times d_C}$
- IP node features: $X_{IP} \in \mathbb{R}^{|IP| \times d_{IP}}$ (low-dimensional)
- Biadjacency matrix: $B \in \{0,1\}^{|C| \times |IP|}$ expresses $C$–$IP$ relations
This structure explicitly encodes the bipartite flow–IP interaction within each snapshot, facilitating localized reasoning over short timescales.
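This construction maps directly onto standard heterogeneous-graph tooling. The following is a minimal sketch using PyTorch Geometric's `HeteroData`; the column names (`src_ip`, `dst_ip`), the node/edge type labels, and the placeholder IP features are illustrative assumptions rather than the paper's exact implementation.

```python
import pandas as pd
import torch
from torch_geometric.data import HeteroData

def build_window_graph(flows: pd.DataFrame, feat_cols: list[str]) -> HeteroData:
    """Build one heterogeneous graph from the NetFlow records of a single
    time window. Assumes `flows` has 'src_ip'/'dst_ip' columns plus numeric
    flow features in `feat_cols`."""
    g = HeteroData()

    # Connection-flow nodes: one per NetFlow record, carrying its feature vector.
    g["conn"].x = torch.tensor(flows[feat_cols].values, dtype=torch.float)

    # IP nodes: one per distinct address in the window, with placeholder features.
    ips = pd.unique(pd.concat([flows["src_ip"], flows["dst_ip"]]))
    ip_index = {ip: i for i, ip in enumerate(ips)}
    g["ip"].x = torch.ones(len(ips), 1)  # low-dimensional placeholder

    # Each connection node links only to its source and destination IP nodes;
    # there are no conn-conn or ip-ip edges (bipartite structure).
    conn_ids = torch.arange(len(flows))
    src = torch.tensor([ip_index[ip] for ip in flows["src_ip"]])
    dst = torch.tensor([ip_index[ip] for ip in flows["dst_ip"]])
    g["conn", "src", "ip"].edge_index = torch.stack([conn_ids, src])
    g["conn", "dst", "ip"].edge_index = torch.stack([conn_ids, dst])
    return g
```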
2. Model Architecture: Heterogeneous Variational Graph Autoencoder
AutoGraphAD employs a heterogeneous VGAE customized for bipartite ($C$, $IP$) graphs, reconstructing both network structure and node attributes.
- Encoder: A GNN (e.g., GraphSAGE, GCN) processes masked input graphs (masking applied only to $C$-nodes' features).
- Generates node-level variational parameters: $\mu_v$, $\log\sigma_v^2$ for all nodes $v$.
- Latent embedding via reparametrization: $z_v = \mu_v + \sigma_v \odot \epsilon$, with $\epsilon \sim \mathcal{N}(0, I)$ and $z_v \in \mathbb{R}^{d_z}$ per node.
- Decoder:
- Structure reconstruction: Predicted adjacency $\hat{B}$, with $\hat{B}_{ci} = \sigma(z_c^\top z_i)$ for $c \in C$, $i \in IP$ (optionally per edge type).
- Feature reconstruction: On $C$-nodes only, a lightweight GNN-decoder maps $z_c$ and the graph structure to reconstructed features $\hat{x}_c$.
- Contrastive component: Negative-edge sampling during training introduces non-edge pairs at a ratio $r_{\text{neg}}$; the model is trained to output low probabilities on such non-edges, implemented via binary cross-entropy loss over edge and non-edge sets.
This architecture supports edge reconstruction and flow feature imputation in a unified variational framework.
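A compact sketch of such an architecture is given below, assuming PyTorch Geometric. The single GraphSAGE layer, latent size, and MLP feature decoder are illustrative choices within the ranges reported in Section 4, not the paper's exact configuration; reverse edge types (e.g., added with `torch_geometric.transforms.ToUndirected`) are assumed so that messages reach both node types.

```python
import torch
import torch.nn as nn
from torch_geometric.nn import HeteroConv, SAGEConv

class HeteroVGAE(nn.Module):
    def __init__(self, d_conn: int, d_ip: int, d_z: int = 64):
        super().__init__()
        # One bipartite GraphSAGE layer per edge type; each emits 2*d_z
        # channels that are split into (mu, log-variance).
        self.enc = HeteroConv({
            ("conn", "src", "ip"): SAGEConv((d_conn, d_ip), 2 * d_z),
            ("conn", "dst", "ip"): SAGEConv((d_conn, d_ip), 2 * d_z),
            ("ip", "rev_src", "conn"): SAGEConv((d_ip, d_conn), 2 * d_z),
            ("ip", "rev_dst", "conn"): SAGEConv((d_ip, d_conn), 2 * d_z),
        }, aggr="sum")
        # Lightweight feature decoder applied to conn-node latents only.
        self.feat_dec = nn.Sequential(nn.Linear(d_z, d_z), nn.ReLU(),
                                      nn.Linear(d_z, d_conn))

    def encode(self, x_dict, edge_index_dict):
        h = self.enc(x_dict, edge_index_dict)      # {node_type: [N, 2*d_z]}
        out = {}
        for ntype, hh in h.items():
            mu, logvar = hh.chunk(2, dim=-1)
            std = torch.exp(0.5 * logvar)
            z = mu + std * torch.randn_like(std)   # reparametrization trick
            out[ntype] = (z, mu, logvar)
        return out

    def decode_edges(self, z_conn, z_ip, edge_index):
        # Inner-product structure decoder: edge probability per (conn, ip) pair.
        c, i = edge_index
        return torch.sigmoid((z_conn[c] * z_ip[i]).sum(dim=-1))

    def decode_feats(self, z_conn):
        return self.feat_dec(z_conn)
```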
3. Losses, Training Objectives, and Anomaly Scoring
- Reconstruction losses:
- Structure: $\mathcal{L}_{\text{struct}} = \mathrm{BCE}(\hat{B}, B)$, computed over observed edges and sampled non-edges.
- Features (on $C$): $\mathcal{L}_{\text{feat}} = \frac{1}{|C|}\sum_{c \in C}\lVert x_c - \hat{x}_c\rVert_2^2$ (MSE) or $\frac{1}{|C|}\sum_{c \in C}\big(1 - \cos(x_c, \hat{x}_c)\big)$ (cosine embedding), chosen per model variant.
- KL-divergence: $\mathcal{L}_{\text{KL}} = D_{\mathrm{KL}}\big(q(Z \mid G)\,\Vert\,\mathcal{N}(0, I)\big)$.
The total loss is $\mathcal{L} = \lambda_{\text{struct}}\mathcal{L}_{\text{struct}} + \lambda_{\text{feat}}\mathcal{L}_{\text{feat}} + \lambda_{\text{KL}}\mathcal{L}_{\text{KL}}$, with the $\lambda$ weights controlling relative weightings; a sketch of the objective follows below.
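A sketch of this combined objective, assuming the `HeteroVGAE` interface above; the default weights and negative-sampling ratio are placeholders within the grids described in Section 4.

```python
import torch
import torch.nn.functional as F
from torch_geometric.utils import negative_sampling

def vgae_loss(model, out, x_conn, edge_index, num_conn, num_ip,
              lam_struct=1.0, lam_feat=1.0, lam_kl=0.1, neg_ratio=0.3):
    z_conn, mu_c, logvar_c = out["conn"]
    z_ip, mu_i, logvar_i = out["ip"]

    # Structure: BCE over observed edges and sampled non-edges (contrastive term).
    neg_edges = negative_sampling(
        edge_index, num_nodes=(num_conn, num_ip),
        num_neg_samples=int(neg_ratio * edge_index.size(1)))
    pos = model.decode_edges(z_conn, z_ip, edge_index)
    neg = model.decode_edges(z_conn, z_ip, neg_edges)
    l_struct = (F.binary_cross_entropy(pos, torch.ones_like(pos)) +
                F.binary_cross_entropy(neg, torch.zeros_like(neg)))

    # Features: MSE on conn-nodes only (cosine embedding is the alternative).
    l_feat = F.mse_loss(model.decode_feats(z_conn), x_conn)

    # KL divergence of each node's posterior against N(0, I).
    def kl(mu, lv):
        return -0.5 * torch.mean(torch.sum(1 + lv - mu**2 - lv.exp(), dim=-1))
    l_kl = kl(mu_c, logvar_c) + kl(mu_i, logvar_i)

    return lam_struct * l_struct + lam_feat * l_feat + lam_kl * l_kl
```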
- Node-level anomaly scoring at inference:
For each connection node $c$, the anomaly score is $s_c = \ell_{\text{feat}}(c) + \ell_{\text{struct}}(c) + \gamma\,\ell_{\text{KL}}(c)$,
where each term corresponds to the node-specific feature/structure residual and KL penalty, and $\gamma$ is an additional weight. Predicted scores are scaled (RobustScaler), and a percentile threshold $\tau$ is selected (via grid search) so that $s_c > \tau$ flags an anomaly, as sketched below.
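The scoring and thresholding step can be sketched as follows, assuming the per-node residuals have already been computed as above; `gamma` and the percentile `q` stand in for the grid-searched values.

```python
import numpy as np
from sklearn.preprocessing import RobustScaler

def anomaly_flags(feat_resid, struct_resid, kl_term, gamma=0.5, q=95.0):
    """All inputs are 1-D NumPy arrays with one entry per connection node.
    In deployment the percentile threshold would be fixed on a validation
    split rather than recomputed per window."""
    scores = feat_resid + struct_resid + gamma * kl_term
    scaled = RobustScaler().fit_transform(scores.reshape(-1, 1)).ravel()
    tau = np.percentile(scaled, q)  # percentile threshold from grid search
    return scaled > tau             # True marks a flagged connection
```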
4. Training Procedure and Hyperparameter Choices
AutoGraphAD is trained with AdamW (with tuned learning rate and weight decay), early stopping (patience 20, up to 100 epochs), and batch size 1 (i.e., one window graph per batch). Key regularization and augmentation settings include:
- Node masking: 20–40% of $C$-nodes masked per epoch
- Edge dropout: 10–20%
- Negative-edge ratio: 20–40%
- Encoder/decoder GNN depth: 1–2 layers
- Latent size: $d_z$, tuned by grid search
- Feature loss: MSE or cosine embedding distance
- Loss weights and threshold grid: the $\lambda$ weights, $\gamma$, and $\tau$ (percentile)
Model selection and threshold tuning are performed via grid search on a held-out, mildly contaminated validation split, as sketched below.
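A schematic of this selection loop is shown below; `train_fn` and `eval_fn` stand in for window-level training and the validation criterion (which the summary above does not specify), and the grids are placeholders rather than the paper's values.

```python
from itertools import product

def select_hparams(train_fn, eval_fn,
                   lam_grid=(0.1, 0.5, 1.0),
                   gamma_grid=(0.1, 0.5, 1.0),
                   q_grid=(90.0, 95.0, 99.0)):
    """Grid search over loss weights, score weight gamma, and percentile q."""
    best, best_score = None, float("-inf")
    for lams, gamma, q in product(product(lam_grid, repeat=3),
                                  gamma_grid, q_grid):
        model = train_fn(*lams)            # train with these loss weights
        score = eval_fn(model, gamma, q)   # criterion on held-out split
        if score > best_score:
            best, best_score = (lams, gamma, q), score
    return best, best_score
```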
5. Performance Evaluation and Runtime Analysis
Benchmarks are performed on the UNSW-NB15 dataset using sliding time windows, with contamination rates of 0%, 3.5%, and 5.7%. AutoGraphAD is compared to Anomal-E’s PyOD-based downstream detectors (PCA, CBLOF, HBOS):
| Contamination | Method | Accuracy (%) | F-Macro (%) | Recall (%) |
|---|---|---|---|---|
| 0% | PCA (Anomal-E) | 96.65 | 82.39 | 98.27 |
| 0% | CBLOF (Anomal-E) | 96.21 | 79.96 | 94.46 |
| 0% | HBOS (Anomal-E) | 96.68 | 82.50 | 98.28 |
| 0% | AutoGraphAD | 97.69 | 84.23 | 87.98 |
Comparable or superior results are observed at higher contamination levels.
Runtime per window:
| Method | Train (s) | Inference (s) |
|---|---|---|
| PCA | 0.0885 | 0.0286 |
| CBLOF | 0.0454 | 0.0323 |
| HBOS | 0.1359 | 0.0869 |
| AutoGraphAD (end-to-end) | 0.0060 | 0.0046 |
AutoGraphAD is 1.18 orders of magnitude faster in training and 1.03 orders of magnitude faster in inference than the Anomal-E + PyOD pipeline, which suggests it is well suited to live operational deployment and concept-drift adaptation.
6. Advantages, Limitations, and Extensions
Strengths:
- Fully unsupervised: eliminates reliance on labeled flows or additional anomaly detectors.
- Produces an end-to-end anomaly score based on reconstruction and variational residuals.
- GPU-native operation with a compact latent dimension yields high per-window throughput.
- Simple retuning for concept drift via adjustment of loss and threshold hyperparameters.
Limitations:
- Performance and detection robustness are sensitive to the selection of $\lambda_{\text{struct}}$, $\lambda_{\text{feat}}$, $\lambda_{\text{KL}}$, and the anomaly percentile $\tau$.
- Static windowing may miss low-rate or long-duration attacks that span multiple time slices.
- No explicit modeling of $IP$–$IP$ interactions, which could be important for detecting lateral movement.
Potential extensions include temporal/dynamic VGAE models over window sequences, adversarial regularization on latent embeddings akin to AR-VGAE, explicit integration of edge features into the decoder, and cross-corpus training with multi-task contrastive objectives.
A plausible implication is that end-to-end, graph-native anomaly scoring may become the preferred paradigm for high-throughput NIDS settings, pending refinements to temporal and multi-view regularization (Anyfantis et al., 21 Nov 2025).