Federated Temporal Graph Clustering
- Federated Temporal Graph Clustering (FTGC) is a decentralized method that clusters dynamic graph data by combining temporal aggregation with federated learning while preserving data privacy.
- FTGC employs graph neural networks for spatial feature extraction and a temporal window mechanism to capture evolving graph structures in client data.
- The framework utilizes federated averaging with parameter sparsification and quantization to ensure efficient communication and high clustering performance, as evidenced on datasets like DBLP and School.
Federated Temporal Graph Clustering (FTGC) is a decentralized approach for clustering dynamic graph data distributed across multiple clients, each holding a private sequence of temporal graph snapshots. FTGC addresses the challenges of temporal graph clustering under data privacy constraints, enabling collaborative discovery of evolving graph structures without centralizing raw data. The framework uses graph neural networks (GNNs) with a specialized temporal aggregation mechanism and a federated learning protocol, balancing clustering fidelity, temporal smoothness, communication efficiency, and privacy preservation (Zhou et al., 2024).
1. Formalization and Clustering Objective
FTGC operates over clients, each storing a temporal graph sequence:
with the node set, the edge set, and node features at time . The clustering task is to compute, for each , a soft assignment into clusters, co-clustering nodes with strong temporal and spatial connectivity and ensuring cluster assignment smoothness over time.
The global objective aggregates local clustering losses, subject to model consistency enforced via federated aggregation:
where 0 are client-local parameters. The core clustering subproblem per client 1 is
2
where 3 is the graph Laplacian and 4 regulates temporal smoothness.
2. Temporal Aggregation and Embedding Construction
Each client computes node embeddings 5 by integrating spatial and temporal information. Spatial aggregation uses a graph convolutional approach:
6
where 7 and 8 is a nonlinear activation. Temporal aggregation leverages a temporal window of size 9:
0
with learnable attention weights 1 (softmax-normalized) and per-offset matrices 2. The final temporal-spatial node embedding is
3
This mechanism captures both local graph structure and its temporal evolution, enabling the model to learn temporally coherent cluster representations.
3. Federated Optimization and Training Process
FTGC employs a federated averaging (FedAvg) protocol augmented with model update compression for scalable and communication-efficient training. The training proceeds over 4 rounds:
- Server broadcasts global model 5.
- Each client (in parallel):
- Receives 6, initializes 7.
- Performs 8 local epochs: computes temporal embeddings 9 for all 0, evaluates and optimizes local loss 1.
- Computes update 2, sparsifies to top 3 entries (4), quantizes (5), and uploads 6.
- Server aggregates updates:
7
Raw graph data 8 and features 9 remain on client devices, ensuring privacy at all stages.
4. Loss Function and Regularization
The per-client loss optimized during local training is composed of a clustering term and a temporal smoothness regularizer:
0
Optionally, an 1 penalty may be applied to 2:
3
Global optimization minimizes the average total loss 4 via the federated loop.
5. Experimental Protocol and Performance
Experiments are conducted on a range of real-world temporal graph datasets partitioned across 5 clients:
- DBLP (co-author network)
- Brain (functional connectivity)
- Patent (citation network)
- School (contact network)
Key experimental hyperparameters include temporal window 6, cluster count 7 (dataset-dependent), local epochs 8, rounds 9, learning rate 0, and compression sparsity 1. Evaluation metrics encompass Clustering Accuracy (ACC), Normalized Mutual Information (NMI), Adjusted Rand Index (ARI), and F1-score (F1).
| Dataset | ACC [%] | NMI [%] | ARI [%] | F1 [%] |
|---|---|---|---|---|
| DBLP | 49.50 | 38.00 | 23.50 | 46.00 |
| Brain | 45.00 | 51.00 | 31.00 | 45.00 |
| Patent | 51.00 | 26.00 | 19.50 | 39.50 |
| School | 99.80 | 99.50 | 99.40 | 99.80 |
FTGC (with 2 clients) consistently matches or outperforms centralized methods such as TGC and TREND, without centralizing raw data.
6. Communication Efficiency and Privacy Protection
Communication overhead is minimized via:
- Transmission of only parameter deltas (3) instead of full model weights
- Sparsification to transmit only the top 4 of gradient entries (e.g., 5)
- Quantization 6 to 8/16-bit precision
Clients perform multiple local updates before transmitting, reducing synchronization frequency. Data privacy is maintained since neither graph structures 7 nor node features 8 are uploaded. Additional protections, such as secure aggregation or differential privacy noise addition to 9, can further enhance privacy properties as needed.
FTGC establishes a robust framework for federated clustering of dynamic graphs, balancing synchronization efficiency, privacy, and clustering quality in a decentralized setting (Zhou et al., 2024).