FLTrust: Byzantine-Resilient FL Aggregation

Updated 7 March 2026

FLTrust is a Byzantine-resilient federated learning protocol that leverages a trusted server-side root dataset to establish a reference update and mitigate model-poisoning attacks.
It computes cosine similarity between normalized client updates and the server’s trusted update, assigning zero weight to misaligned contributions.
Empirical results show FLTrust maintains high accuracy under up to 60% Byzantine attacks, outperforming traditional aggregation methods.

FLTrust is a Byzantine-resilient aggregation protocol for federated learning (FL) that leverages a server-side root dataset to establish a reference model update and quantifiably trust client contributions. It provides robust resilience against malicious (Byzantine) participants in settings where a small, clean, manually curated dataset can be made available to the server. The central principle is directional trust bootstrapping: model updates are admitted to aggregation and weighted proportionally based on their cosine-alignment with the trusted server update, with negative/orthogonal directions suppressed. This approach provides statistical guarantees on tolerance to arbitrary model-poisoning attacks and efficient convergence, with minimal degradation in utility. The protocol is widely referenced as a foundation for secure and privacy-preserving FL mechanisms (Cao et al., 2020, Xia et al., 2024, Xia et al., 2024).

1. System Model, Threat Model, and Motivation

FLTrust assumes a federated system with $n$ clients and a centralized server ("federator"). The server is trusted to enforce protocol steps but may be curious about client data ("honest-but-curious" adversary). The adversary can fully control up to $m < n$ clients (Byzantine clients), enabling them to submit arbitrary model updates, possibly with full knowledge of all honest updates. The goal is to aggregate local client updates $g_i$ to form a robust global model update, even when a substantial fraction is adversarial, while maintaining computational and communication efficiency.

The core motivation is the inadequacy of prior statistical and geometric defense mechanisms when facing adaptive, omniscient adversaries capable of maximizing their attack payloads. In particular, prior robust aggregation rules (e.g., Krum, Trimmed mean, Median) exhibit rapid failure under targeted attacks that are designed with full knowledge of their protocols (Cao et al., 2020).

2. Root-of-Trust Principle and Server Update

FLTrust introduces an explicit "root of trust" by equipping the server with a small, vetted root dataset $D_0$ (typically $\lvert D_0 \rvert \approx 100$ ). In each communication round, the server computes a reference update $g_0$ (or $\Delta_0$ ) by applying stochastic gradient descent steps to $D_0$ . This reference provides a trusted direction in parameter space intended to reflect descent on the true objective, serving as a baseline for evaluating client contributions (Cao et al., 2020, Xia et al., 2024, Xia et al., 2024).

The mechanism assumes the root dataset is free from Byzantine manipulation and is (approximately) representative of the overall task distribution. If the root data is highly biased or poisoned, FLTrust's trust weighting becomes unreliable.

3. Trust Score Computation and Aggregation Rule

The defining operation of FLTrust is the assignment of a trust score to each client update according to its directional alignment with the server's root update. Formally, after receiving normalized update vectors $\bar{g}_i = g_i / \|g_i\|$ (or equivalently, $\overline{\Delta}_i$ ), the server computes: $\cos\theta_i = \langle \bar{g}_0, \bar{g}_i \rangle \in [-1,1]$ and then

$TS_i = \max(0, \cos\theta_i)$

This is the application of the ReLU function to cosine similarity, producing a trust score $TS_i \in [0,1]$ for each client (Cao et al., 2020, Xia et al., 2024, Xia et al., 2024). Client updates with non-positive alignment with respect to the server reference are assigned zero weight and pruned from aggregation.

Aggregation proceeds as a weighted average: $S = \sum_{i=1}^n TS_i \ V = \sum_{i=1}^n TS_i \cdot \bar{g}_i \ g_{\text{agg}} = \frac{V}{S}$ with $g_{\text{agg}}$ further rescaled so that its magnitude matches that of the server's root update, thereby preventing adversarial manipulation via magnitude inflation. The global model is updated as $w \leftarrow w + \alpha g_{\text{agg}}$ (Cao et al., 2020, Xia et al., 2024).

4. Formal Security and Convergence Guarantees

FLTrust is provably robust against arbitrary Byzantine adversaries under standard convexity and concentration hypotheses. Specifically, when the global objective is $\mu$ -strongly convex and gradient noise on $D_0$ is sub-exponential, FLTrust achieves geometric convergence up to error $O(\alpha/\sqrt{|D_0|})$ independent of the Byzantine fraction, as formalized in Theorem 1 of (Cao et al., 2020). The proof leverages the normalization and directional clipping to bound adversarial influence, and concentration inequalities to control the noise from the root dataset.

Importantly, even with 40–60% Byzantine clients, FLTrust maintains test error within $0.01$ of the no-attack FedAvg baseline, outperforming other robust aggregation baselines (Krum, Trim, Median) under various model-poisoning and label-flipping attacks (Cao et al., 2020). Key experimental results are summarized below:

Setting	FLTrust Error	Krum/Trim/Median Error	Under Attack
MNIST, No Attack	$\sim$ 4%	6–10% overhead	No
Krum-Attack	$\sim$ 4%	90%	Yes
Scaling Backdoor	$\leq$ 3%	Up to 100%	Yes

FLTrust's overhead relative to FedAvg is negligible: the server only executes an additional root-model update per round and computes vector inner-products and scaling per client update (Cao et al., 2020, Xia et al., 2024).

5. Limitations and Known Attack Vectors

FLTrust's robustness depends fundamentally on the integrity and representativeness of the root dataset $D_0$ . If $D_0$ is too small or systematically biased, the pruning step can inappropriately down-weight legitimate updates, reducing accuracy or stalling convergence (Cao et al., 2020, Xia et al., 2024, Xia et al., 2024). A sophisticated adversary might craft updates that mimic the direction of $\bar{g}_0$ (so $\cos\theta_i \approx 1$ ) while still embedding malicious payloads orthogonal to benign data directions; magnitude normalization eliminates scaling attacks but does not address this subtle form of collusion.

FLTrust's design is inherently centralized around a single federator and root dataset, limiting its applicability to decentralized or multi-server FL. It assumes that the server does not act maliciously—a threat model not covered by the protocol.

6. Extensions: Privacy-Preserving and Communication-Efficient Variants

Several works have extended the FLTrust codec to offer privacy and better communication complexity. ByITFL (Xia et al., 2024) and LoByITFL (Xia et al., 2024) provide full information-theoretic privacy against both the federator and up to $t$ colluding clients, while retaining Byzantine robustness. These extensions substitute the ReLU trust score with a low-degree polynomial approximation, which is amenable to secure multiparty computation (MPC) techniques using Shamir secret sharing and Beaver triples.

In LoByITFL (Xia et al., 2024), a one-time trusted third party generates the cryptographic material for the MPC (random pads, Beaver triples, MAC keys for integrity), after which all rounds proceed efficiently with only $O(n)$ field-element communications per client per round. The privacy guarantee is perfect: information-theoretic secrecy holds for individual updates against the federator and up to $t$ colluding clients. The core aggregation rule and the trust mechanism remain functionally equivalent to the original FLTrust, with minor differences in the trust function's nonlinearity (polynomial rather than ReLU).

Empirical results on FLTrust and its variants consistently validate their resilience under strong Byzantine and adaptive attacks, with minimal degradation of accuracy and communication efficiency (Cao et al., 2020, Xia et al., 2024, Xia et al., 2024).

7. Summary Table of Core FLTrust Scheme

Step	Operation	Rationale
Root update	Server computes $g_0$ from $D_0$	Establish reference direction
Broadcast	Server shares $w$ with clients	Synchronization
Client local updates	Each computes $g_i$ on $D_i$	Data-local SGD
Normalization	All $g_i \to \bar{g}_i = g_i/\\|g_i\\|$	Block magnitude attacks
Trust score	$TS_i = \max(0, \langle \bar{g}_0, \bar{g}_i \rangle )$	Directional filtering
Weighted aggregation	$g_{\text{agg}}= \left( \sum TS_i \cdot \bar{g}_i \right)/\sum TS_i$	Reduce malicious impact
Magnitude scaling	Rescale $g_{\text{agg}}$ to $\\|g_0\\|$	Consistent step-size
Model update	$w \leftarrow w + \alpha g_{\text{agg}}$	SGD

In summary, FLTrust represents a significant advance in robust federated learning aggregation by operationalizing explicit trust bootstrapping and rigid directional filtering as a defense against arbitrary Byzantine attacks. Its core mechanisms are retained in privacy-enhanced derivatives, and empirical analysis consistently demonstrates high utility and resilience under a range of adversarial conditions (Cao et al., 2020, Xia et al., 2024, Xia et al., 2024).