Certifiably Robust Federated Learning against Backdoor Attacks
This paper addresses a critical challenge in Federated Learning (FL): the susceptibility of global models to backdoor attacks introduced by malicious clients. Traditional FL frameworks aggregate updates from various clients, creating a shared global model. However, these models are at risk of being compromised by clients who inject malicious data or perturbations into the training process. Existing methodologies have not adequately addressed the certification of robustness against such backdoor attacks, thus motivating this work.
The authors present a novel framework termed Certifiably Robust Federated Learning (CRFL), designed to provide robustness certification against backdoor attacks. The certification rests on clipping and smoothing the model parameters, which keeps the global model stable under bounded adversarial influence.
Key Methodologies and Theoretical Contributions
CRFL intervenes at the server during training: in every aggregation round, the global model parameters are clipped to a bounded norm and then perturbed with Gaussian noise. This dual mechanism limits how far deviations caused by potential backdoors can propagate through the federated system.
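To make the training-time mechanism concrete, the following is a minimal sketch of one CRFL-style server round, assuming plain federated averaging as the aggregation rule; the function name, parameter names, and the hyperparameters clip_norm and sigma are illustrative choices, not the authors' exact implementation.

```python
import numpy as np

def crfl_server_round(client_params, clip_norm, sigma, rng):
    """One illustrative server aggregation round with clipping and noising.

    client_params: list of 1-D parameter vectors returned by the clients.
    clip_norm:     upper bound on the L2 norm of the aggregated parameters.
    sigma:         standard deviation of the Gaussian noise added after clipping.
    rng:           a numpy Generator, e.g. np.random.default_rng(0).
    """
    # Average the client models (assumed aggregation rule).
    aggregated = np.mean(client_params, axis=0)

    # Clip the aggregated parameter vector so its L2 norm does not exceed clip_norm.
    norm = np.linalg.norm(aggregated)
    if norm > clip_norm:
        aggregated = aggregated * (clip_norm / norm)

    # Perturb with isotropic Gaussian noise to smooth the global model.
    noise = rng.normal(loc=0.0, scale=sigma, size=aggregated.shape)
    return aggregated + noise
```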
The theoretical foundation of CRFL rests on bounding how close the parameters of a model trained with poisoned updates remain to those of a model trained without them, with closeness measured via KL divergence by viewing each training step as a Markov kernel. This analysis of model stability and prediction consistency yields a bound on the magnitude of backdoor patches that the trained model can tolerate without misclassifying the backdoored test input.
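At inference time, this kind of certification is typically realized through parameter smoothing: the prediction on a test input is a majority vote over copies of the model whose parameters are perturbed with Gaussian noise, and the vote margin feeds into the certified bound. The sketch below is a minimal Monte Carlo version of that idea; the predict interface, the sample count, and the vote rule are assumptions for illustration rather than the paper's exact procedure.

```python
from collections import Counter

def smoothed_prediction(params, x, predict, sigma, num_samples, rng):
    """Majority-vote prediction over Gaussian-perturbed copies of the parameters.

    params:      trained global model parameters (1-D numpy array).
    x:           a single test input.
    predict:     callable mapping (params, x) to a class label.
    sigma:       test-time smoothing noise level.
    num_samples: number of Monte Carlo parameter samples.
    rng:         a numpy Generator.
    """
    votes = Counter()
    for _ in range(num_samples):
        noisy_params = params + rng.normal(0.0, sigma, size=params.shape)
        votes[predict(noisy_params, x)] += 1
    label, count = votes.most_common(1)[0]
    # A certification procedure would convert this vote fraction into a bound
    # on the backdoor-patch magnitude the prediction can tolerate.
    return label, count / num_samples
```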
Arising from this theoretical construct, CRFL provides guarantees that link certified robustness to key parameters of distributed training, such as the ratio of poisoned data, the number of malicious clients, and the total number of training iterations.
Empirical Evaluation and Findings
Empirical evaluations span datasets such as MNIST, EMNIST, and real-world financial data, illustrating CRFL's efficacy. The results show that the derived robustness certifications align well with empirical observations, affirming the practical relevance of the theoretical analysis.
Key findings include the observation that adjusting hyperparameters, like the noise level and clipping norm, significantly affects the robustness-accuracy trade-off, a pivotal insight for tuning federated learning systems under adversarial conditions. The empirical results also underline the critical role of the number of training iterations, showing that increasing the number of benign fine-tuning rounds after a backdoor injection helps in mitigating the impact of adversarial perturbations.
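As a toy illustration of the accuracy side of this trade-off, one can sweep the smoothing noise level on a synthetic linear model and watch smoothed accuracy degrade as the noise grows; everything below (the model, the data, and the sample counts) is hypothetical and unrelated to the paper's experiments.

```python
from collections import Counter
import numpy as np

rng = np.random.default_rng(0)

# Toy linear classifier and synthetic data, used only to illustrate the sweep.
weights = rng.normal(size=10)
inputs = rng.normal(size=(200, 10))
labels = (inputs @ weights > 0).astype(int)

def predict(params, x):
    return int(x @ params > 0)

def smoothed_accuracy(params, xs, ys, sigma, num_samples=50):
    correct = 0
    for x, y in zip(xs, ys):
        votes = Counter(
            predict(params + rng.normal(0.0, sigma, size=params.shape), x)
            for _ in range(num_samples)
        )
        correct += int(votes.most_common(1)[0][0] == y)
    return correct / len(xs)

for sigma in [0.0, 0.1, 1.0, 5.0]:
    acc = smoothed_accuracy(weights, inputs, labels, sigma)
    print(f"sigma={sigma:<4}  smoothed accuracy={acc:.3f}")
```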
Future Implications and Speculation
The introduction of CRFL marks a pivotal step forward in federated learning by incorporating robustness certifications, thereby advancing trust in deploying FL in real-world scenarios where security is paramount. This work lays a foundational framework for further refinement, particularly in tuning certification parameters to balance accuracy and security.
Moreover, the robustness certification framework can evolve with the integration into diverse adversarial threat models and defense strategies, potentially adapting to broader contexts beyond federated learning, such as robust distributed optimization and privacy-preserving machine learning systems.
In conclusion, the paper presents an intricate blend of theoretical synthesis and empirical validation, contributing significantly to the field of federated learning through the lens of security and robustness certifications.