Towards Trustworthy Federated Learning with Untrusted Participants

Published 3 May 2025 in cs.LG, cs.CR, and cs.DC | (2505.01874v2)

Abstract: Resilience against malicious participants and data privacy are essential for trustworthy federated learning, yet achieving both with good utility typically requires the strong assumption of a trusted central server. This paper shows that a significantly weaker assumption suffices: each pair of participants shares a randomness seed unknown to others. In a setting where malicious participants may collude with an untrusted server, we propose CafCor, an algorithm that integrates robust gradient aggregation with correlated noise injection, using shared randomness between participants. We prove that CafCor achieves strong privacy-utility trade-offs, significantly outperforming local differential privacy (DP) methods, which do not make any trust assumption, while approaching central DP utility, where the server is fully trusted. Empirical results on standard benchmarks validate CafCor's practicality, showing that privacy and robustness can coexist in distributed systems without sacrificing utility or trusting the server.

Abstract PDF Upgrade to Chat

Authors (3)

Summary

Overview of "Towards Trustworthy Federated Learning with Untrusted Participants"

In the domain of distributed machine learning, federated learning has emerged as a crucial approach, particularly for scenarios necessitating privacy, such as healthcare where sensitive data is involved. The paper "Towards Trustworthy Federated Learning with Untrusted Participants" addresses the inherent challenges of preserving privacy and resilience against adversarial attacks in federated learning structures. The study is undertaken in the context where a central server, often employed for managing distributed computations, cannot be blindly trusted. This necessitates alternative architectures that can maintain high utility without the assumption of trustworthiness attributed to the server.

Core Contributions

This paper introduces CafCor, an algorithm designed to operate under the paradigm of secret-based local differential privacy (SecLDP). It is tailored to ensure both privacy and robustness amidst potential adversaries. The uniqueness of CafCor resides in its ability to forego the need for a central trusted entity by employing a shared randomness scheme among participating workers. This functionality is achieved through a correlated noise injection coupled with robust gradient aggregation techniques. The authors establish the algorithm’s effectiveness against typical threats such as colluding malicious workers and an unscrupulous server.

Technical Innovation

Correlated Noise Mechanism: CafCor leverages shared randomness between pairs of workers to generate correlated, cancelable noise, allowing it to approach the central differential privacy (CDP) levels of utility while adhering to privacy constraints.
CAF Aggregation: A novel aggregation method, Covariance-bound Agnostic Filter (CAF), designed to handle adversarial impact without requiring knowledge of honest inputs' covariance. CAF effectively lowers the influence of Byzantine workers, thus optimizing the resilience of the federated learning process.

Theoretical Implications

The paper rigorously analyzes the privacy-utility trade-off achieved by CafCor, demonstrating near-CDP performance when the server and malicious workers collude to some extent. The theoretical frameworks for resilience are built upon assumptions standard in federated learning, ensuring practical applicability. The proposed robust aggregation method eschews prior requirements of covariance bounds, broadening the applicability in realistic heterogeneous datasets.

Empirical Evaluation

Empirical data, derived from standard benchmarks such as MNIST and Fashion-MNIST datasets, corroborates CafCor’s theoretical guarantees. The experiments illustrate its advantage over local differential privacy (LDP) based methods and show comparable performance levels to centralized differential privacy under controlled conditions.

Concluding Remarks and Future Directions

CafCor establishes a new benchmark in federated learning by demonstrating that robustness and privacy need not be mutually exclusive under non-trusted architectures. This paper lays the groundwork for further exploration in optimizing federated learning in environments fraught with adversarial risks and privacy concerns. Future developments may focus on minimizing computational overhead and extending the correlated noise approach to broader communication models to enhance scalability across larger federations.