FedSpeed: Larger Local Interval, Less Communication Round, and Higher Generalization Accuracy (2302.10429v2)
Abstract: Federated learning is an emerging distributed machine learning framework that jointly trains a global model across a large number of local devices while preserving data privacy. Its performance suffers from the non-vanishing biases introduced by inconsistent local optima and from the severe client drift caused by local over-fitting. In this paper, we propose a novel and practical method, FedSpeed, to alleviate the negative impacts of these problems. Concretely, FedSpeed applies a prox-correction term to the current local updates to efficiently reduce the bias introduced by the prox-term, a regularizer that is necessary to maintain strong local consistency. Furthermore, FedSpeed merges the vanilla stochastic gradient with a perturbation computed from an extra gradient ascent step in the neighborhood, thereby alleviating local over-fitting. Our theoretical analysis indicates that the convergence rate depends on both the number of communication rounds $T$ and the local interval $K$, with an upper bound of $\mathcal{O}(1/T)$ when the local interval is set properly. Moreover, we conduct extensive experiments on real-world datasets to demonstrate the efficiency of our proposed FedSpeed, which converges significantly faster than several baselines and achieves state-of-the-art (SOTA) performance under general FL experimental settings. Our code is available at \url{https://github.com/woodenchild95/FL-Simulator.git}.
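To make the local update described in the abstract concrete, below is a minimal NumPy sketch of one client's round. It is not the authors' reference implementation (that lives in the linked repository): `grad_fn`, the mixing weight `alpha`, the ascent radius `rho`, the prox weight `lam`, and the correction state `h` are illustrative names and hyperparameters chosen here for exposition. The perturbed gradient plays the SAM-style role the abstract describes, the prox-term anchors the local iterates to the received global model, and `h` is carried across rounds to offset the bias that regularizer introduces.

```python
import numpy as np

def fedspeed_local_update(x_global, h, grad_fn, K=10, lr=0.1,
                          rho=0.05, alpha=0.5, lam=0.1):
    """One client's local round, sketched from the abstract's description.

    x_global : parameters received from the server this round
    h        : this client's prox-correction term, carried across rounds
    grad_fn  : returns a (stochastic) gradient at a given point
    """
    x = x_global.copy()
    for _ in range(K):
        g = grad_fn(x)                           # vanilla stochastic gradient
        # extra gradient ascent step in a rho-neighborhood (SAM-style)
        x_adv = x + rho * g / (np.linalg.norm(g) + 1e-12)
        g_adv = grad_fn(x_adv)                   # perturbed gradient
        g_mix = (1 - alpha) * g + alpha * g_adv  # merged update direction
        # prox-term pulls iterates toward the global model for consistency;
        # subtracting h offsets the bias this regularizer introduces
        x = x - lr * (g_mix + lam * (x - x_global) - h)
    # refresh the correction term with this round's residual drift
    h = h - lam * (x - x_global)
    return x, h

# toy usage: one client with quadratic objective f(x) = 0.5 * ||x - b||^2
b = np.array([1.0, -2.0])
grad_fn = lambda x: x - b
x_g, h = np.zeros(2), np.zeros(2)
for _ in range(5):                 # a few communication rounds
    x_g, h = fedspeed_local_update(x_g, h, grad_fn)
print(x_g)                         # approaches the client optimum b
```

In a full simulation the server would average the returned `x` over the sampled clients each round; the single-client loop above only illustrates how the perturbation, prox-term, and correction term interact within one local interval.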
Authors: Yan Sun, Li Shen, Tiansheng Huang, Liang Ding, Dacheng Tao