Momentum-Based Federated Reinforcement Learning with Interaction and Communication Efficiency (2405.17471v2)
Abstract: Federated Reinforcement Learning (FRL) has garnered increasing attention recently. However, due to the intrinsic spatio-temporal non-stationarity of data distributions, current approaches typically suffer from high interaction and communication costs. In this paper, we introduce a new FRL algorithm, named $\texttt{MFPO}$, that utilizes momentum, importance sampling, and an additional server-side adjustment to control the shift of stochastic policy gradients and enhance the efficiency of data utilization. We prove that, with a proper selection of momentum parameters and interaction frequency, $\texttt{MFPO}$ achieves $\tilde{\mathcal{O}}(H N^{-1}\epsilon^{-3/2})$ interaction complexity and $\tilde{\mathcal{O}}(\epsilon^{-1})$ communication complexity ($N$ denotes the number of agents), where the interaction complexity enjoys a linear speedup in the number of agents and the communication complexity matches the best achievable by existing first-order FL algorithms. Extensive experiments corroborate the substantial performance gains of $\texttt{MFPO}$ over existing methods on a suite of complex and high-dimensional benchmarks.
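To make the update rule the abstract alludes to more concrete, the sketch below illustrates the general recipe of momentum-based variance reduction with importance sampling: a STORM-style momentum estimator of the policy gradient in which an importance weight re-weights the gradient computed at the previous policy, followed by a simple server-side averaging step across the $N$ agents. This is a minimal illustration under our own assumptions (the function names, the plain averaging rule, and the scalar weight are placeholders), not the paper's exact MFPO update.

```python
# Minimal sketch (not the authors' exact algorithm): a STORM-style momentum
# policy-gradient estimator with an importance-sampling correction, plus a
# naive server-side average standing in for the paper's server adjustment.
import numpy as np


def momentum_pg_update(u_prev, grad_curr, grad_prev, is_weight, beta):
    """Blend a fresh stochastic policy gradient with the previous momentum
    term; the importance weight re-weights the gradient taken at the *old*
    policy so it remains consistent with trajectories sampled from the
    *current* policy (this is what limits the gradient shift)."""
    correction = grad_curr - is_weight * grad_prev
    return beta * grad_curr + (1.0 - beta) * (u_prev + correction)


def server_aggregate(agent_momenta):
    """Illustrative server-side step: average the agents' momentum
    estimates into a single global update direction."""
    return np.mean(np.stack(agent_momenta), axis=0)


# Toy usage with random stand-ins for per-agent trajectory gradients.
rng = np.random.default_rng(0)
dim, n_agents = 8, 4            # policy-parameter dimension, number of agents
momenta = []
for _ in range(n_agents):
    u_prev = rng.normal(size=dim)                    # previous momentum estimate
    g_curr, g_prev = rng.normal(size=dim), rng.normal(size=dim)
    momenta.append(momentum_pg_update(u_prev, g_curr, g_prev,
                                      is_weight=0.9, beta=0.2))
print(server_aggregate(momenta))                     # aggregated direction
```

In practice the importance weight would be the likelihood ratio of the sampled trajectory under the old versus the current policy, and the momentum parameter `beta` and the interaction frequency are the knobs whose choice drives the stated complexity bounds.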
- Sheng Yue
- Xingyuan Hua
- Lili Chen
- Ju Ren