
Distributed off-Policy Actor-Critic Reinforcement Learning with Policy Consensus (1903.09255v1)

Published 21 Mar 2019 in cs.LG, cs.AI, math.OC, and stat.ML

Abstract: In this paper, we propose a distributed off-policy actor-critic method to solve multi-agent reinforcement learning problems. Specifically, we assume that all agents keep local estimates of the global optimal policy parameter and update their local value function estimates independently. Then, we introduce an additional consensus step to let all the agents asymptotically achieve agreement on the global optimal policy function. The convergence analysis of the proposed algorithm is provided and the effectiveness of the proposed algorithm is validated using a distributed resource allocation example. Compared to relevant distributed actor-critic methods, here the agents do not share information about their local tasks, but instead they coordinate to estimate the global policy function.
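
The consensus step mentioned in the abstract can be illustrated with a minimal sketch. This is not the authors' algorithm: it shows only generic consensus averaging, in which each agent repeatedly replaces its local policy-parameter estimate with a weighted average of all agents' estimates. The function names, the 3-agent network, the mixing matrix `WEIGHTS`, and the initial parameter values are all assumptions made for illustration, and the local actor and critic updates are omitted.

```python
def consensus_step(thetas, weights):
    """Mix each agent's parameter vector with the others' using row i of `weights`."""
    n, dim = len(thetas), len(thetas[0])
    return [
        [sum(weights[i][j] * thetas[j][k] for j in range(n)) for k in range(dim)]
        for i in range(n)
    ]


def disagreement(thetas):
    """Largest coordinate-wise gap between any agent and the network average."""
    n, dim = len(thetas), len(thetas[0])
    mean = [sum(t[k] for t in thetas) / n for k in range(dim)]
    return max(abs(t[k] - mean[k]) for t in thetas for k in range(dim))


# Hypothetical fully connected 3-agent network with a doubly stochastic
# mixing matrix (an assumption, not taken from the paper).
WEIGHTS = [
    [0.50, 0.25, 0.25],
    [0.25, 0.50, 0.25],
    [0.25, 0.25, 0.50],
]

# Hypothetical local estimates of a 2-dimensional policy parameter.
thetas = [[1.0, 0.0], [0.0, 2.0], [-1.0, 1.0]]
initial_gap = disagreement(thetas)

for _ in range(50):
    # A full method would apply each agent's local actor update before mixing;
    # that part is left out of this sketch.
    thetas = consensus_step(thetas, WEIGHTS)

final_gap = disagreement(thetas)
print(f"disagreement before: {initial_gap:.3f}, after: {final_gap:.2e}")
```

Because the assumed mixing matrix is doubly stochastic, the averaging preserves the network-wide mean of the estimates while the disagreement between agents shrinks at every round.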

Authors (2)
  1. Yan Zhang (954 papers)
  2. Michael M. Zavlanos (65 papers)
Citations (47)
