Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Non-Stationary Policy Learning for Multi-Timescale Multi-Agent Reinforcement Learning (2307.08794v1)

Published 17 Jul 2023 in cs.LG, cs.AI, cs.MA, cs.SY, and eess.SY

Abstract: In multi-timescale multi-agent reinforcement learning (MARL), agents interact across different timescales. In general, policies for time-dependent behaviors, such as those induced by multiple timescales, are non-stationary. Learning non-stationary policies is challenging and typically requires sophisticated or inefficient algorithms. Motivated by the prevalence of this control problem in real-world complex systems, we introduce a simple framework for learning non-stationary policies for multi-timescale MARL. Our approach uses available information about agent timescales to define a periodic time encoding. In detail, we theoretically demonstrate that the effects of non-stationarity introduced by multiple timescales can be learned by a periodic multi-agent policy. To learn such policies, we propose a policy gradient algorithm that parameterizes the actor and critic with phase-functioned neural networks, which provide an inductive bias for periodicity. The framework's ability to effectively learn multi-timescale policies is validated on a gridworld and building energy management environment.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (4)
  1. Patrick Emami (14 papers)
  2. Xiangyu Zhang (329 papers)
  3. David Biagioni (9 papers)
  4. Ahmed S. Zamzam (23 papers)

Summary

We haven't generated a summary for this paper yet.