Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learning Reward Machines in Cooperative Multi-Agent Tasks (2303.14061v4)

Published 24 Mar 2023 in cs.AI, cs.MA, and cs.SC

Abstract: This paper presents a novel approach to Multi-Agent Reinforcement Learning (MARL) that combines cooperative task decomposition with the learning of reward machines (RMs) encoding the structure of the sub-tasks. The proposed method helps deal with the non-Markovian nature of the rewards in partially observable environments and improves the interpretability of the learnt policies required to complete the cooperative task. The RMs associated with each sub-task are learnt in a decentralised manner and then used to guide the behaviour of each agent. By doing so, the complexity of a cooperative multi-agent problem is reduced, allowing for more effective learning. The results suggest that our approach is a promising direction for future research in MARL, especially in complex environments with large state spaces and multiple agents.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Leo Ardon (15 papers)
  2. Daniel Furelos-Blanco (9 papers)
  3. Alessandra Russo (48 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.