Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Solving Transition-Independent Multi-agent MDPs with Sparse Interactions (Extended version) (1511.09047v2)

Published 29 Nov 2015 in cs.AI and cs.MA

Abstract: In cooperative multi-agent sequential decision making under uncertainty, agents must coordinate to find an optimal joint policy that maximises joint value. Typical algorithms exploit additive structure in the value function, but in the fully-observable multi-agent MDP setting (MMDP) such structure is not present. We propose a new optimal solver for transition-independent MMDPs, in which agents can only affect their own state but their reward depends on joint transitions. We represent these dependencies compactly in conditional return graphs (CRGs). Using CRGs the value of a joint policy and the bounds on partially specified joint policies can be efficiently computed. We propose CoRe, a novel branch-and-bound policy search algorithm building on CRGs. CoRe typically requires less runtime than the available alternatives and finds solutions to problems previously unsolvable.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Joris Scharpff (1 paper)
  2. Diederik M. Roijers (40 papers)
  3. Frans A. Oliehoek (56 papers)
  4. Matthijs T. J. Spaan (28 papers)
  5. Mathijs M. de Weerdt (10 papers)
Citations (32)

Summary

We haven't generated a summary for this paper yet.