Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

An Efficient Transfer Learning Framework for Multiagent Reinforcement Learning (2002.08030v4)

Published 19 Feb 2020 in cs.MA

Abstract: Transfer Learning has shown great potential to enhance single-agent Reinforcement Learning (RL) efficiency. Similarly, Multiagent RL (MARL) can also be accelerated if agents can share knowledge with each other. However, it remains a problem of how an agent should learn from other agents. In this paper, we propose a novel Multiagent Policy Transfer Framework (MAPTF) to improve MARL efficiency. MAPTF learns which agent's policy is the best to reuse for each agent and when to terminate it by modeling multiagent policy transfer as the option learning problem. Furthermore, in practice, the option module can only collect all agent's local experiences for update due to the partial observability of the environment. While in this setting, each agent's experience may be inconsistent with each other, which may cause the inaccuracy and oscillation of the option-value's estimation. Therefore, we propose a novel option learning algorithm, the successor representation option learning to solve it by decoupling the environment dynamics from rewards and learning the option-value under each agent's preference. MAPTF can be easily combined with existing deep RL and MARL approaches, and experimental results show it significantly boosts the performance of existing methods in both discrete and continuous state spaces.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (12)
  1. Tianpei Yang (25 papers)
  2. Weixun Wang (31 papers)
  3. Hongyao Tang (28 papers)
  4. Jianye Hao (185 papers)
  5. Zhaopeng Meng (23 papers)
  6. Hangyu Mao (37 papers)
  7. Dong Li (429 papers)
  8. Wulong Liu (38 papers)
  9. Chengwei Zhang (19 papers)
  10. Yujing Hu (28 papers)
  11. Yingfeng Chen (30 papers)
  12. Changjie Fan (79 papers)
Citations (18)

Summary

We haven't generated a summary for this paper yet.