Introduction
Within the field of multi-agent systems, efficiently navigating autonomous agents toward specific goals, particularly in contexts where multiple entities operate independently, presents an intricate challenge. Classical methods, based in planning, come with their limitations when it comes to computation overhead and flexibility. Reinforcement learning (RL) alternatives offer promise in this area, providing robust representation capabilities; however, these models encounter difficulties with data efficiency and cooperation.
Hierarchical Framework and GNN
The Multi-Agent Scalable GNN-based Planner (MASP) is built around a hierarchical framework that effectively reduces the high-dimensional search space involved in navigation tasks through its division into smaller manageable regions. This structure significantly accelerates the convergence of training and boosts data efficiency. To better facilitate cooperation and goal attainment among agents, MASP integrates Graph Neural Networks (GNN), which enable a deep understanding of the inter-agent relationships and interactions with goals.
MASP is comprised of two key components:
- Multi-Goal Matcher (MGM): It employs a decentralized graph matching strategy that assigns the most appropriate goals to agents at each global step.
- Coordinated Action Executor (CAE): With a Graph Merger and Goal Encoder, this component captures the essential correlation between agents and their assigned goals, promoting synergistic cooperation.
Experimental Performance
Empirically, MASP demonstrates superior performance compared to existing planning-based methods and RL competitors. In environments like MPE and Omnidrones that accommodate large groups of agents, MASP achieves nearly perfect success rates with minimal steps taken. Notably, in challenging 3D simulations involving up to 20 agents, MASP displays striking generalization abilities, as it performs effectively even in scenarios composed of unseen team sizes.
Conclusion
MASP validates its efficiency in establishing cooperative strategies and its adaptability to complex and dynamic environmental conditions. It does so while also demonstrating strong generalization capabilities and impressive performance in scenarios with large numbers of agents. This makes MASP a compelling approach for decentralized multi-agent navigation tasks and opens avenues for broader applications in multi-agent systems.