Graph Neural Network-based Multi-agent Reinforcement Learning for Resilient Distributed Coordination of Multi-Robot Systems (2403.13093v1)
Abstract: Existing multi-agent coordination techniques are often fragile and vulnerable to anomalies such as agent attrition and communication disturbances, which are quite common in the real-world deployment of systems like field robotics. To better prepare these systems for the real world, we present a graph neural network (GNN)-based multi-agent reinforcement learning (MARL) method for resilient distributed coordination of a multi-robot system. Our method, Multi-Agent Graph Embedding-based Coordination (MAGEC), is trained using multi-agent proximal policy optimization (PPO) and enables distributed coordination around global objectives under agent attrition, partial observability, and limited or disturbed communications. We use a multi-robot patrolling scenario to demonstrate our MAGEC method in a ROS 2-based simulator and then compare its performance with prior coordination approaches. Results demonstrate that MAGEC outperforms existing methods in several experiments involving agent attrition and communication disturbance, and provides competitive results in scenarios without such anomalies.
- E. Taranta, A. Seiwert, A. Goeckner, K. Nguyen, and E. Cherry, “From Warfighting Needs to Robot Actuation: A Complete Rapid Integration Swarming Solution,” FR, vol. 3, pp. 460–515, Jan. 2023.
- J. Gilmer, S. S. Schoenholz, P. F. Riley, O. Vinyals, and G. E. Dahl, “Neural Message Passing for Quantum Chemistry,” in Proceedings of the 34th International Conference on Machine Learning, pp. 1263–1272, PMLR, July 2017.
- T. Yao, Y. Wang, K. Zhang, and S. Liang, “Improving the Expressiveness of K-hop Message-Passing GNNs by Injecting Contextualized Substructure Information,” in KDD, KDD ’23, (New York, NY, USA), pp. 3070–3081, Association for Computing Machinery, Aug. 2023.
- W. L. Hamilton, R. Ying, and J. Leskovec, “Inductive Representation Learning on Large Graphs,” Sept. 2018.
- K. Xu, C. Li, Y. Tian, T. Sonobe, K.-i. Kawarabayashi, and S. Jegelka, “Representation Learning on Graphs with Jumping Knowledge Networks,” in Proceedings of the 35th International Conference on Machine Learning, pp. 5453–5462, PMLR, July 2018.
- Y. Zhou, H. Huo, Z. Hou, L. Bu, J. Mao, Y. Wang, X. Lv, and F. Bu, “Co-embedding of edges and nodes with deep graph convolutional neural networks,” Sci Rep, vol. 13, p. 16966, Oct. 2023.
- S. Zhang, K. Garg, and C. Fan, “Neural graph control barrier functions guided distributed collision-avoidance multi-agent control,” in 7th Annual Conference on Robot Learning, 2023.
- S. Zhang, O. So, K. Garg, and C. Fan, “GCBF+: A Neural Graph Control Barrier Function Framework for Distributed Safe Multi-Agent Control,” Jan. 2024.
- N. Naderializadeh, F. H. Hung, S. Soleyman, and D. Khosla, “Graph Convolutional Value Decomposition in Multi-Agent Reinforcement Learning,” Feb. 2021.
- S. Ding, W. Du, L. Ding, J. Zhang, L. Guo, and B. An, “Multiagent Reinforcement Learning With Graphical Mutual Information Maximization,” IEEE Transactions on Neural Networks and Learning Systems, pp. 1–10, 2023.
- Y. Hu, J. Fu, and G. Wen, “Graph Soft Actor–Critic Reinforcement Learning for Large-Scale Distributed Multirobot Coordination,” IEEE Transactions on Neural Networks and Learning Systems, pp. 1–12, 2023.
- D. Portugal and R. P. Rocha, “Distributed multi-robot patrol: A scalable and fault-tolerant framework,” Robotics and Autonomous Systems, vol. 61, pp. 1572–1587, Dec. 2013.
- D. Portugal and R. P. Rocha, “Cooperative multi-robot patrol with Bayesian learning,” Auton Robot, vol. 40, pp. 929–953, June 2016.
- A. Farinelli, L. Iocchi, and D. Nardi, “Distributed on-line dynamic task assignment for multi-robot patrolling,” Auton Robot, vol. 41, pp. 1321–1345, Aug. 2017.
- B. Wiandt, V. Simon, and A. Kőkuti, “Self-organized graph partitioning approach for multi-agent patrolling in generic graphs,” in IEEE EUROCON 2017 -17th International Conference on Smart Technologies, pp. 605–610, July 2017.
- K. Kobayashi, S. Ueno, and T. Higuchi, “Multi-Robot Patrol Algorithm with Distributed Coordination and Consciousness of the Base Station’s Situation Awareness,” Oct. 2023.
- H. ElGibreen and K. Youcef-Toumi, “Dynamic task allocation in an uncertain environment with heterogeneous multi-agents,” Auton Robot, vol. 43, pp. 1639–1664, Oct. 2019.
- A. Goeckner, X. Li, E. Wei, and Q. Zhu, “Attrition-Aware Adaptation for Multi-Agent Patrolling,” Jan. 2024.
- L. Guo, H. Pan, X. Duan, and J. He, “Balancing Efficiency and Unpredictability in Multi-robot Patrolling: A MARL-Based Approach,” in 2023 IEEE International Conference on Robotics and Automation (ICRA), pp. 3504–3509, May 2023.
- C. Tong, A. Harwood, M. A. Rodriguez, and R. O. Sinnott, “An Energy-aware and Fault-tolerant Deep Reinforcement Learning based approach for Multi-agent Patrolling Problems,” June 2023.
- C. Boutilier, “Planning, learning and coordination in multiagent decision processes,” in Proceedings of the 6th Conference on Theoretical Aspects of Rationality and Knowledge, TARK ’96, (San Francisco, CA, USA), pp. 195–210, Morgan Kaufmann Publishers Inc., Mar. 1996.
- C. Yu, A. Velu, E. Vinitsky, J. Gao, Y. Wang, A. Bayen, and Y. Wu, “The Surprising Effectiveness of PPO in Cooperative Multi-Agent Games,” Advances in Neural Information Processing Systems, vol. 35, pp. 24611–24624, Dec. 2022.
- S. Nayak, K. Choi, W. Ding, S. Dolan, K. Gopalakrishnan, and H. Balakrishnan, “Scalable Multi-Agent Reinforcement Learning through Intelligent Information Aggregation,” May 2023.
- H. Yoshitake and P. Abbeel, “The Impact of Overall Optimization on Warehouse Automation,” Aug. 2023.
- J. Terry, B. Black, N. Grammel, M. Jayakumar, A. Hari, R. Sullivan, L. S. Santos, C. Dieffendahl, C. Horsch, R. Perez-Vicente, N. Williams, Y. Lokesh, and P. Ravi, “PettingZoo: Gym for Multi-Agent Reinforcement Learning,” in Advances in Neural Information Processing Systems, vol. 34, pp. 15032–15043, Curran Associates, Inc., 2021.
- S. Macenski, T. Foote, B. Gerkey, C. Lalancette, and W. Woodall, “Robot Operating System 2: Design, architecture, and uses in the wild,” Science Robotics, May 2022.