Attention-Driven Multi-Agent Reinforcement Learning: Enhancing Decisions with Expertise-Informed Tasks (2404.05840v3)
Abstract: In this paper, we introduce an alternative approach to enhancing Multi-Agent Reinforcement Learning (MARL) through the integration of domain knowledge and attention-based policy mechanisms. Our methodology focuses on the incorporation of domain-specific expertise into the learning process, which simplifies the development of collaborative behaviors. This approach aims to reduce the complexity and learning overhead typically associated with MARL by enabling agents to concentrate on essential aspects of complex tasks, thus optimizing the learning curve. The utilization of attention mechanisms plays a key role in our model. It allows for the effective processing of dynamic context data and nuanced agent interactions, leading to more refined decision-making. Applied in standard MARL scenarios, such as the Stanford Intelligent Systems Laboratory (SISL) Pursuit and Multi-Particle Environments (MPE) Simple Spread, our method has been shown to improve both learning efficiency and the effectiveness of collaborative behaviors. The results indicate that our attention-based approach can be a viable approach for improving the efficiency of MARL training process, integrating domain-specific knowledge at the action level.
- 2019. Dota 2 with large scale deep reinforcement learning. arXiv preprint arXiv:1912.06680.
- 2023. Benchmarl: Benchmarking multi-agent reinforcement learning. arXiv preprint arXiv:2006.07869.
- 2021. A survey on multi-agent deep reinforcement learning: from the perspective of challenges and applications. Artificial Intelligence Review 54(5):3215–3238.
- 2023. Selectively sharing experiences improves multi-agent reinforcement learning. arXiv preprint arXiv:2311.00865.
- 2022. Towards a standardised performance evaluation protocol for cooperative marl. In Advances in Neural Information Processing Systems, volume 35, 5510–5521.
- 2020. Improving multi-agent reinforcement learning with imperfect human knowledge. In Lecture Notes in Computer Science (including subseries Lecture Notes in Artificial Intelligence and Lecture Notes in Bioinformatics), volume 12397, 369–380. Springer.
- 2023. Marl-lib: A scalable and efficient library for multi-agent reinforcement learning. Journal of Machine Learning Research 24:1–23.
- 2019. Actor-attention-critic for multi-agent reinforcement learning. In International conference on machine learning, 2961–2970. PMLR.
- 2019. Multi-agent deep reinforcement learning with human strategies. In Proceedings of the IEEE International Conference on Industrial Technology, 1357–1362. IEEE.
- 2021. A review on the attention mechanism of deep learning. Neurocomputing 452:48–62.
- 2021. Multi-agent reinforcement learning for resource allocation in iot networks. IEEE Access 9:93533–93546.
- 2021. Benchmarking multi-agent deep reinforcement learning algorithms in cooperative tasks. Thirty-fifth Conference on Neural Information Processing Systems Datasets and Benchmarks Track (Round 1).
- 2021. Hierarchical reinforcement learning for air-to-air combat. In 2021 International Conference on Unmanned Aircraft Systems, ICUAS 2021, 275–284. IEEE.
- 2022. Attention enhanced reinforcement learning for multi agent cooperation. IEEE Transactions on Neural Networks and Learning Systems.
- 2021. Attention based multi-agent intrusion detection systems using reinforcement learning. Journal of Information Security and Applications 61:102923.
- 2021. Greedy unmixing for q-learning in multi-agent reinforcement learning. arXiv preprint arXiv:2109.09034.
- 2021. Pettingzoo: Gym for multi-agent reinforcement learning. In Advances in Neural Information Processing Systems, volume 34, 15032–15043.
- 2017. Attention is all you need. In Advances in Neural Information Processing Systems, volume 30.
- 2019. Grandmaster level in starcraft ii using multi-agent reinforcement learning. Nature 575(7782):350–354.
- 2020. Learning hierarchical behavior and motion planning for autonomous driving. In IEEE International Conference on Intelligent Robots and Systems, 2235–2242. IEEE.
- 2023. A multi-agent flocking collaborative control method for stochastic dynamic environment via graph attention autoencoder based reinforcement learning. Neurocomputing 126379.
- 2021. Optimizing task scheduling in human-robot collaboration with deep multi-agent reinforcement learning. Journal of Manufacturing Systems 60:287–297.
- 2021. Multi-agent reinforcement learning: A selective overview of theories and algorithms. In Handbook of reinforcement learning and control. Springer. 321–384.
- 2021. Towards self-x cognitive manufacturing network: An industrial knowledge graph-based multi-agent reinforcement learning approach. Journal of Manufacturing Systems 60:373–382.