Coevolutionary Approaches in Deep Multi-agent Reinforcement Learning
The paper by Klijn and Eiben explores deep neuroevolution techniques for multi-agent reinforcement learning (MARL) through the lens of coevolutionary algorithms. Building on the demonstrated efficacy of deep neuroevolution (DNE) in single-agent reinforcement learning, the researchers extend these methods to multi-agent problems, adding a novel dimension to the field by embedding evolution strategies (ES) and genetic algorithms (GA) within a coevolutionary framework.
Key Insights and Methodological Framework
The research applies coevolution, a long-standing principle in evolutionary computation, to evolve strategies for multiple interacting agents. The paper introduces two coevolutionary algorithms: coevolutionary Evolution Strategies (Co-ES) and coevolutionary Genetic Algorithms (Co-GA). These methods were chosen for their scalability and their demonstrated ability to train multi-million-parameter deep neural networks without gradient information, making them well suited to the complexities inherent in MARL scenarios.
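The defining feature of such a coevolutionary GA is that fitness is relative: each individual is scored by playing against opponents sampled from the other population rather than against a fixed environment. Below is a minimal sketch of one generation of this idea, not the authors' implementation; the population sizes, truncation selection, and the `fitness_fn` interface are illustrative assumptions.

```python
import numpy as np

def coga_generation(pop_a, pop_b, fitness_fn, rng,
                    n_elites=2, n_opponents=3, sigma=0.02):
    """One generation of a two-population coevolutionary GA (sketch).

    pop_a, pop_b: lists of flat parameter vectors, one per agent.
    fitness_fn(params, opponent_params) -> scalar score for the first agent.
    Fitness is relative: each individual is averaged over matches against
    opponents sampled from the other population.
    """
    def evaluate(pop, opponents):
        scores = []
        for ind in pop:
            idx = rng.choice(len(opponents), size=n_opponents, replace=False)
            scores.append(np.mean([fitness_fn(ind, opponents[j]) for j in idx]))
        return np.asarray(scores)

    def next_gen(pop, scores):
        order = np.argsort(scores)[::-1]              # best first
        elites = [pop[i] for i in order[:n_elites]]   # kept unchanged
        children = []
        while len(elites) + len(children) < len(pop):
            parent = pop[order[rng.integers(n_elites)]]
            # Mutation-only reproduction: Gaussian parameter perturbation.
            children.append(parent + sigma * rng.standard_normal(parent.shape))
        return elites + children

    scores_a = evaluate(pop_a, pop_b)
    scores_b = evaluate(pop_b, pop_a)
    return next_gen(pop_a, scores_a), next_gen(pop_b, scores_b)
```

With elitism plus small mutations, repeated generations should improve each population relative to the opponents it is evaluated against.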
The authors' methodological contribution lies in adapting ES and GA to incorporate coevolutionary dynamics: in both algorithms, an individual's fitness is determined by its performance against members of a coevolving opponent population rather than against a fixed environment. The approach is validated on a series of multiplayer Atari games, with Ape-X DQN serving as the baseline for comparative analysis.
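For the ES side, the same idea can be expressed as a single OpenAI-ES-style update in which every perturbation of an agent's parameters is scored against the current opponent. This is a sketch under assumed hyperparameters (`pop_size`, `sigma`, `lr`), not the paper's exact update rule.

```python
import numpy as np

def coes_step(theta, opponent_theta, fitness_fn, rng,
              pop_size=50, sigma=0.05, lr=0.1):
    """One ES update for one agent in a coevolutionary pair (sketch).

    Each Gaussian perturbation of theta is evaluated by playing against
    the (fixed) current opponent parameters; the opponent is updated
    symmetrically in its own call. fitness_fn(params, opponent) -> scalar.
    """
    eps = rng.standard_normal((pop_size, theta.size))
    rewards = np.array(
        [fitness_fn(theta + sigma * e, opponent_theta) for e in eps])
    # Rank-normalize rewards so the update is robust to reward scale.
    ranks = rewards.argsort().argsort()
    advantages = ranks / (pop_size - 1) - 0.5
    # Stochastic gradient estimate of expected fitness w.r.t. theta.
    grad = advantages @ eps / (pop_size * sigma)
    return theta + lr * grad
```

Alternating `coes_step` calls for the two agents gives the coevolutionary loop: each update chases a moving target defined by the other agent's latest parameters.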
Computational Environment and Experimental Setup
The neural network architecture aligns closely with established configurations in the reinforcement learning domain, notably the large DQN model. This consistency ensures that the results are attributable to algorithmic innovations rather than differences in model capacity. The paper's computational strategy employs standard frame preprocessing and frame-skipping techniques to reduce input dimensionality and keep training efficient.
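The standard Atari pipeline referenced here typically means flicker removal, grayscale conversion, downsampling to 84x84, and repeating each action over several frames. The sketch below illustrates that convention with plain numpy and a hypothetical `env.step` interface; the paper's exact pipeline may differ (real implementations usually use bilinear resizing, e.g. via OpenCV).

```python
import numpy as np

def preprocess(frame, prev_frame, out_size=84):
    """DQN-style frame preprocessing (sketch)."""
    # Pixel-wise max over two consecutive frames removes sprites that
    # Atari renders only on alternating frames.
    merged = np.maximum(frame, prev_frame)
    # RGB -> luminance.
    gray = merged @ np.array([0.299, 0.587, 0.114])
    # Naive nearest-neighbour downsample to out_size x out_size.
    h, w = gray.shape
    rows = np.arange(out_size) * h // out_size
    cols = np.arange(out_size) * w // out_size
    return gray[np.ix_(rows, cols)].astype(np.uint8)

def frame_skip_step(env, action, k=4):
    """Repeat an action for k frames, summing rewards (sketch).

    Assumes a hypothetical env.step(action) -> (frame, reward, done).
    """
    total, frames = 0.0, []
    for _ in range(k):
        frame, reward, done = env.step(action)
        total += reward
        frames.append(frame)
        if done:
            break
    prev = frames[-2] if len(frames) > 1 else frames[-1]
    return preprocess(frames[-1], prev), total, done
```

Frame skipping cuts the number of network forward passes per episode roughly by a factor of k while leaving the game dynamics intact.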
Hyperparameters are tuned separately for Co-ES and Co-GA to reflect their distinct optimization dynamics, and both algorithms perform strongly in environments characterized by dynamic multi-agent interactions.
Results and Comparative Analysis
The results underscore the efficacy of coevolutionary methods, with Co-ES and Co-GA outperforming Ape-X DQN in several test environments. For instance, in tasks requiring adaptive interactions, such as Combat and Joust, the coevolutionary techniques demonstrate superior capability in evolving functional strategies.
However, the paper also acknowledges limitations in sparse-reward scenarios, where gradient-based RL methods may retain an edge thanks to denser learning signals from value bootstrapping. This suggests an area ripe for further refinement, possibly through hybrid models that combine the strengths of gradient-based and evolutionary approaches.
Implications and Future Directions
This research contributes to the broader understanding of MARL by illustrating how coevolution can enhance agent performance in non-stationary environments. Its implications are twofold: practically, it offers a robust alternative for complex multi-agent environments where traditional RL methods face scalability issues; theoretically, it invites further exploration into hybrid models that combine evolutionary and RL paradigms.
Future research could expand upon these foundations by exploring varied coevolutionary dynamics, leveraging historical data through innovative Hall of Fame mechanisms, or incorporating Pareto optimization strategies to refine agent behavior further.
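A Hall of Fame is the most concrete of these extensions: archived past champions are mixed into evaluation so that agents cannot overfit to the current opponent alone. The sketch below shows one common variant of this scheme; the function names and sampling policy are illustrative assumptions, not necessarily what the paper or future work would use.

```python
import numpy as np

def evaluate_with_hall_of_fame(agent, current_opponent, hall_of_fame,
                               fitness_fn, rng, n_past=2):
    """Score an agent against the current opponent plus sampled past
    champions (sketch of one common Hall of Fame scheme).

    fitness_fn(agent, opponent) -> scalar; hall_of_fame is a list of
    archived opponent parameters. Averaging over past champions
    discourages cyclic strategies that only beat the latest opponent.
    """
    opponents = [current_opponent]
    if hall_of_fame:
        k = min(n_past, len(hall_of_fame))
        idx = rng.choice(len(hall_of_fame), size=k, replace=False)
        opponents += [hall_of_fame[i] for i in idx]
    return float(np.mean([fitness_fn(agent, o) for o in opponents]))
```

After each generation, the best individual of each population would be appended to its opponents' Hall of Fame, so the archive grows into a curriculum of historical strategies.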
In conclusion, this paper effectively positions coevolution as a promising paradigm within deep MARL, advocating for continued interdisciplinary exploration that bridges evolutionary computation and reinforcement learning. The insights offered here pave the way for more nuanced algorithmic designs that can address the emergent complexities of cooperative and competitive multi-agent ecosystems.