- The paper reviews the application of deep learning techniques across various video game genres, assessing their capabilities, limitations, and challenges.
- It details specific methods like DQN, A3C, DDPG, and their variants, showcasing how they address genre-specific issues in games from Atari to StarCraft.
- The review highlights limitations such as computational demands and model robustness, outlining future directions including handling sparse rewards, developing general agents, and industry applications.
Deep Learning for Video Game Playing
The paper "Deep Learning for Video Game Playing" systematically reviews the application of deep learning techniques across various genres of video games, including arcade, racing, first-person shooters, open-world, real-time strategy, team sports, physics, and text adventure games. It assesses the capabilities and limitations of different deep learning models in tackling the unique challenges posed by these diverse genres, emphasizing the computational demands, decision-making complexities, and the handling of sparse rewards endemic to video game environments.
Summary of Techniques and Results
The document comprehensively catalogues deep learning strategies such as Deep Q-Network (DQN), Asynchronous Advantage Actor-Critic (A3C), and Deep Deterministic Policy Gradient (DDPG), detailing how each is adapted to the distinct requirements of different game challenges. DQN first demonstrated the approach's effectiveness by reaching, and in several cases exceeding, human-level performance on Atari 2600 games, using a convolutional neural network to approximate Q-values directly from screen pixels. Extensions such as Double DQN, Dueling DQN, and Rainbow have improved on these results through refined network architectures, learning targets, and exploration strategies, raising scores across many of the benchmark games.
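To make the pixel-to-action mapping concrete, below is a minimal PyTorch-style sketch of a DQN value network together with a Double-DQN target computation. The layer sizes follow the commonly used Atari setup (84x84 grayscale frames, four-frame stacks), but the hyperparameters and helper names (`DQN`, `double_dqn_targets`) are illustrative assumptions, not taken from the paper.

```python
import torch
import torch.nn as nn


class DQN(nn.Module):
    """Convolutional Q-network mapping stacked Atari frames to per-action values."""

    def __init__(self, num_actions: int, in_frames: int = 4):
        super().__init__()
        self.features = nn.Sequential(  # expects 84x84 grayscale frames
            nn.Conv2d(in_frames, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        self.head = nn.Sequential(
            nn.Linear(64 * 7 * 7, 512), nn.ReLU(),
            nn.Linear(512, num_actions),  # one Q-value per discrete action
        )

    def forward(self, frames: torch.Tensor) -> torch.Tensor:
        return self.head(self.features(frames / 255.0))  # scale raw pixels to [0, 1]


def double_dqn_targets(reward, done, next_q_online, next_q_target, gamma=0.99):
    """Double-DQN target: the online net picks the action, the target net evaluates it."""
    best = next_q_online.argmax(dim=1, keepdim=True)
    bootstrap = next_q_target.gather(1, best).squeeze(1)
    return reward + gamma * (1.0 - done) * bootstrap
```

Decoupling action selection from action evaluation in the target, as Double DQN does, is what counteracts the Q-value overestimation that plain DQN tends to suffer from.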
For more demanding 3D genres like first-person shooters, approaches such as DRQN (combining recurrent neural networks with Q-learning) address partial observability by maintaining a memory of past observations, while frameworks like IMPALA scale learning across many parallel game instances through an actor-learner architecture. Direct Future Prediction (DFP), demonstrated in Doom, offers an alternative to standard reinforcement learning: it uses supervised learning to predict future in-game measurements and selects the action whose predicted outcome best matches the current goal.
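The recurrent idea behind DRQN can be sketched in the same style: a convolutional encoder processes each observation, and an LSTM carries state across timesteps so the Q-values can depend on observations the agent no longer sees. This is a simplified sketch under the same Atari-style input assumptions as above; the class and argument names are illustrative.

```python
import torch
import torch.nn as nn


class DRQN(nn.Module):
    """Recurrent Q-network: an LSTM over per-frame CNN features handles partial observability."""

    def __init__(self, num_actions: int, hidden: int = 512):
        super().__init__()
        self.cnn = nn.Sequential(
            nn.Conv2d(1, 32, kernel_size=8, stride=4), nn.ReLU(),
            nn.Conv2d(32, 64, kernel_size=4, stride=2), nn.ReLU(),
            nn.Conv2d(64, 64, kernel_size=3, stride=1), nn.ReLU(),
            nn.Flatten(),
        )
        self.lstm = nn.LSTM(64 * 7 * 7, hidden, batch_first=True)
        self.q = nn.Linear(hidden, num_actions)

    def forward(self, frames: torch.Tensor, hidden_state=None):
        # frames: (batch, time, 1, 84, 84) -- a sequence of single observations
        b, t = frames.shape[:2]
        feats = self.cnn(frames.flatten(0, 1)).view(b, t, -1)
        out, hidden_state = self.lstm(feats, hidden_state)
        return self.q(out), hidden_state  # Q-values per timestep, plus recurrent state
```

Carrying the recurrent state forward at evaluation time is what lets the agent act on information, such as an enemy that just moved out of view, that a single frame no longer contains.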
In complex real-time strategy games like StarCraft, progress has been slower and has focused mainly on sub-tasks such as unit micromanagement and build-order planning, using techniques like Counterfactual Multi-Agent (COMA) policy gradients and hierarchical learning models. Text-based adventure games have advanced through language-informed approaches such as the Deep Reinforcement Relevance Network (DRRN), which embeds the state description and each candidate action separately and scores their relevance to one another, enabling better decision making in these narrative-rich environments.
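A DRRN-style scorer can be illustrated with a small sketch: the state text and each candidate action text are embedded by separate networks, and their interaction (here an inner product) yields one Q-value per action. The original work uses richer text encoders; the mean-pooled bag-of-embeddings and the names below are simplifying assumptions for illustration.

```python
import torch
import torch.nn as nn


class DRRN(nn.Module):
    """Scores each textual action against the textual state via an inner product of embeddings."""

    def __init__(self, vocab_size: int, embed_dim: int = 64, hidden: int = 128):
        super().__init__()
        self.embed = nn.Embedding(vocab_size, embed_dim, padding_idx=0)
        self.state_net = nn.Sequential(nn.Linear(embed_dim, hidden), nn.ReLU())
        self.action_net = nn.Sequential(nn.Linear(embed_dim, hidden), nn.ReLU())

    def forward(self, state_tokens: torch.Tensor, action_tokens: torch.Tensor) -> torch.Tensor:
        # state_tokens: (1, Ls) token ids; action_tokens: (num_actions, La) token ids
        s = self.state_net(self.embed(state_tokens).mean(dim=1))    # (1, hidden)
        a = self.action_net(self.embed(action_tokens).mean(dim=1))  # (num_actions, hidden)
        return a @ s.squeeze(0)  # one Q-value per candidate action
```

Keeping separate encoders for state and action text lets the network handle action sets that change from one game state to the next, which is the norm in text adventures.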
Implications and Future Directions
The review underscores that despite agents surpassing human performance in many video games, several limitations persist. Deep learning models require vast computational resources and long training times, which may not be feasible for all commercial applications. Moreover, they generalize poorly: adapting a trained model to novel or diverse game environments typically still demands substantial retraining.
The document highlights potential future directions, such as improving how models handle sparse, deceptive, or delayed rewards, and developing general game-playing agents capable of adapting across genres. Multi-agent learning is presented as an attractive but challenging frontier, with the potential to transform how bots interact within the complex ecosystems of modern video games. These challenges also open avenues for lifelong adaptation and human-like behavior in game-playing agents, making them more believable and engaging in multiplayer settings.
In the industry context, adopting deep learning for non-play aspects like game testing and procedural content generation remains underexplored but promising. Effective translation of these advanced AI systems into interactive tools that game developers can integrate seamlessly into the development pipeline is crucial. The paper suggests leveraging preference-based learning and interactive design collaborations as a bridge to wider adoption in the games industry.
Conclusion
Overall, this paper acts as a valuable resource for researchers aiming to enhance AI proficiency in video game environments. It not only outlines state-of-the-art achievements but also calls attention to the demanding areas where current techniques fall short, providing clear paths for future exploration and refinement. With ongoing innovation and interdisciplinary collaboration, deep learning is poised not only to redefine game-playing AI but also to contribute to broader applications in AI development.