Move Evaluation in Go Using Deep Convolutional Neural Networks (1412.6564v2)

Published 20 Dec 2014 in cs.LG and cs.NE

Abstract: The game of Go is more challenging than other board games, due to the difficulty of constructing a position or move evaluation function. In this paper we investigate whether deep convolutional networks can be used to directly represent and learn this knowledge. We train a large 12-layer convolutional neural network by supervised learning from a database of human professional games. The network correctly predicts the expert move in 55% of positions, equalling the accuracy of a 6 dan human player. When the trained convolutional network was used directly to play games of Go, without any search, it beat the traditional search program GnuGo in 97% of games, and matched the performance of a state-of-the-art Monte-Carlo tree search that simulates a million positions per move.

Citations (134)

View on Semantic Scholar

Summary

The paper's main contribution is demonstrating that a 12-layer deep CNN can achieve 55% move prediction accuracy, comparable to a 6 dan human player.
It shows that using the CNN alone yields a 97% win rate against GnuGo, matching the performance of Monte-Carlo tree search programs.
The study reveals that integrating CNNs with Monte-Carlo tree search enhances performance by 87%, highlighting the promise of hybrid AI strategies.

Deep Convolutional Neural Networks for Move Evaluation in Go

In recent years, the use of deep learning techniques has significantly altered the landscape of artificial intelligence applied to strategic games like Go. The paper "Move Evaluation in Go Using Deep Convolutional Neural Networks" thoroughly investigates the application of deep convolutional neural networks (CNNs) to directly represent and learn Go knowledge. The research focuses on developing a system capable of predicting expert moves and compares its efficacy to traditional and Monte-Carlo tree search methods.

Insights and Results

The game of Go presents a unique challenge due to its complexity and the difficulty associated with constructing a reliable move evaluation function. This paper trains a 12-layer CNN using supervised learning from a database of professional Go games, resulting in a notable prediction accuracy of 55%, comparable to a 6 dan human player. This performance is particularly noteworthy when compared to previous methods, which achieved significantly lower prediction accuracy rates.

Remarkably, when using the CNN to play Go without any search techniques, it achieved an impressive 97% win rate against GnuGo, a traditional search-based Go program. It also matched the performance of Monte-Carlo tree search programs such as Pachi and MoGo, which simulate large numbers of positions per move. These implementations highlight the potential of deep learning architectures to encapsulate and utilize strategic knowledge in complex decision-making environments.

The authors further enhance the system's efficiency by integrating the CNN with Monte-Carlo tree search in an asynchronous evaluation setup. This integration shows an 87% improvement in games against the baseline CNN. This combination suggests a promising direction for future research, wherein a hybrid approach can capitalize on the strengths of both deep learning and search methodologies.

Theoretical and Practical Implications

The use of deep CNNs in Go addresses longstanding challenges in constructing evaluative functions capable of assessing move quality without explicit search. By demonstrating that a substantial increase in network depth and size yields superior performance, the paper provides compelling evidence that powerful move evaluation functions can indeed be represented and learned through deep neural networks.

This advancement has practical implications beyond the immediate scope of Go. The findings may extend to other domains requiring complex decision-making within vast state spaces, suggesting that similar neural network architectures could potentially benefit various AI applications.

Future Developments

The integration of CNNs with Monte-Carlo tree search is still in its early stages, and further research could explore more efficient ways to synchronize these evaluations. As parallel processing capabilities, like those of GPUs, improve, opportunities for enhancing scalable planning and evaluation functions will expand. This trajectory could pave the way for even stronger AI systems with near-human common sense reasoning and tactical problem-solving skills.

In conclusion, this paper contributes significantly to understanding how deep CNNs can serve strategic applications. It lays a solid foundation for ongoing exploration into integrating deep learning with traditional AI methods to push the boundaries of artificial intelligence in strategic environments.

PDF Markdown

Related Papers

YouTube

Show All Videos