Papers

Topics

Authors

Recent

View all

Assistant

AI Research Assistant

Well-researched responses based on relevant abstracts and paper content.

Custom Instructions Pro

Preferences or requirements that you'd like Emergent Mind to consider when generating responses.

Gemini 2.5 Flash

Gemini 2.5 Flash 74 tok/s

Gemini 2.5 Pro 55 tok/s Pro

GPT-5 Medium 19 tok/s Pro

GPT-5 High 24 tok/s Pro

GPT-4o 109 tok/s Pro

Kimi K2 212 tok/s Pro

GPT OSS 120B 464 tok/s Pro

Claude Sonnet 4 37 tok/s Pro

2000 character limit reached

A0C: Alpha Zero in Continuous Action Space (1805.09613v1)

Published 24 May 2018 in stat.ML, cs.AI, cs.LG, cs.RO, and cs.SY

Abstract: A core novelty of Alpha Zero is the interleaving of tree search and deep learning, which has proven very successful in board games like Chess, Shogi and Go. These games have a discrete action space. However, many real-world reinforcement learning domains have continuous action spaces, for example in robotic control, navigation and self-driving cars. This paper presents the necessary theoretical extensions of Alpha Zero to deal with continuous action space. We also provide some preliminary experiments on the Pendulum swing-up task, empirically showing the feasibility of our approach. Thereby, this work provides a first step towards the application of iterated search and learning in domains with a continuous action space.

Citations (48)

View on Semantic Scholar