2000 character limit reached
Exploration by Distributional Reinforcement Learning (1805.01907v2)
Published 4 May 2018 in cs.LG, cs.AI, and stat.ML
Abstract: We propose a framework based on distributional reinforcement learning and recent attempts to combine Bayesian parameter updates with deep reinforcement learning. We show that our proposed framework conceptually unifies multiple previous methods in exploration. We also derive a practical algorithm that achieves efficient exploration on challenging control tasks.