2000 character limit reached
Model-Based Episodic Memory Induces Dynamic Hybrid Controls (2111.02104v2)
Published 3 Nov 2021 in cs.LG and cs.AI
Abstract: Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our memory estimates trajectory values, guiding the agent towards good policies. Built upon the memory, we construct a complementary learning model via a dynamic hybrid control unifying model-based, episodic and habitual learning into a single architecture. Experiments demonstrate that our model allows significantly faster and better learning than other strong reinforcement learning agents across a variety of environments including stochastic and non-Markovian settings.
- Hung Le (120 papers)
- Thommen Karimpanal George (6 papers)
- Majid Abdolshah (10 papers)
- Truyen Tran (112 papers)
- Svetha Venkatesh (160 papers)