Model-Based Episodic Memory Induces Dynamic Hybrid Controls (2111.02104v2)

Published 3 Nov 2021 in cs.LG and cs.AI

Abstract: Episodic control enables sample efficiency in reinforcement learning by recalling past experiences from an episodic memory. We propose a new model-based episodic memory of trajectories addressing current limitations of episodic control. Our memory estimates trajectory values, guiding the agent towards good policies. Built upon the memory, we construct a complementary learning model via a dynamic hybrid control unifying model-based, episodic and habitual learning into a single architecture. Experiments demonstrate that our model allows significantly faster and better learning than other strong reinforcement learning agents across a variety of environments including stochastic and non-Markovian settings.

Authors (5)

Hung Le (120 papers)
Thommen Karimpanal George (6 papers)
Majid Abdolshah (10 papers)
Truyen Tran (112 papers)
Svetha Venkatesh (160 papers)

Citations (16)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Model-Based Episodic Memory Induces Dynamic Hybrid Controls (2111.02104v2)

Summary

Related Papers