Model-Based Regularization for Deep Reinforcement Learning with Transcoder Networks (1809.01906v2)
Abstract: This paper proposes a new optimization objective for value-based deep reinforcement learning. We extend conventional Deep Q-Networks (DQNs) with a model-learning component, yielding a transcoder network. The model's prediction errors are included in the basic DQN loss as additional regularizers. This augmented objective leads to a richer training signal that provides feedback at every time step. Moreover, because learning an environment model shares common structure with the RL problem, we hypothesize that the resulting objective improves both sample efficiency and performance. We empirically confirm our hypothesis on 20 games from the Atari benchmark, attaining superior results over a vanilla DQN without model-based regularization.
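To make the augmented objective concrete, the following is a minimal sketch under stated assumptions, not the paper's implementation: a standard DQN TD loss is combined with squared prediction errors for the next state and reward produced by a learned model head. The network interfaces (`q_net`, `target_q_net`, `model_net`), the use of mean-squared error for the model terms, and the weighting coefficients `lambda_state` / `lambda_reward` are all illustrative assumptions.

```python
# Hypothetical sketch of a model-regularized DQN objective:
# standard TD loss plus prediction errors of a learned transition/reward model.
import torch
import torch.nn.functional as F

def augmented_dqn_loss(q_net, target_q_net, model_net,
                       s, a, r, s_next, done, gamma=0.99,
                       lambda_state=1.0, lambda_reward=1.0):
    # Standard DQN TD loss on the Q-value of the taken action.
    q_sa = q_net(s).gather(1, a.unsqueeze(1)).squeeze(1)
    with torch.no_grad():
        target = r + gamma * (1 - done) * target_q_net(s_next).max(dim=1).values
    td_loss = F.smooth_l1_loss(q_sa, target)

    # Model-based regularizers: predicted next state and reward from (s, a).
    # model_net is an assumed interface returning both predictions.
    pred_next_state, pred_reward = model_net(s, a)
    state_loss = F.mse_loss(pred_next_state, s_next)
    reward_loss = F.mse_loss(pred_reward, r)

    # Augmented objective: TD loss plus weighted model prediction errors.
    return td_loss + lambda_state * state_loss + lambda_reward * reward_loss
```

In this reading, the model terms supply a dense learning signal at every transition even when the TD error is small, which is one way to interpret the abstract's claim of richer feedback at each time step.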