Variational Deep Q Network (1711.11225v1)

Published 30 Nov 2017 in cs.LG, cs.AI, and stat.ML

Abstract: We propose a framework that directly tackles the probability distribution of the value function parameters in Deep Q Network (DQN), with powerful variational inference subroutines to approximate the posterior of the parameters. We will establish the equivalence between our proposed surrogate objective and variational inference loss. Our new algorithm achieves efficient exploration and performs well on large scale chain Markov Decision Process (MDP).

Citations (10)

View on Semantic Scholar

Collections

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Paper Prompts

Explore 10 Community Prompts

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

Generate Now

Variational Deep Q Network (1711.11225v1)

Collections

Summary

Paper Prompts

Follow-up Questions

Related Papers

Authors (2)