Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
41 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
41 tokens/sec
o3 Pro
7 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

On the Convergence and Sample Complexity Analysis of Deep Q-Networks with $ε$-Greedy Exploration (2310.16173v1)

Published 24 Oct 2023 in cs.LG

Abstract: This paper provides a theoretical understanding of Deep Q-Network (DQN) with the $\varepsilon$-greedy exploration in deep reinforcement learning. Despite the tremendous empirical achievement of the DQN, its theoretical characterization remains underexplored. First, the exploration strategy is either impractical or ignored in the existing analysis. Second, in contrast to conventional Q-learning algorithms, the DQN employs the target network and experience replay to acquire an unbiased estimation of the mean-square BeLLMan error (MSBE) utilized in training the Q-network. However, the existing theoretical analysis of DQNs lacks convergence analysis or bypasses the technical challenges by deploying a significantly overparameterized neural network, which is not computationally efficient. This paper provides the first theoretical convergence and sample complexity analysis of the practical setting of DQNs with $\epsilon$-greedy policy. We prove an iterative procedure with decaying $\epsilon$ converges to the optimal Q-value function geometrically. Moreover, a higher level of $\epsilon$ values enlarges the region of convergence but slows down the convergence, while the opposite holds for a lower level of $\epsilon$ values. Experiments justify our established theoretical insights on DQNs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (9)
  1. Shuai Zhang (319 papers)
  2. Hongkang Li (14 papers)
  3. Meng Wang (1063 papers)
  4. Miao Liu (98 papers)
  5. Pin-Yu Chen (311 papers)
  6. Songtao Lu (60 papers)
  7. Sijia Liu (204 papers)
  8. Keerthiram Murugesan (38 papers)
  9. Subhajit Chaudhury (40 papers)
Citations (15)