Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Measuring and Characterizing Generalization in Deep Reinforcement Learning (1812.02868v2)

Published 7 Dec 2018 in cs.LG, cs.AI, and stat.ML

Abstract: Deep reinforcement-learning methods have achieved remarkable performance on challenging control tasks. Observations of the resulting behavior give the impression that the agent has constructed a generalized representation that supports insightful action decisions. We re-examine what is meant by generalization in RL, and propose several definitions based on an agent's performance in on-policy, off-policy, and unreachable states. We propose a set of practical methods for evaluating agents with these definitions of generalization. We demonstrate these techniques on a common benchmark task for deep RL, and we show that the learned networks make poor decisions for states that differ only slightly from on-policy states, even though those states are not selected adversarially. Taken together, these results call into question the extent to which deep Q-networks learn generalized representations, and suggest that more experimentation and analysis is necessary before claims of representation learning can be supported.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Sam Witty (6 papers)
  2. Jun Ki Lee (7 papers)
  3. Emma Tosch (7 papers)
  4. Akanksha Atrey (7 papers)
  5. Michael Littman (17 papers)
  6. David Jensen (66 papers)
Citations (59)

Summary

We haven't generated a summary for this paper yet.