Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Provable Representation with Efficient Planning for Partial Observable Reinforcement Learning (2311.12244v3)

Published 20 Nov 2023 in cs.LG, cs.AI, and stat.ML

Abstract: In most real-world reinforcement learning applications, state information is only partially observable, which breaks the Markov decision process assumption and leads to inferior performance for algorithms that conflate observations with state. Partially Observable Markov Decision Processes (POMDPs), on the other hand, provide a general framework that allows for partial observability to be accounted for in learning, exploration and planning, but presents significant computational and statistical challenges. To address these difficulties, we develop a representation-based perspective that leads to a coherent framework and tractable algorithmic approach for practical reinforcement learning from partial observations. We provide a theoretical analysis for justifying the statistical efficiency of the proposed algorithm, and also empirically demonstrate the proposed algorithm can surpass state-of-the-art performance with partial observations across various benchmarks, advancing reliable reinforcement learning towards more practical applications.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Hongming Zhang (111 papers)
  2. Tongzheng Ren (32 papers)
  3. Chenjun Xiao (21 papers)
  4. Dale Schuurmans (112 papers)
  5. Bo Dai (245 papers)
Citations (3)

Summary

We haven't generated a summary for this paper yet.