Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Recursively-Constrained Partially Observable Markov Decision Processes (2310.09688v3)

Published 15 Oct 2023 in cs.AI and cs.RO

Abstract: Many sequential decision problems involve optimizing one objective function while imposing constraints on other objectives. Constrained Partially Observable Markov Decision Processes (C-POMDP) model this case with transition uncertainty and partial observability. In this work, we first show that C-POMDPs violate the optimal substructure property over successive decision steps and thus may exhibit behaviors that are undesirable for some (e.g., safety critical) applications. Additionally, online re-planning in C-POMDPs is often ineffective due to the inconsistency resulting from this violation. To address these drawbacks, we introduce the Recursively-Constrained POMDP (RC-POMDP), which imposes additional history-dependent cost constraints on the C-POMDP. We show that, unlike C-POMDPs, RC-POMDPs always have deterministic optimal policies and that optimal policies obey BeLLMan's principle of optimality. We also present a point-based dynamic programming algorithm for RC-POMDPs. Evaluations on benchmark problems demonstrate the efficacy of our algorithm and show that policies for RC-POMDPs produce more desirable behaviors than policies for C-POMDPs.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (8)
  1. Qi Heng Ho (12 papers)
  2. Tyler Becker (5 papers)
  3. Benjamin Kraske (3 papers)
  4. Zakariya Laouar (5 papers)
  5. Martin S. Feather (5 papers)
  6. Federico Rossi (29 papers)
  7. Morteza Lahijanian (59 papers)
  8. Zachary N. Sunberg (20 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.