Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
166 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
42 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Efficient Constraint Generation for Stochastic Shortest Path Problems (2401.14636v1)

Published 26 Jan 2024 in cs.AI

Abstract: Current methods for solving Stochastic Shortest Path Problems (SSPs) find states' costs-to-go by applying BeLLMan backups, where state-of-the-art methods employ heuristics to select states to back up and prune. A fundamental limitation of these algorithms is their need to compute the cost-to-go for every applicable action during each state backup, leading to unnecessary computation for actions identified as sub-optimal. We present new connections between planning and operations research and, using this framework, we address this issue of unnecessary computation by introducing an efficient version of constraint generation for SSPs. This technique allows algorithms to ignore sub-optimal actions and avoid computing their costs-to-go. We also apply our novel technique to iLAO* resulting in a new algorithm, CG-iLAO*. Our experiments show that CG-iLAO* ignores up to 57% of iLAO*'s actions and it solves problems up to 8x and 3x faster than LRTDP and iLAO*.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (26)
  1. Heuristic Search Planning With Multi-Objective Probabilistic LTL Constraints. In Proc. of 16th Int. Conf. on Principles of Knowledge Representation and Reasoning (KR).
  2. Bellman, R. 1957. Dynamic programming. Princeton University Press.
  3. Bertsekas, D. 1995. Dynamic Programming and Optimal Control, volume 2. Athena Scientific.
  4. An Analysis of Stochastic Shortest Path Problems. Mathematics of Operations Research.
  5. Introduction to Linear Optimization. Athena Scientific.
  6. Labeled RTDP: Improving the Convergence of Real-Time Dynamic Programming. In Proc. of 13th Int. Conf. on Automated Planning and Scheduling (ICAPS).
  7. Verification of Markov Decision Processes Using Learning Algorithms. In Automated Technology for Verification and Analysis.
  8. Buffet, D. 2008. International Planning Competition Uncertainty Part: Benchmarks and Results.
  9. Multi-Agent Planning with Mixed-Integer Programming and Adaptive Interaction Constraint Generation (Extended Abstract). Proc. of 14th Symposium on Combinatorial Search (SoCS).
  10. Focused topological value iteration. In Proc. of 19th Int. Conf. on Automated Planning and Scheduling (ICAPS), volume 19, 82–89.
  11. LAO*{}^{*}start_FLOATSUPERSCRIPT * end_FLOATSUPERSCRIPT: A heuristic search algorithm that finds solutions with loops. Artificial Intelligence.
  12. Landmarks, Critical Paths and Abstractions: What’s the Difference Anyway? Proc. of 19th Int. Conf. on Automated Planning and Scheduling (ICAPS).
  13. Probabilistic planning vs. replanning. In ICAPS Workshop on IPC: Past, Present and Future.
  14. Progression Heuristics for Planning with Probabilistic LTL Constraints. In Proc. of 35th AAAI Conference on Artificial Intelligence.
  15. Bounded Real-Time Dynamic Programming: RTDP with Monotone Upper Bounds and Performance Guarantees. In Proc. of 22nd Int. Conf. on Machine Learning.
  16. Bayesian real-time dynamic programming. In Proc. of 21st Int. Joint Conf. on AI (IJCAI).
  17. Metric Hybrid Factored Planning in Nonlinear Domains with Constraint Generation, 502–518. Springer International Publishing.
  18. Code, benchmarks, and technical report for AAAI 2024 paper “Efficient Constraint Generation for Stochastic Shortest Path Problems”. https://doi.org/10.5281/zenodo.10344842.
  19. Direct value-approximation for factored MDPs. In Advances in Neural Information Processing Systems (NeurIPS).
  20. Focused real-time dynamic programming for MDPs: Squeezing more out of a heuristic. In Proc. of 20th AAAI Conf. on Artificial Intelligence.
  21. Teichteil-Königsbuch, F. 2012. Stochastic safest and shortest path problems. In Proc. of 26th AAAI Conf. on AI.
  22. Planning under Risk and Knightian Uncertainty. In Proc. of 20th Int. Joint Conf. on AI (IJCAI).
  23. Efficient Solutions for Stochastic Shortest Path Problems with Dead Ends. In Proc. of 33rd Int. Conf. on Uncertainty in Artificial Intelligence (UAI).
  24. Occupation Measure Heuristics for Probabilistic Planning. In Proc. of 27th Int. Conf. on Automated Planning and Scheduling (ICAPS).
  25. Accelerated Vector Pruning for Optimal POMDP Solvers. Proc. of 31st AAAI Conf. on AI.
  26. Markov decision processes with imprecise transition probabilities. Operations Research, 42(4): 739–749.
Citations (1)

Summary

We haven't generated a summary for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com

Tweets