Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
169 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Policy-Space Search: Equivalences, Improvements, and Compression (2403.19883v1)

Published 28 Mar 2024 in cs.AI

Abstract: Fully-observable non-deterministic (FOND) planning is at the core of artificial intelligence planning with uncertainty. It models uncertainty through actions with non-deterministic effects. A* with Non-Determinism (AND*) (Messa and Pereira, 2023) is a FOND planner that generalizes A* (Hart et al., 1968) for FOND planning. It searches for a solution policy by performing an explicit heuristic search on the policy space of the FOND task. In this paper, we study and improve the performance of the policy-space search performed by AND*. We present a polynomial-time procedure that constructs a solution policy given just the set of states that should be mapped. This procedure, together with a better understanding of the structure of FOND policies, allows us to present three concepts of equivalences between policies. We use policy equivalences to prune part of the policy search space, making AND* substantially more effective in solving FOND tasks. We also study the impact of taking into account structural state-space symmetries to strengthen the detection of equivalence policies and the impact of performing the search with satisficing techniques. We apply a recent technique from the group theory literature to better compute structural state-space symmetries. Finally, we present a solution compressor that, given a policy defined over complete states, finds a policy that unambiguously represents it using the minimum number of partial states. AND* with the introduced techniques generates, on average, two orders of magnitude fewer policies to solve FOND tasks. These techniques allow explicit policy-space search to be competitive in terms of both coverage and solution compactness with other state-of-the-art FOND planners.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (19)
  1. Complexity results for sas+ planning. Computational Intelligence 11, 625–655.
  2. Weak, strong, and strong cyclic planning via symbolic model checking. Artificial Intelligence 147, 35–84.
  3. Experiments with the graph traverser program, in: Proceedings of the Royal Society of London. Series A. Mathematical and Physical Sciences, pp. 235–259.
  4. Heuristic Search: Theory and Applications. Elsevier.
  5. Strips: A new approach to the application of theorem proving to problem solving. Artificial intelligence 2, 189–208.
  6. Compact policies for fully observable non-deterministic planning as sat, in: Proceedings of the International Conference on Automated Planning and Scheduling, pp. 88–96.
  7. A formal basis for the heuristic determination of minimum cost oaths. IEEE transactions on Systems Science and Cybernetics 4, 100–107.
  8. Concise finite-domain representations for pddl planning tasks. Artificial Intelligence 173, 503–535.
  9. Landmarks, critical paths and abstractions: what’s the difference anyway?, in: Proceedings of the International Conference on Automated Planning and Scheduling, pp. 162–169.
  10. Minimal and canonical images. Journal of Algebra 521, 481–506.
  11. Practical graph isomorphism, ii. Journal of Symbolic Computation 60, 94–112.
  12. A best-first search algorithm for fond planning and heuristic functions to optimize decompressed solution size, in: Proceedings of the International Conference on Automated Planning and Scheduling, pp. 277–285.
  13. Improved non-deterministic planning by exploiting state relevance, in: Proceedings of the International Conference on Automated Planning and Scheduling, pp. 172–180.
  14. Iterative depth-first search for fond planning, in: Proceedings of the International Conference on Automated Planning and Scheduling, pp. 90–99.
  15. Exploiting problem symmetries in state-based planners, in: Proceedings of the AAAI conference on artificial intelligence, pp. 1004–1009.
  16. Heuristic search viewed as path finding in a graph. Artificial Intelligence 1, 193–204.
  17. Artificial Intelligence: A Modern Approach. Pearson.
  18. Heuristics and symmetries in classical planning, in: Proceedings of the AAAI Conference on Artificial Intelligence, pp. 3371–3377.
  19. Structural symmetries for fully observable nondeterministic planning, in: Proceedings of the International Joint Conference On Artificial Intelligence, pp. 3293–3299.

Summary

We haven't generated a summary for this paper yet.