2000 character limit reached
Policy Optimization with Linear Temporal Logic Constraints (2206.09546v2)
Published 20 Jun 2022 in cs.LG, cs.AI, and cs.LO
Abstract: We study the problem of policy optimization (PO) with linear temporal logic (LTL) constraints. The language of LTL allows flexible description of tasks that may be unnatural to encode as a scalar cost function. We consider LTL-constrained PO as a systematic framework, decoupling task specification from policy selection, and as an alternative to the standard of cost shaping. With access to a generative model, we develop a model-based approach that enjoys a sample complexity analysis for guaranteeing both task satisfaction and cost optimality (through a reduction to a reachability problem). Empirically, our algorithm can achieve strong performance even in low-sample regimes.
- Cameron Voloshin (6 papers)
- Hoang M. Le (15 papers)
- Swarat Chaudhuri (61 papers)
- Yisong Yue (154 papers)