Partially Observable Task and Motion Planning with Uncertainty and Risk Awareness (2403.10454v2)
Abstract: Integrated task and motion planning (TAMP) has proven to be a valuable approach to generalizable long-horizon robotic manipulation and navigation problems. However, the typical TAMP problem formulation assumes full observability and deterministic action effects. These assumptions limit the ability of the planner to gather information and make decisions that are risk-aware. We propose a strategy for TAMP with Uncertainty and Risk Awareness (TAMPURA) that is capable of efficiently solving long-horizon planning problems with initial-state and action outcome uncertainty, including problems that require information gathering and avoiding undesirable and irreversible outcomes. Our planner reasons under uncertainty at both the abstract task level and continuous controller level. Given a set of closed-loop goal-conditioned controllers operating in the primitive action space and a description of their preconditions and potential capabilities, we learn a high-level abstraction that can be solved efficiently and then refined to continuous actions for execution. We demonstrate our approach on several robotics problems where uncertainty is a crucial factor and show that reasoning under uncertainty in these problems outperforms previously proposed determinized planning, direct search, and reinforcement learning strategies. Lastly, we demonstrate our planner on two real-world robotics problems using recent advancements in probabilistic perception.
- Can autonomous vehicles identify, recover from, and adapt to distribution shifts? CoRR, abs/2006.14911, 2020. URL https://arxiv.org/abs/2006.14911.
- Bayesian robust optimization for imitation learning. CoRR, abs/2007.12315, 2020. URL https://arxiv.org/abs/2007.12315.
- Robust policy learning over multiple uncertainty sets, 2022.
- Uncertainty-aware model-based reinforcement learning with application to autonomous driving. CoRR, abs/2106.12194, 2021. URL https://arxiv.org/abs/2106.12194.
- Global planning for contact-rich manipulation via local smoothing of quasi-dynamic contact models, 2023.
- Controlling contact-rich manipulation under partial observability. In Robotics: Science and Systems, 2020. URL https://api.semanticscholar.org/CorpusID:220069704.
- Long-horizon manipulation of unknown objects via task and motion planning with estimated affordances. CoRR, abs/2108.04145, 2021a. URL https://arxiv.org/abs/2108.04145.
- Learning efficient abstract planning models that choose what to predict, 2023.
- Sequence-based plan feasibility prediction for efficient task and motion planning, 2023.
- Lao*: A heuristic search algorithm that finds solutions with loops. Artificial Intelligence, 129(1):35–62, 2001. ISSN 0004-3702. doi: https://doi.org/10.1016/S0004-3702(01)00106-0. URL https://www.sciencedirect.com/science/article/pii/S0004370201001060.
- Learning symbolic operators for task and motion planning. CoRR, abs/2103.00589, 2021. URL https://arxiv.org/abs/2103.00589.
- From word models to world models: Translating from natural language to the probabilistic language of thought, 2023.
- Monte-Carlo planning in large POMDPs. In J. Lafferty, C. Williams, J. Shawe-Taylor, R. Zemel, and A. Culotta, editors, Advances in Neural Information Processing Systems, volume 23. Curran Associates, Inc., 2010.
- DESPOT: online POMDP planning with regularization. CoRR, abs/1609.03250, 2016. URL http://arxiv.org/abs/1609.03250.
- Integrated task and motion planning. CoRR, abs/2010.01083, 2020. URL https://arxiv.org/abs/2010.01083.
- Stripstream: Integrating symbolic planners and blackbox samplers. CoRR, abs/1802.08705, 2018. URL http://arxiv.org/abs/1802.08705.
- Anytime integrated task and motion policies for stochastic environments. CoRR, abs/1904.13006, 2019. URL http://arxiv.org/abs/1904.13006.
- Contingent task and motion planning under uncertainty for human–robot interactions. Applied Sciences, 10(5), 2020. ISSN 2076-3417. doi: 10.3390/app10051665. URL https://www.mdpi.com/2076-3417/10/5/1665.
- Online replanning in belief space for partially observable task and motion problems. CoRR, abs/1911.04577, 2019. URL http://arxiv.org/abs/1911.04577.
- Integrated task and motion planning in belief space. The International Journal of Robotics Research, 32:1194–1227, 08 2013. doi: 10.1177/0278364913484072.
- Modular task and motion planning in belief space. In 2015 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pages 4991–4998, 2015. doi: 10.1109/IROS.2015.7354079.
- Edward Sondik. The optimal control of partially observable markov process over the infinite horizon: Discounted costs. Operations Research, 26:282–304, 04 1978. doi: 10.1287/opre.26.2.282.
- An introduction to sequential Monte Carlo, volume 4. Springer, 2020.
- Probabilistic Robotics (Intelligent Robotics and Autonomous Agents). The MIT Press, 2005. ISBN 0262201623.
- Gen: a general-purpose probabilistic programming system with programmable inference. In Proceedings of the 40th acm sigplan conference on programming language design and implementation, pages 221–236, 2019.
- Probabilistic programming with stochastic probabilities. Proceedings of the ACM on Programming Languages, 7(PLDI):1708–1732, 2023a.
- Smcp3: Sequential monte carlo with probabilistic program proposals. In International Conference on Artificial Intelligence and Statistics, pages 7061–7088. PMLR, 2023b.
- Automating involutive mcmc using probabilistic and differentiable programming. arXiv preprint arXiv:2007.09871, 2020.
- Bayes3d: fast learning and inference in structured generative models of 3d objects and scenes, 2023.
- 3dp3: 3d scene perception via probabilistic programming. Advances in Neural Information Processing Systems, 34:9600–9612, 2021.
- 3d neural embedding likelihood: Probabilistic inverse graphics for robust 6d pose estimation. In Proceedings of the IEEE/CVF International Conference on Computer Vision, pages 21625–21636, 2023.
- Håkan L. S. Younes and Michael L. Littman. Ppddl 1 . 0 : An extension to pddl for expressing planning domains with probabilistic effects. 2004. URL https://api.semanticscholar.org/CorpusID:2767666.
- Weak, strong, and strong cyclic planning via symbolic model checking. Artificial Intelligence, 147(1):35–84, 2003. ISSN 0004-3702. doi: https://doi.org/10.1016/S0004-3702(02)00374-0. URL https://www.sciencedirect.com/science/article/pii/S0004370202003740. Planning with Uncertainty and Incomplete Information.
- Ff-replan: A baseline for probabilistic planning. In International Conference on Automated Planning and Scheduling, 2007. URL https://api.semanticscholar.org/CorpusID:15013602.
- On bayesian upper confidence bounds for bandit problems. In Neil D. Lawrence and Mark Girolami, editors, Proceedings of the Fifteenth International Conference on Artificial Intelligence and Statistics, volume 22 of Proceedings of Machine Learning Research, pages 592–600, La Palma, Canary Islands, 21–23 Apr 2012. PMLR. URL https://proceedings.mlr.press/v22/kaufmann12.html.
- David Speck. Symbolic Search for Optimal Planning with Expressive Extensions. PhD thesis, University of Freiburg, 2022.
- Pybullet, a python module for physics simulation for games, robotics and machine learning. http://pybullet.org, 2016–2021.
- Caelan Reed Garrett. PyBullet Planning. https://pypi.org/project/pybullet-planning/, 2018.
- Mask R-CNN. CoRR, abs/1703.06870, 2017. URL http://arxiv.org/abs/1703.06870.
- POMCPOW: an online algorithm for pomdps with continuous state, action, and observation spaces. CoRR, abs/1709.06196, 2017. URL http://arxiv.org/abs/1709.06196.
- Task-directed exploration in continuous pomdps for robotic manipulation of articulated objects, 2022.
- Robust-rrt: Probabilistically-complete motion planning for uncertain nonlinear systems, 2022.
- Discovering state and action abstractions for generalized task and motion planning. CoRR, abs/2109.11082, 2021b. URL https://arxiv.org/abs/2109.11082.
- Predicate invention for bilevel planning. Proceedings of the AAAI Conference on Artificial Intelligence, 37(10):12120–12129, Jun. 2023. doi: 10.1609/aaai.v37i10.26429. URL https://ojs.aaai.org/index.php/AAAI/article/view/26429.