LaPlaSS: Latent Space Planning for Stochastic Systems (2404.07063v1)
Abstract: Autonomous mobile agents often operate in hazardous environments, necessitating an awareness of safety. These agents can have non-linear, stochastic dynamics that must be considered during planning to guarantee bounded risk. Most state of the art methods require closed-form dynamics to verify plan correctness and safety however modern robotic systems often have dynamics that are learned from data. Thus, there is a need to perform efficient trajectory planning with guarantees on risk for agents without known dynamics models. We propose a "generate-and-test" approach to risk-bounded planning in which a planner generates a candidate trajectory using an approximate linear dynamics model and a validator assesses the risk of the trajectory, computing additional safety constraints for the planner if the candidate does not satisfy the desired risk bound. To acquire the approximate model, we use a variational autoencoder to learn a latent linear dynamics model and encode the planning problem into the latent space to generate the candidate trajectory. The VAE also serves to sample trajectories around the candidate to use in the validator. We demonstrate that our algorithm, LaPlaSS, is able to generate trajectory plans with bounded risk for a real-world agent with learned dynamics and is an order of magnitude more efficient than the state of the art.
- H. X. Li and B. C. Williams, “Generative planning for hybrid systems based on flow tubes,” in International Conference on Automated Planning and Scheduling, 2008. [Online]. Available: https://api.semanticscholar.org/CorpusID:10626542
- J. Chen, B. C. Williams, and C. Fan, “Optimal mixed discrete-continuous planning for linear hybrid systems,” Proceedings of the 24th International Conference on Hybrid Systems: Computation and Control, 2021. [Online]. Available: https://api.semanticscholar.org/CorpusID:231934007
- C. R. Garrett, R. Chitnis, R. Holladay, B. Kim, T. Silver, L. P. Kaelbling, and T. Lozano-Pérez, “Integrated task and motion planning,” Annual Review of Control, Robotics, and Autonomous Systems, vol. 4, no. 1, pp. 265–293, 2021.
- E. Fernández-González, B. C. Williams, and E. Karpas, “Scottyactivity: Mixed discrete-continuous planning with convex optimization,” J. Artif. Intell. Res., vol. 62, pp. 579–664, 2018. [Online]. Available: https://api.semanticscholar.org/CorpusID:51912874
- K. Jin, H. H. Zhuo, Z. Xiao, H. Wan, and S. Kambhampati, “Gradient-based mixed planning with symbolic and numeric action parameters,” Artif. Intell., vol. 313, p. 103789, 2021. [Online]. Available: https://api.semanticscholar.org/CorpusID:252512104
- W. Han, A. Jasour, and B. C. Williams, “Non-gaussian risk bounded trajectory optimization for stochastic nonlinear systems in uncertain environments,” in Proceedings of the 2022 International Conference on Robotics and Automation (ICRA 2022), 2022, pp. 11 044–11 050.
- M. Ono and B. C. Williams, “Iterative risk allocation: A new approach to robust model predictive control with a joint chance constraint,” in Proceedings of the 47th IEEE Conference on Decision and Control, Cancun, Mexico, December 2008.
- L. Blackmore and M. Ono, “Convex chance constrained predictive control without sampling,” in Proceedings of the AIAA Guidance, Navigation and Control Conference, 2009.
- R. Jonschkowski and O. Brock, “State representation learning in robotics: Using prior knowledge about physical interaction,” in Robotics: Science and Systems, 2014.
- L. Agostini, “Exploration and prediction of fluid dynamical systems using auto-encoder technology,” Physics of Fluids, vol. 32, no. 6, 06 2020, 067103.
- M. Lazzara, M. Chevalier, M. Colombo, J. Garay Garcia, C. Lapeyre, and O. Teste, “Surrogate modelling for an aircraft dynamic landing loads simulation using an lstm autoencoder-based dimensionality reduction approach,” Aerospace Science and Technology, vol. 126, p. 107629, 2022.
- M. Watter, J. Springenberg, J. Boedecker, and M. Riedmiller, “Embed to control: A locally linear latent dynamics model for control from raw images,” in Advances in Neural Information Processing Systems, C. Cortes, N. Lawrence, D. Lee, M. Sugiyama, and R. Garnett, Eds., vol. 28. Curran Associates, Inc., 2015.
- M. Fraccaro, S. Kamronn, U. Paquet, and O. Winther, “A disentangled recognition and nonlinear dynamics model for unsupervised learning,” in Advances in Neural Information Processing Systems, I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett, Eds., vol. 30. Curran Associates, Inc., 2017.
- M. Karl, M. Soelch, J. Bayer, and P. van der Smagt, “Deep variational bayes filters: Unsupervised learning of state space models from raw data,” in International Conference on Learning Representations, 2017.
- B. Lusch, J. N. Kutz, and S. L. Brunton, “Deep learning for universal linear embeddings of nonlinear dynamics,” Nature Communications, vol. 9, no. 1, p. 4950, 2018.
- Y. Li, H. He, J. Wu, D. Katabi, and A. Torralba, “Learning compositional koopman operators for model-based control,” in International Conference on Learning Representations, 2020.
- D. Hafner, T. Lillicrap, I. Fischer, R. Villegas, D. Ha, H. Lee, and J. Davidson, “Learning latent dynamics for planning from pixels,” arXiv preprint arXiv:1811.04551, 2018.
- T. Nguyen, R. Shu, T. Pham, H. Bui, and S. Ermon, “Temporal predictive coding for model-based planning in latent space,” in Proceedings of the 38th International Conference on Machine Learning, ser. Proceedings of Machine Learning Research, vol. 139. PMLR, 2021.
- T. V. A. N. Olesen, D. T. T. Nguyen, R. B. Palm, and S. Risi, “Evolutionary planning in latent space,” in Applications of Evolutionary Computation, P. A. Castillo and J. L. Jiménez Laredo, Eds. Cham: Springer International Publishing, 2021, pp. 522–536.
- J. Engel, M. Hoffman, and A. Roberts, “Latent constraints: Learning to generate conditionally from unconditional generative models,” in International Conference on Learning Representations, 2018.
- A. J. Lew and M. J. Buehler, “Encoding and exploring latent design space of optimal material structures via a vae-lstm model,” Forces in Mechanics, vol. 5, p. 100054, 2021.
- S. M. Park, H. G. Yoon, D. B. Lee, J. W. Choi, H. Y. Kwon, and C. Won, “Optimization of physical quantities in the autoencoder latent space,” Scientific Reports, vol. 12, no. 1, p. 9003, May 2022.
- J. Morton, F. D. Witherden, and M. J. Kochenderfer, “Deep variational koopman models: Inferring koopman observations for uncertainty-aware dynamics modeling and control,” in Proceedings of the Twenty-Eighth International Joint Conference on Artificial Intelligence, IJCAI-19. International Joint Conferences on Artificial Intelligence Organization, 7 2019, pp. 3173–3179.
- A. Wang, X. Huang, A. Jasour, and B. C. Williams, “Fast risk assessment for autonomous vehicles using learned models of agent futures,” in Robotics: Science and Systems, 2020.
- D. P. Kingma and M. Welling, “Auto-encoding variational bayes,” CoRR, vol. abs/1312.6114, 2013. [Online]. Available: https://api.semanticscholar.org/CorpusID:216078090
- H. Liu, Y. Tang, and H. H. Zhang, “A new chi-square approximation to the distribution of non-negative definite quadratic forms in non-central normal variables,” Computational Statistics & Data Analysis, vol. 53, no. 4, pp. 853–856, 2009. [Online]. Available: https://www.sciencedirect.com/science/article/pii/S0167947308005653
- A. Antonini, W. Guerra, V. Murali, T. Sayre-McCord, and S. Karaman, “The blackbird uav dataset,” The International Journal of Robotics Research, vol. 39, no. 10-11, pp. 1346–1364, 2020.