Receding-Constraint Model Predictive Control using a Learned Approximate Control-Invariant Set (2309.11124v2)
Abstract: In recent years, advanced model-based and data-driven control methods are unlocking the potential of complex robotics systems, and we can expect this trend to continue at an exponential rate in the near future. However, ensuring safety with these advanced control methods remains a challenge. A well-known tool to make controllers (either Model Predictive Controllers or Reinforcement Learning policies) safe, is the so-called control-invariant set (a.k.a. safe set). Unfortunately, for nonlinear systems, such a set cannot be exactly computed in general. Numerical algorithms exist for computing approximate control-invariant sets, but classic theoretic control methods break down if the set is not exact. This paper presents our recent efforts to address this issue. We present a novel Model Predictive Control scheme that can guarantee recursive feasibility and/or safety under weaker assumptions than classic methods. In particular, recursive feasibility is guaranteed by making the safe-set constraint move backward over the horizon, and assuming that such set satisfies a condition that is weaker than control invariance. Safety is instead guaranteed under an even weaker assumption on the safe set, triggering a safe task-abortion strategy whenever a risk of constraint violation is detected. We evaluated our approach on a simulated robot manipulator, empirically demonstrating that it leads to less constraint violations than state-of-the-art approaches, while retaining reasonable performance in terms of tracking cost, number of completed tasks, and computation time.
- F. Blanchini, “Set invariance in control,” Automatica, vol. 35, pp. 1747–1767, 1999.
- A. D. Ames, J. W. Grizzle, and P. Tabuada, “Control barrier function based quadratic programs with application to adaptive cruise control,” in IEEE Conference on Decision and Control, 2014, pp. 6271–6278.
- Z. Wu, F. Albalawi, Z. Zhang, J. Zhang, H. Durand, and P. D. Christofides, “Control Lyapunov-Barrier function-based model predictive control of nonlinear systems,” Automatica, vol. 109, p. 108508, 2019. [Online]. Available: https://doi.org/10.1016/j.automatica.2019.108508
- B. Djeridane and J. Lygeros, “Neural approximation of pde solutions: An application to reachability computations,” in IEEE Conference on Decision and Control, 2006, pp. 3034–3039.
- P.-A. Coquelin, S. Martin, and R. Munos, “A dynamic programming approach to viability problems,” in IEEE International Symposium on Approximate Dynamic Programming and Reinforcement Learning, 2007, pp. 178–184.
- F. Jiang, G. Chou, M. Chen, and C. J. Tomlin, “Using neural networks to compute approximate and guaranteed feasible hamilton-jacobi-bellman pde solutions,” 2016. [Online]. Available: https://www.arxiv.org/abs/1611.03158
- V. Rubies-Royo and C. Tomlin, “Recursive Regression with Neural Networks: Approximating the HJI PDE Solution,” in International Conference on Learning Representations, 2017.
- K. C. Hsu, V. Rubies-Royo, C. J. Tomlin, and J. F. Fisac, “Safety and Liveness Guarantees through Reach-Avoid Reinforcement Learning,” Robotics: Science and Systems, 2021.
- C. Dawson, S. Gao, and C. Fan, “Safe Control With Learned Certificates : A Survey of Neural Lyapunov , Barrier , and Contraction Methods for Robotics and Control,” IEEE Transactions on Robotics, vol. 39, no. 3, pp. 1749–1767, 2023.
- Y. Zhou, D. Li, Y. Xi, and Y. Xu, “Data-driven approximation for feasible regions in nonlinear model predictive control,” 2020. [Online]. Available: https://arxiv.org/abs/2012.03428
- A. La Rocca, M. Saveriano, and A. Del Prete, “VBOC: Learning the Viability Boundary of a Robot Manipulator using Optimal Control,” IEEE Robotics and Automation Letters, 2023.
- D. Q. Mayne, J. B. Rawlings, C. V. Rao, and P. O. Scokaert, “Constrained model predictive control: Stability and optimality,” Automatica, vol. 36, no. 6, pp. 789–814, 2000.
- E. C. Kerrigan and J. M. Maciejowski, “Soft constraints and exact penalty functions in model predictive control,” in UKACC International Conference (Control), 2000.
- M. N. Zeilinger, M. Morari, and C. N. Jones, “Soft constrained model predictive control with robust stability guarantees,” IEEE Transactions on Automatic Control, vol. 59, no. 5, pp. 1190–1202, 2014.
- N. Mansard, A. Del Prete, M. Geisert, S. Tonneau, and O. Stasse, “Using a Memory of Motion to Efficiently Warm-Start a Nonlinear Predictive Controller,” in IEEE International Conference on Robotics and Automation, 2018, pp. 2986–2993.
- G. Grandesso, E. Alboni, G. P. Papini, P. M. Wensing, and A. Del Prete, “CACTO: Continuous Actor-Critic With Trajectory Optimization - Towards Global Optimality,” IEEE Robotics and Automation Letters, vol. 8, no. 6, pp. 3318–3325, 2023.
- J. A. E. Andersson, J. Gillis, G. Horn, J. B. Rawlings, and M. Diehl, “CasADi – A software framework for nonlinear optimization and optimal control,” Mathematical Programming Computation, vol. 11, no. 1, pp. 1–36, 2019.
- R. Verschueren, G. Frison, D. Kouzoupis, J. Frey, N. van Duijkeren, A. Zanelli, B. Novoselnik, T. Albin, R. Quirynen, and M. Diehl, “Acados: a modular open-source framework for fast embedded optimal control,” Mathematical Programming Computation, vol. 14, pp. 147–183, 2019.
- M. Diehl, R. Findeisen, F. Allgöwer, H. G. Bock, and J. P. Schlöder, “Nominal stability of real-time iteration scheme for nonlinear model predictive control,” IEE Proceedings-Control Theory and Applications, vol. 152, no. 3, pp. 296–308, 2005.