Reinforcement Learning-based Receding Horizon Control using Adaptive Control Barrier Functions for Safety-Critical Systems (2403.17338v2)
Abstract: Optimal control methods provide solutions to safety-critical problems but easily become intractable. Control Barrier Functions (CBFs) have emerged as a popular technique that facilitates their solution by provably guaranteeing safety, through their forward invariance property, at the expense of some performance loss. This approach involves defining a performance objective alongside CBF-based safety constraints that must always be enforced. Unfortunately, both performance and solution feasibility can be significantly impacted by two key factors: (i) the selection of the cost function and associated parameters, and (ii) the calibration of parameters within the CBF-based constraints, which capture the trade-off between performance and conservativeness. %as well as infeasibility. To address these challenges, we propose a Reinforcement Learning (RL)-based Receding Horizon Control (RHC) approach leveraging Model Predictive Control (MPC) with CBFs (MPC-CBF). In particular, we parameterize our controller and use bilevel optimization, where RL is used to learn the optimal parameters while MPC computes the optimal control input. We validate our method by applying it to the challenging automated merging control problem for Connected and Automated Vehicles (CAVs) at conflicting roadways. Results demonstrate improved performance and a significant reduction in the number of infeasible cases compared to traditional heuristic approaches used for tuning CBF-based controllers, showcasing the effectiveness of the proposed method.
- Control barrier function based quadratic programs with application to adaptive cruise control. In 53rd IEEE Conf. on Decision and Control, pages 6271–6278. IEEE, 2014.
- Optimal control of connected automated vehicles with event-triggered control barrier functions: a test bed for safe optimal merging. In 2023 IEEE Conf. on Control Technology and Applications (CCTA), pages 321–326, 2023.
- Iterative convex optimization for model predictive control with discrete-time high-order control barrier functions. In 2023 American Control Conference (ACC), pages 3368–3375, 2023.
- Safety-critical model predictive control with discrete-time control barrier function. In 2021 American Control Conference (ACC), pages 3882–3889, 2021.
- Enhancing feasibility and safety of nonlinear model predictive control with discrete-time control barrier functions. In 2021 60th IEEE Conference on Decision and Control (CDC), pages 6137–6144, 2021.
- Adaptive control barrier functions. IEEE Trans. on Automatic Control, 67(5):2267–2281, 2022.
- Barriernet: Differentiable control barrier functions for learning of safe robot control. IEEE Transactions on Robotics, 39(3):2289–2307, 2023.
- D. A. Pomerleau. Alvinn: An autonomous land vehicle in a neural network. In D. Touretzky, editor, Advances in Neural Information Processing Systems, volume 1. Morgan-Kaufmann, 1988.
- S. Ross and D. Bagnell. Efficient reductions for imitation learning. In Yee Whye Teh and Mike Titterington, editors, Proc. of the Thirteenth International Conf. on Artificial Intelligence and Statistics, volume 9 of Proceedings of Machine Learning Research, pages 661–668, Chia Laguna Resort, Sardinia, Italy, 13–15 May 2010. PMLR.
- M. Zanon and S. Gros. Safe reinforcement learning using robust mpc. IEEE Transactions on Automatic Control, 66(8):3638–3652, 2021.
- Actor-critic model predictive control, 2024.
- A general framework for decentralized safe optimal control of connected and automated vehicles in multi-lane signal-free intersections. IEEE Transactions on Intelligent Transportation Systems, 23(10):17382–17396, 2022.
- J. Rios-Torres and A. A Malikopoulos. Automated and cooperative vehicle merging at highway on-ramps. IEEE Transactions on Intelligent Transportation Systems, 18(4):780–789, 2016.
- Decentralized optimal coordination of connected and automated vehicles for multiple traffic scenarios. Automatica, 117:108958, 2020.
- Control lyapunov functions and hybrid zero dynamics. In 2012 IEEE 51st IEEE Conference on Decision and Control (CDC), pages 6837–6842, 2012.
- W. Xiao and C. Belta. High-order control barrier functions. IEEE Transactions on Automatic Control, 67(7):3655–3662, 2022.
- Q. Nguyen and K. Sreenath. Exponential control barrier functions for enforcing high relative-degree safety-critical constraints. In 2016 American Control Conference (ACC), pages 322–328. IEEE, 2016.
- Control lyapunov functions and hybrid zero dynamics. In 2012 IEEE 51st IEEE Conference on Decision and Control (CDC), pages 6837–6842. IEEE, 2012.
- Continuous control with deep reinforcement learning. US Patent, 15(217,758), 2020.
- Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor. In International conference on machine learning, pages 1861–1870. PMLR, 2018.
- Playing atari with deep reinforcement learning. arXiv preprint arXiv:1312.5602, 2013.
- W. Xiao and C. G Cassandras. Decentralized optimal merging control for connected and automated vehicles on curved roads. In 2021 60th IEEE Conference on Decision and Control (CDC), pages 2677–2682. IEEE, 2021.
- K. Vogel. A comparison of headway and time to collision as safety indicators. Accident Analysis & Prevention, 35(3):427–433, 2003.
- CasADi – A software framework for nonlinear optimization and optimal control. Mathematical Programming Computation, 2018.
- Model predictive control of vehicles on urban roads for improved fuel economy. IEEE Trans. on Control Systems Technology, 21(3):831–841, 2012.
- Ehsan Sabouni (11 papers)
- H. M. Sabbir Ahmad (5 papers)
- Vittorio Giammarino (11 papers)
- Christos G. Cassandras (116 papers)
- Ioannis Ch. Paschalidis (66 papers)
- Wenchao Li (48 papers)