Controller Adaptation via Learning Solutions of Contextual Bayesian Optimization (2403.04881v2)
Abstract: In this work, we propose a framework for adapting the controller's parameters based on learning optimal solutions from contextual black-box optimization problems. We consider a class of control design problems for dynamical systems operating in different environments or conditions represented by contextual parameters. The overarching goal is to identify the controller parameters that maximize the controlled system's performance, given different realizations of the contextual parameters.We formulate a contextual Bayesian optimization problem in which the solution is actively learned using Gaussian processes to approximate the controller adaptation strategy. We demonstrate the efficacy of the proposed framework with a sim-to-real example. We learn the optimal weighting strategy of a model predictive control for connected and automated vehicles interacting with human-driven vehicles from simulations and then deploy it in a real-time experiment.
- M. Mehndiratta, E. Camci, and E. Kayacan, “Automated tuning of nonlinear model predictive controller by reinforcement learning,” in 2018 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2018, pp. 3016–3021.
- S. Gros and M. Zanon, “Reinforcement learning based on mpc and the stochastic policy gradient method,” in 2021 American Control Conference (ACC). IEEE, 2021, pp. 1947–1952.
- M. Neumann-Brosig, A. Marco, D. Schwarzmann, and S. Trimpe, “Data-efficient autotuning with bayesian optimization: An industrial control study,” IEEE Transactions on Control Systems Technology, vol. 28, no. 3, pp. 730–740, 2019.
- M. Schillinger, B. Hartmann, P. Skalecki, M. Meister, D. Nguyen-Tuong, and O. Nelles, “Safe active learning and safe bayesian optimization for tuning a pi-controller,” IFAC-PapersOnLine, vol. 50, no. 1, pp. 5967–5972, 2017, 20th IFAC World Congress.
- J. A. Paulson and A. Mesbah, “Data-driven scenario optimization for automated controller tuning with probabilistic performance guarantees,” IEEE Control Systems Letters, vol. 5, no. 4, pp. 1477–1482, 2020.
- A. A. Malikopoulos, P. Y. Papalambros, and D. N. Assanis, “Online identification and stochastic control for autonomous internal combustion engines,” Journal of Dynamic Systems, Measurement, and Control, vol. 132, no. 2, pp. 024 504–024 504, 2010.
- M. Menner, K. Berntorp, and S. Di Cairano, “Automated controller calibration by kalman filtering,” IEEE Transactions on Control Systems Technology, 2023.
- J. P. Allamaa, P. Patrinos, H. Van der Auweraer, and T. D. Son, “Sim2real for autonomous vehicle control using executable digital twin,” IFAC-PapersOnLine, vol. 55, no. 24, pp. 385–391, 2022, 10th IFAC Symposium on Advances in Automotive Control AAC 2022.
- B. Bischoff, D. Nguyen-Tuong, T. Koller, H. Markert, and A. Knoll, “Learning throttle valve control using policy search,” in Machine Learning and Knowledge Discovery in Databases. Springer Berlin Heidelberg, 2013, pp. 49–64.
- A. A. Malikopoulos, P. Papalambros, and D. Assanis, “Optimal engine calibration for individual driving styles,” in SAE Congress, 2008.
- V.-A. Le and A. A. Malikopoulos, “A cooperative optimal control framework for connected and automated vehicles in mixed traffic using social value orientation,” in 2022 61th IEEE Conference on Decision and Control (CDC), 2022, pp. 6272–6277.
- ——, “Optimal weight adaptation of model predictive control for connected and automated vehicles in mixed traffic with Bayesian optimization,” in 2023 American Control Conference (ACC). IEEE, 2023, pp. 1183–1188.
- A. A. Malikopoulos and L. Zhao, “A closed-form analytical solution for optimal coordination of connected and automated vehicles,” in 2019 American Control Conference (ACC). IEEE, 2019, pp. 3599–3604.
- A. Krause and C. Ong, “Contextual gaussian process bandit optimization,” Advances in neural information processing systems, vol. 24, 2011.
- F. Berkenkamp, A. Krause, and A. P. Schoellig, “Bayesian optimization with safety constraints: safe and automatic parameter tuning in robotics,” Machine Learning, vol. 112, no. 10, pp. 3713–3747, 2023.
- L. P. Fröhlich, C. Küttel, E. Arcari, L. Hewing, M. N. Zeilinger, and A. Carron, “Contextual tuning of model predictive control for autonomous racing,” in 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2022, pp. 10 555–10 562.
- W. Xu, B. Svetozarevic, L. Di Natale, P. Heer, and C. N. Jones, “Data-driven adaptive building thermal controller tuning with constraints: A primal–dual contextual bayesian optimization approach,” Applied Energy, vol. 358, p. 122493, 2024.
- D. R. Jones, M. Schonlau, and W. J. Welch, “Efficient global optimization of expensive black-box functions,” Journal of Global optimization, vol. 13, no. 4, pp. 455–492, 1998.
- N. Srinivas, A. Krause, S. M. Kakade, and M. W. Seeger, “Information-theoretic regret bounds for gaussian process optimization in the bandit setting,” IEEE transactions on information theory, vol. 58, no. 5, pp. 3250–3265, 2012.
- J. Kabzan, L. Hewing, A. Liniger, and M. N. Zeilinger, “Learning-based model predictive control for autonomous racing,” IEEE Robotics and Automation Letters, vol. 4, no. 4, pp. 3363–3370, 2019.
- B. Chalaki, L. E. Beaver, A. M. I. Mahbub, H. Bang, and A. A. Malikopoulos, “A research and educational robotic testbed for real-time control of emerging mobility systems: From theory to scaled experiments,” IEEE Control Systems Magazine, vol. 42, no. 6, pp. 20–34, 2022.