MPC of Uncertain Nonlinear Systems with Meta-Learning for Fast Adaptation of Neural Predictive Models (2404.12097v1)
Abstract: In this paper, we consider the problem of reference tracking in uncertain nonlinear systems. A neural State-Space Model (NSSM) is used to approximate the nonlinear system, where a deep encoder network learns the nonlinearity from data, and a state-space component captures the temporal relationship. This transforms the nonlinear system into a linear system in a latent space, enabling the application of model predictive control (MPC) to determine effective control actions. Our objective is to design the optimal controller using limited data from the \textit{target system} (the system of interest). To this end, we employ an implicit model-agnostic meta-learning (iMAML) framework that leverages information from \textit{source systems} (systems that share similarities with the target system) to expedite training in the target system and enhance its control performance. The framework consists of two phases: the (offine) meta-training phase learns a aggregated NSSM using data from source systems, and the (online) meta-inference phase quickly adapts this aggregated model to the target system using only a few data points and few online training iterations, based on local loss function gradients. The iMAML algorithm exploits the implicit function theorem to exactly compute the gradient during training, without relying on the entire optimization path. By focusing solely on the optimal solution, rather than the path, we can meta-train with less storage complexity and fewer approximations than other contemporary meta-learning algorithms. We demonstrate through numerical examples that our proposed method can yield accurate predictive models by adaptation, resulting in a downstream MPC that outperforms several baselines.
- F. Lin and R. D. Brandt, “An optimal control approach to robust control of robot manipulators,” IEEE Transactions on Robotics and Automation, vol. 14, no. 1, pp. 69–77, 1998.
- H. Pan and M. Xin, “Nonlinear robust and optimal control of robot manipulators,” Nonlinear Dynamics, vol. 76, pp. 237–254, 2014.
- D. L. Pepyne and C. G. Cassandras, “Optimal control of hybrid systems in manufacturing,” Proceedings of the IEEE, vol. 88, no. 7, pp. 1108–1123, 2000.
- T. Dierks and S. Jagannathan, “Online optimal control of affine nonlinear discrete-time systems with unknown internal dynamics by using time-based policy update,” IEEE Transactions on Neural Networks and Learning Systems, vol. 23, no. 7, pp. 1118–1129, 2012.
- H. Modares, F. L. Lewis, W. Kang, and A. Davoudi, “Optimal synchronization of heterogeneous nonlinear systems with unknown dynamics,” IEEE Transactions on Automatic Control, vol. 63, no. 1, pp. 117–131, 2017.
- B. Zhao, D. Liu, and C. Luo, “Reinforcement learning-based optimal stabilization for unknown nonlinear systems subject to inputs with uncertain constraints,” IEEE Transactions on Neural Networks and Learning Systems, vol. 31, no. 10, pp. 4330–4340, 2019.
- A. Chakrabarty, G. Wichern, and C. R. Laughman, “Meta-learning of neural state-space models using data from similar systems,” IFAC-PapersOnLine, vol. 56, no. 2, pp. 1490–1495, 2023.
- C. Legaard, T. Schranz, G. Schweiger, J. Drgoňa, B. Falay, C. Gomes, A. Iosifidis, M. Abkar, and P. Larsen, “Constructing neural network based models for simulating dynamical systems,” ACM Computing Surveys, vol. 55, no. 11, pp. 1–34, 2023.
- D. Masti and A. Bemporad, “Learning nonlinear state–space models using autoencoders,” Automatica, vol. 129, p. 109666, 2021.
- J. A. Suykens, B. L. De Moor, and J. Vandewalle, “Nonlinear system identification using neural state space models, applicable to robust control design,” International Journal of Control, vol. 62, no. 1, pp. 129–152, 1995.
- B. O. Koopman and J. v. Neumann, “Dynamical systems of continuous spectra,” Proceedings of the National Academy of Sciences, vol. 18, no. 3, pp. 255–263, 1932.
- M. Korda and I. Mezić, “Linear predictors for nonlinear dynamical systems: Koopman operator meets model predictive control,” Automatica, vol. 93, pp. 149–160, 2018.
- L. Xin, L. Ye, G. Chiu, and S. Sundaram, “Identifying the dynamics of a system by leveraging data from similar systems,” in 2022 American Control Conference (ACC). IEEE, 2022, pp. 818–824.
- A. Chakrabarty, “Optimizing closed-loop performance with data from similar systems: A bayesian meta-learning approach,” in 2022 IEEE 61st Conference on Decision and Control (CDC). IEEE, 2022, pp. 130–136.
- L. Li, C. De Persis, P. Tesi, and N. Monshizadeh, “Data-based transfer stabilization in linear systems,” IEEE Transactions on Automatic Control, 2023.
- V. Dumoulin, N. Houlsby, U. Evci, X. Zhai, R. Goroshin, S. Gelly, and H. Larochelle, “Comparing transfer and meta learning approaches on a unified few-shot classification benchmark,” arXiv preprint arXiv:2104.02638, 2021.
- L. F. Toso, D. Zhan, J. Anderson, and H. Wang, “Meta-learning linear quadratic regulators: A policy gradient maml approach for the model-free lqr,” arXiv preprint arXiv:2401.14534, 2024.
- S. Zhan, G. Wichern, C. Laughman, A. Chong, and A. Chakrabarty, “Calibrating building simulation models using multi-source datasets and meta-learned bayesian optimization,” Energy and Buildings, vol. 270, p. 112278, 2022.
- R. Kaushik, T. Anne, and J.-B. Mouret, “Fast online adaptation in robotics through meta-learning embeddings of simulated priors,” in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2020, pp. 5269–5276.
- X. Song, Y. Yang, K. Choromanski, K. Caluwaerts, W. Gao, C. Finn, and J. Tan, “Rapidly adaptable legged robots via evolutionary meta-learning,” in 2020 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS). IEEE, 2020, pp. 3769–3776.
- A. Rajeswaran, C. Finn, S. M. Kakade, and S. Levine, “Meta-learning with implicit gradients,” Advances in neural information processing systems, vol. 32, 2019.
- C. E. Garcia, D. M. Prett, and M. Morari, “Model predictive control: Theory and practice—a survey,” Automatica, vol. 25, no. 3, pp. 335–348, 1989.
- C. Finn, P. Abbeel, and S. Levine, “Model-agnostic meta-learning for fast adaptation of deep networks,” in International conference on machine learning. PMLR, 2017, pp. 1126–1135.
- G. Brockman, V. Cheung, L. Pettersson, J. Schneider, J. Schulman, J. Tang, and W. Zaremba, “Openai gym,” arXiv preprint arXiv:1606.01540, 2016.
Paper Prompts
Sign up for free to create and run prompts on this paper using GPT-5.
Top Community Prompts
Collections
Sign up for free to add this paper to one or more collections.