Zero-Shot Transfer of Neural ODEs (2405.08954v2)
Abstract: Autonomous systems often encounter environments and scenarios beyond the scope of their training data, which underscores a critical challenge: the need to generalize and adapt to unseen scenarios in real time. This challenge necessitates new mathematical and algorithmic tools that enable adaptation and zero-shot transfer. To this end, we leverage the theory of function encoders, which enables zero-shot transfer by combining the flexibility of neural networks with the mathematical principles of Hilbert spaces. Using this theory, we first present a method for learning a space of dynamics spanned by a set of neural ODE basis functions. After training, the proposed approach can rapidly identify dynamics in the learned space using an efficient inner product calculation. Critically, this calculation requires no gradient calculations or retraining during the online phase. This method enables zero-shot transfer for autonomous systems at runtime and opens the door for a new class of adaptable control algorithms. We demonstrate state-of-the-art system modeling accuracy for two MuJoCo robot environments and show that the learned models can be used for more efficient MPC control of a quadrotor.
- Representation learning: A review and new perspectives. IEEE Transactions on Pattern Analysis and Machine Intelligence, 35(8):1798–1828, 2013.
- Koopman operator dynamical models: Learning, analysis and control. Annual Reviews in Control, 52:197–212, 2021.
- D. Broomhead and D. Lowe. Radial basis functions, multi-variable functional interpolation and adaptive networks. Royal Signals and Radar Establishment Malvern (United Kingdom), RSRE-MEMO-4148, 03 1988.
- Discovering governing equations from data by sparse identification of nonlinear dynamical systems. Proceedings of the National Academy of Sciences, 113(15):3932–3937, 2016.
- Neural ordinary differential equations. In Advances in Neural Information Processing Systems, volume 31, 2018.
- Volterra-series-based nonlinear system modeling and its engineering applications: A state-of-the-art review. Mechanical Systems and Signal Processing,, 87A:340–364, 2017.
- S. Dehuri and S.-B. Cho. A comprehensive survey on functional link neural networks and an adaptive PSO–BP learning for CFLNN. Neural Computing and Applications, 19(2):187–205, 2010.
- Taylor-lagrange neural ordinary differential equations: Toward fast training and evaluation of neural odes. In IJCAI, 2022a.
- Neural networks with physics-informed architectures and constraints for dynamical systems modeling. In L4DC, 2022b.
- How to learn and generalize from three minutes of data: Physics-constrained and uncertainty-aware neural stochastic differential equations. In CoRL, 2023.
- Augmented neural odes. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
- Stochastic continuous normalizing flows: training SDEs as ODEs. In Proceedings of the Thirty-Seventh Conference on Uncertainty in Artificial Intelligence, volume 161 of Proceedings of Machine Learning Research, pages 1130–1140. PMLR, 27–30 Jul 2021.
- Zero-shot reinforcement learning via function encoders. In ICML. ICML, 2024.
- SINDy-PI: A robust algorithm for parallel implicit sparse identification of nonlinear dynamics. CoRR, abs/2004.02322, 2020.
- Learning differential equations that are easy to solve. In H. Larochelle, M. Ranzato, R. Hadsell, M. Balcan, and H. Lin, editors, Advances in Neural Information Processing Systems, volume 33, pages 4370–4380. Curran Associates, Inc., 2020.
- Neural controlled differential equations for irregular time series. In Advances in Neural Information Processing Systems, volume 33, pages 6696–6707, 2020.
- K. Lee and E. J. Parish. Parameterized neural ordinary differential equations: Applications to computational physics problems. Proceedings of the Royal Society A, 477(2253), 2021.
- Extended dynamic mode decomposition with dictionary learning: A data-driven adaptive spectral decomposition of the Koopman operator. Chaos: An Interdisciplinary Journal of Nonlinear Science, 27(10):103111, 2017.
- Neural SDE: stabilizing neural ODE networks with stochastic noise. CoRR, 2019.
- Beyond finite layer neural networks: Bridging deep architectures and numerical differential equations. In ICML, 2018.
- J. Patra and A. Kot. Nonlinear dynamic system identification using Chebyshev functional link artificial neural networks. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 32(4):505–511, 2002.
- Multistep neural networks for data-driven discovery of nonlinear dynamical systems, 2018.
- B. Schölkopf and A. J. Smola. Learning with Kernels: support vector machines, regularization, optimization, and beyond. Adaptive computation and machine learning series. MIT Press, 2002.
- J. N. K. Steven L. Brunton, Joshua L. Proctor. Sparse identification of nonlinear dynamics with control (SINDYc). IFAC-PapersOnLine, Volume 49, Issue 18:710–715, 2016.
- MuJoCo: A physics engine for model-based control. In 2012 IEEE/RSJ International Conference on Intelligent Robots and Systems, pages 5026–5033, 2012.
- Gaussian processes for machine learning. MIT press Cambridge, MA, 2006.
- An orthogonal neural network for function approximation. IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics), 26(5):779–785, 1996.
- Safe-control-gym: A unified benchmark suite for safe learning-based control and reinforcement learning in robotics. IEEE Robotics and Automation Letters, 7(4):11142–11149, 2022.