2000 character limit reached
Online Control of Linear Systems under Unbounded Noise (2402.10252v2)
Published 15 Feb 2024 in eess.SY, cs.LG, cs.SY, math.OC, and stat.ML
Abstract: This paper investigates the problem of controlling a linear system under possibly unbounded stochastic noise with unknown convex cost functions, known as an online control problem. In contrast to the existing work, which assumes the boundedness of noise, we show that an $ \tilde{O}(\sqrt{T}) $ high-probability regret can be achieved under unbounded noise, where $ T $ denotes the time horizon. Notably, the noise is only required to have a finite fourth moment. Moreover, when the costs are strongly convex and the noise is sub-Gaussian, we establish an $ O({\rm poly} (\log T)) $ regret bound.
- Tracking adversarial targets. In International Conference on Machine Learning, pp. 369–377. PMLR, 2014.
- Online control with adversarial disturbances. In International Conference on Machine Learning, pp. 111–119. PMLR, 2019a.
- Logarithmic regret for online control. Advances in Neural Information Processing Systems, 32:10175–10184, 2019b.
- Online learning for adversaries with memory: Price of past mistakes. Advances in Neural Information Processing Systems, 28:784–792, 2015.
- Convex Optimization. Cambridge University Press, 2004.
- Logarithmic regret for learning linear quadratic regulators efficiently. In International Conference on Machine Learning, pp. 1328–1337. PMLR, 2020.
- Online linear quadratic control. In International Conference on Machine Learning, pp. 1029–1038. PMLR, 2018.
- Logarithmic regret for adversarial online control. In International Conference on Machine Learning, pp. 3211–3221. PMLR, 2020.
- Regret-optimal estimation and control. IEEE Transactions on Automatic Control, 68(5):3041–3053, 2023.
- Hazan, E. Introduction to online convex optimization. Foundations and Trends® in Optimization, 2(3-4):157–325, 2016.
- Logarithmic regret algorithms for online convex optimization. Machine Learning, 69:169–192, 2007.
- The nonstochastic control problem. In Proceedings of the 31st International Conference on Algorithmic Learning Theory, pp. 408–421. PMLR, 2020.
- Stochastic model-based assessment of power systems subject to extreme wind power fluctuation. SICE Journal of Control, Measurement, and System Integration, 14(1):67–77, 2021.
- Stable process approach to analysis of systems under heavy-tailed noise: Modeling and stochastic linearization. IEEE Transactions on Automatic Control, 64(4):1344–1357, 2019.
- Klenke, A. Probability Theory: A Comprehensive Course. Springer Cham, third edition, 2020.
- Stochastic control of light UAV at landing with the aid of bearing-only observations. In Eighth International Conference on Machine Vision, volume 9875, pp. 474–483. SPIE, 2015.
- Orabona, F. A modern introduction to online learning. arXiv preprint arXiv:1912.13213, 2023.
- Uncertainty quantication and control during mars powered descent and landing using covariance steering. In 2018 AIAA Guidance, Navigation, and Control Conference, pp. 0611, 2018.
- Non-Gaussian power grid frequency fluctuations characterized by Lévy-stable laws and superstatistics. Nature Energy, 3(2):119–126, 2018.
- Naive exploration is optimal for online LQR. In International Conference on Machine Learning, pp. 8937–8948. PMLR, 2020.
- Improper learning for non-stochastic control. In Conference on Learning Theory, pp. 3320–3436. PMLR, 2020.