Learning-Augmented Decentralized Online Convex Optimization in Networks (2306.10158v3)
Abstract: This paper studies decentralized online convex optimization in a networked multi-agent system and proposes a novel algorithm, Learning-Augmented Decentralized Online optimization (LADO), for individual agents to select actions only based on local online information. LADO leverages a baseline policy to safeguard online actions for worst-case robustness guarantees, while staying close to the ML policy for average performance improvement. In stark contrast with the existing learning-augmented online algorithms that focus on centralized settings, LADO achieves strong robustness guarantees in a decentralized setting. We also prove the average cost bound for LADO, revealing the tradeoff between average performance and worst-case robustness and demonstrating the advantage of training the ML policy by explicitly considering the robustness requirement.
- Decentralized voltage control of power systems using multi-agent systems. Journal of Modern Power Systems and Clean Energy, 8(2):249–259, 2020.
- Stability constrained reinforcement learning for real-time voltage control. arXiv preprint arXiv:2109.14854, 2021.
- Online distributed convex optimization on dynamic networks. IEEE Transactions on Automatic Control, 61(11):3545–3550, 2016.
- The computational sprinting game. In ASPLOS, 2016.
- Single and multi-agent deep reinforcement learning for ai-enabled wireless networks: A tutorial. IEEE Communications Surveys & Tutorials, 23(2):1226–1252, 2021.
- Multi-agent deep reinforcement learning for dynamic power allocation in wireless networks. IEEE Journal on Selected Areas in Communications, 37(10):2239–2250, 2019.
- A collaborative multi-agent reinforcement learning anti-jamming algorithm in wireless networks. IEEE wireless communications letters, 8(4):1024–1027, 2019.
- Clearance pricing optimization for a fast-fashion retailer. Operations research, 60(6):1404–1422, 2012.
- Optimal pricing in networks with externalities. Operations Research, 60(4):883–905, 2012.
- A saddle point algorithm for networked online convex optimization. IEEE Transactions on Signal Processing, 63(19):5149–5164, 2015.
- Decentralized online convex optimization based on signs of relative states. Automatica, 129:109676, 2021.
- Collaborative deep reinforcement learning. arXiv preprint arXiv:1702.05796, 2017.
- Decentralized online convex optimization with feedback delays. IEEE Transactions on Automatic Control, 67(6):2889–2904, 2021.
- Trading regret for efficiency: Online convex optimization with long term constraints. J. Mach. Learn. Res., 13(1):2503–2528, sep 2012.
- An online algorithm for smoothed online convex optimization. SIGMETRICS Perform. Eval. Rev., 47(2):6–8, December 2019.
- Beyond online balanced descent: An optimal algorithm for smoothed online optimization. In NeurIPS, volume 32, 2019.
- Revisiting smoothed online learning. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems, 2021.
- Online optimization with memory and competitive control. Advances in Neural Information Processing Systems, 33:20636–20647, 2020.
- Online optimization with feedback delay and nonlinear switching cost. Proc. ACM Meas. Anal. Comput. Syst., 6(1), feb 2022.
- Smoothed online convex optimization in high dimensions via online balanced descent. In COLT, 2018.
- Distributed online convex optimization with an aggregative variable. IEEE Transactions on Control of Network Systems, 2021.
- A new dog learns old tricks: RL finds classic optimization algorithms. In ICLR, 2019.
- Exploratory combinatorial optimization with reinforcement learning. In AAAI, 2020.
- Reles: A neural adaptive multipath scheduler based on deep reinforcement learning. In INFOCOM, 2019.
- Learning for robust combinatorial optimization: Algorithm and application. In INFOCOM, 2022.
- Fully decentralized multi-agent reinforcement learning with networked agents. In International Conference on Machine Learning, pages 5872–5881. PMLR, 2018.
- Multi-agent reinforcement learning: A selective overview of theories and algorithms. Handbook of Reinforcement Learning and Control, pages 321–384, 2021.
- A review of cooperative multi-agent deep reinforcement learning. Applied Intelligence, pages 1–46, 2022.
- Robustified learning for online optimization with memory costs. In INFOCOM, 2023.
- Chasing convex bodies and functions with black-box advice. In COLT, 2022.
- Online algorithms with advice: A survey. SIGACT News, 47(3):93–129, August 2016.
- Optimal robustness-consistency trade-offs for learning-augmented online algorithms. In NeurIPS, 2020.
- The primal-dual method for learning augmented algorithms. Advances in Neural Information Processing Systems, 33:20083–20094, 2020.
- Competitive caching with machine learned advice. J. ACM, 68(4), July 2021.
- Distributed online convex optimization with compressed communication. In S. Koyejo, S. Mohamed, A. Agarwal, D. Belgrave, K. Cho, and A. Oh, editors, Advances in Neural Information Processing Systems, volume 35, pages 34492–34504. Curran Associates, Inc., 2022.
- Decentralized online convex optimization in networked systems. In Kamalika Chaudhuri, Stefanie Jegelka, Le Song, Csaba Szepesvari, Gang Niu, and Sivan Sabato, editors, Proceedings of the 39th International Conference on Machine Learning, volume 162 of Proceedings of Machine Learning Research, pages 13356–13393. PMLR, 17–23 Jul 2022.
- Conservative bandits. In International Conference on Machine Learning, pages 1254–1262. PMLR, 2016.
- A reduction-based framework for conservative bandits and reinforcement learning. In International Conference on Learning Representations, 2021.
- Conservative exploration in reinforcement learning. In Silvia Chiappa and Roberto Calandra, editors, Proceedings of the Twenty Third International Conference on Artificial Intelligence and Statistics, volume 108 of Proceedings of Machine Learning Research, pages 1431–1441. PMLR, 26–28 Aug 2020.
- Dynamic right-sizing for power-proportional data centers. In INFOCOM, 2011.
- Exploiting spatio-temporal diversity for water saving in geo-distributed data centers. IEEE Transactions on Cloud Computing, 6(3):734–746, 2018.
- Meta. Sustainability report, 2021, https://sustainability.fb.com/.
- Beyond online balanced descent: An optimal algorithm for smoothed online optimization. Advances in Neural Information Processing Systems, 32, 2019.
- Control regularization for reduced variance reinforcement learning. In Kamalika Chaudhuri and Ruslan Salakhutdinov, editors, Proceedings of the 36th International Conference on Machine Learning, volume 97 of Proceedings of Machine Learning Research, pages 1141–1150. PMLR, 09–15 Jun 2019.
- Smooth imitation learning for online sequence prediction. In ICML, 2016.
- A reduction-based framework for conservative bandits and reinforcement learning. In International Conference on Learning Representations, 2022.
- Robust learning-augmented caching: An experimental study. In ICML, 2021.
- A regression approach to learning-augmented online algorithms. In A. Beygelzimer, Y. Dauphin, P. Liang, and J. Wortman Vaughan, editors, Advances in Neural Information Processing Systems, 2021.
- Improving online algorithms via ml predictions. In NeurIPS, 2018.
- Learning robust algorithms for online allocation problems using adversarial training. In https://arxiv.org/abs/2010.08418, 2020.
- Smoothed online optimization with unreliable predictions. Proc. ACM Meas. Anal. Comput. Syst., 7(1), mar 2023.
- Online metric algorithms with untrusted predictions. In ICML, 2020.
- Bounded-regret MPC via perturbation analysis: Prediction error, constraints, and nonlinearity. In Alice H. Oh, Alekh Agarwal, Danielle Belgrave, and Kyunghyun Cho, editors, Advances in Neural Information Processing Systems, 2022.
- Differentiable convex optimization layers. In H. Wallach, H. Larochelle, A. Beygelzimer, F. d'Alché-Buc, E. Fox, and R. Garnett, editors, Advances in Neural Information Processing Systems, volume 32. Curran Associates, Inc., 2019.
- Carbon-aware computing for datacenters. IEEE Transactions on Power Systems, 38(2):1270–1280, 2023.
- U.S. Department of Engergy. What is environmental justice?, https://www.energy.gov/lm/what-environmental-justice.
- Balanced control strategies for interconnected heterogeneous battery systems. IEEE Transactions on Sustainable Energy, 7(1):189–199, 2016.
- Resource central: Understanding and predicting workloads for improved resource management in large cloud platforms. In Proceedings of the 26th Symposium on Operating Systems Principles, pages 153–167, 2017.
- The national solar radiation data base (nsrdb). Renewable and sustainable energy reviews, 89:51–60, 2018.
- Photovoltaic and solar power forecasting for smart grid energy management. CSEE Journal of Power and Energy Systems, 1(4):38–46, 2015.
- Wind turbine blade efficiency and power calculation with electrical analogy. International Journal of Scientific and Research Publications, 2(2):1–5, 2012.
- Convex optimization and approximation. https://ee227c.github.io/notes/ee227c-notes.pdf, 2018.