Federated Learning With Energy Harvesting Devices: An MDP Framework (2405.10513v1)
Abstract: Federated learning (FL) requires edge devices to perform local training and exchange information with a parameter server, leading to substantial energy consumption. A critical challenge in practical FL systems is the rapid energy depletion of battery-limited edge devices, which curtails their operational lifespan and degrades the learning performance. To address this issue, we apply energy harvesting techniques to FL systems so that edge devices can extract ambient energy and be powered continuously. We first establish a convergence bound for the wireless FL system with energy harvesting devices, showing that convergence is affected by partial device participation and packet drops, both of which depend on the energy supply. To accelerate convergence, we formulate a joint device scheduling and power control problem and model it as a Markov decision process (MDP). By solving this MDP, we derive the optimal transmission policy and demonstrate that it possesses a monotone structure with respect to the battery and channel states. To overcome the curse of dimensionality caused by the exponential complexity of computing the optimal policy, we propose a low-complexity algorithm that is asymptotically optimal as the number of devices increases. Furthermore, for the case where the channel and harvested energy statistics are unknown, we develop a structure-enhanced deep reinforcement learning algorithm that leverages the monotone structure of the optimal policy to improve the training performance. Finally, extensive numerical experiments on real-world datasets are presented to validate the theoretical results and corroborate the effectiveness of the proposed algorithms.
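To make the MDP view concrete, the following is a minimal illustrative sketch and not the formulation used in the paper. It assumes a single device with a small battery, a two-state i.i.d. channel, Bernoulli energy arrivals, a binary transmit/idle action, and a unit reward per transmission; every constant and function name here (`B`, `COST`, `P_GOOD`, `P_HARVEST`, `GAMMA`, `value_iteration`) is hypothetical. Standard value iteration is used to compute a policy over (battery, channel) states, which can then be inspected for a threshold-like shape.

```python
import itertools

# All constants below are hypothetical toy values, not taken from the paper.
B = 5                    # battery capacity in energy units
COST = {0: 2, 1: 1}      # energy needed to transmit on a bad (0) / good (1) channel
P_GOOD = 0.5             # probability that the next channel state is good (i.i.d.)
P_HARVEST = 0.4          # probability of harvesting one energy unit per slot
GAMMA = 0.9              # discount factor

STATES = list(itertools.product(range(B + 1), (0, 1)))  # (battery, channel)


def feasible(b, h):
    """Transmitting (a=1) is allowed only if the battery covers its cost."""
    return [0, 1] if b >= COST[h] else [0]


def q_value(V, b, h, a):
    """One-step reward plus discounted expected value of the next state."""
    reward = float(a)            # assumed: one unit of reward per transmission
    spent = a * COST[h]
    expected = 0.0
    for e, pe in ((0, 1 - P_HARVEST), (1, P_HARVEST)):
        nb = min(B, b - spent + e)          # battery is capped at capacity
        for nh, ph in ((0, 1 - P_GOOD), (1, P_GOOD)):
            expected += pe * ph * V[(nb, nh)]
    return reward + GAMMA * expected


def value_iteration(tol=1e-9, max_iter=10_000):
    """Iterate the Bellman optimality operator until the update is below tol."""
    V = {s: 0.0 for s in STATES}
    for _ in range(max_iter):
        new_V = {
            (b, h): max(q_value(V, b, h, a) for a in feasible(b, h))
            for b, h in STATES
        }
        delta = max(abs(new_V[s] - V[s]) for s in STATES)
        V = new_V
        if delta < tol:
            break
    # Greedy policy with respect to the converged value function.
    policy = {
        (b, h): max(feasible(b, h), key=lambda a: q_value(V, b, h, a))
        for b, h in STATES
    }
    return V, policy


if __name__ == "__main__":
    V, policy = value_iteration()
    for h in (0, 1):
        row = [policy[(b, h)] for b in range(B + 1)]
        print(f"channel={h}: action by battery level 0..{B} -> {row}")
```

Printing the policy row by row makes it easy to check whether, in this toy model, the transmit decision switches on once the battery exceeds some level for each channel state. The multi-device scheduling and power control problem in the paper has a much larger state space, which is what motivates the low-complexity and learning-based algorithms described in the abstract.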