Age Minimization in Massive IoT via UAV Swarm: A Multi-agent Reinforcement Learning Approach (2309.14757v1)
Abstract: In many massive IoT communication scenarios, the IoT devices require coverage from mobile units that can move close to them and thereby reduce their uplink energy consumption. A robust solution is to deploy a large number of UAVs (a UAV swarm) to provide coverage and a better line of sight (LoS) for the IoT network. However, studying massive IoT scenarios with a large number of serving units leads to high-dimensional problems of high complexity. In this paper, we apply multi-agent deep reinforcement learning to address the high-dimensional problem that results from deploying a swarm of UAVs to collect fresh information from IoT devices. The objective is to minimize the overall age of information in the IoT network. The results show that both cooperative and partially cooperative multi-agent deep reinforcement learning approaches outperform the high-complexity centralized deep reinforcement learning approach, which cannot cope with large-scale networks.