Meta Reinforcement Learning for Strategic IoT Deployments Coverage in Disaster-Response UAV Swarms (2401.11118v1)
Abstract: Over the past decade, Unmanned Aerial Vehicles (UAVs) have attracted the attention of researchers in academia and industry for their potential use in critical emergency applications, such as providing wireless services to ground users and collecting data from disaster-affected areas, owing to their maneuverability and flexible movement. However, the UAVs' limited resources, energy budget, and strict mission completion time make adopting them for these applications challenging. Our system model considers a UAV swarm that navigates an area and collects data from ground IoT devices, prioritizing service for strategic locations and allowing UAVs to join and leave the swarm dynamically (e.g., for recharging). In this work, we introduce an optimization model that minimizes the total energy consumption and provides the optimal path planning of the UAVs under constraints on minimum completion time and transmit power. The formulated optimization problem is NP-hard, which makes it unsuitable for real-time decision making. Therefore, we introduce a lightweight meta-reinforcement learning solution that can also cope with sudden changes in the environment through fast convergence. We conduct extensive simulations and compare our approach with three state-of-the-art learning models. Our simulation results show that the proposed approach outperforms these three algorithms in providing coverage to strategic locations, with fast convergence.