MADRL-based UAVs Trajectory Design with Anti-Collision Mechanism in Vehicular Networks (2402.03342v1)
Abstract: In upcoming 6G networks, unmanned aerial vehicles (UAVs) are expected to play a fundamental role by acting as mobile base stations, particularly for demanding vehicle-to-everything (V2X) applications. In this scenario, one of the most challenging problems is the design of trajectories for multiple UAVs, cooperatively serving the same area. Such joint trajectory design can be performed using multi-agent deep reinforcement learning (MADRL) algorithms, but ensuring collision-free paths among UAVs becomes a critical challenge. Traditional methods involve imposing high penalties during training to discourage unsafe conditions, but these can be proven to be ineffective, whereas binary masks can be used to restrict unsafe actions, but naively applying them to all agents can lead to suboptimal solutions and inefficiencies. To address these issues, we propose a rank-based binary masking approach. Higher-ranked UAVs move optimally, while lower-ranked UAVs use this information to define improved binary masks, reducing the number of unsafe actions. This approach allows to obtain a good trade-off between exploration and exploitation, resulting in enhanced training performance, while maintaining safety constraints.
- A. I. Hentati and L. C. Fourati, “Comprehensive survey of UAVs communication networks,” Computer Standards and Interfaces, vol. 72, p. 103451, 2020.
- S. Mignardi, D. Ferretti, R. Marini, F. Conserva, S. Bartoletti, R. Verdone, and C. Buratti, “Optimizing beam selection and resource allocation in UAV-aided vehicular networks,” in EuCNC/6G Summit 2022, 2022, pp. 184–189.
- B. M. Masini, A. Bazzi, and A. Zanella, “A survey on the roadmap to mandate on board connectivity and enable V2V-based vehicular sensor networks,” Sensors, vol. 18, no. 7, 2018.
- 5GAA, “A visionary roadmap for advanced driving use cases, connectivity technologies, and radio spectrum needs,” White Paper, Sep. 2020.
- J. Snape, J. V. D. Berg, S. J. Guy, and D. Manocha, “The hybrid reciprocal velocity obstacle,” IEEE Trans. Robot., vol. 27, no. 4, pp. 696–706, Apr. 2011.
- S. Huang and K. H. Low, “A path planning algorithm for smooth trajectories of unmanned aerial vehicles via potential fields,” in Int. Conf. Control Autom. Robot. Vis. (ICARCV), Singapore, Nov. 2018, pp. 1677–1684.
- B. Song, Z. Wang, L. Zou, L. Xu, and F. E. Alsaadi, “A new approach to smooth global path planning of mobile robots with kinematic constraints,” Int. J. Mach. Learn. Cybern., vol. 10, no. 1, pp. 107–119, Jan. 2019.
- L. Spampinato, A. Tarozzi, C. Buratti, and R. Marini, “DRL path planning for UAV-aided V2X networks: comparing discrete to continuous action spaces,” in ICASSP 2023, 2023, pp. 1–5.
- R. Marini, L. Spampinato, S. Mignardi, R. Verdone, and C. Buratti, “Reinforcement learning-based trajectory planning for UAV-aided vehicular communications,” in EUSIPCO 2022, 2022, pp. 967–971.
- R. Marini, S. Park, O. Simeone, and C. Buratti, “Continual meta-reinforcement learning for UAV-aided vehicular wireless networks,” in ICC 2023 - IEEE International Conference on Communications, 2023, pp. 5664–5669.
- E. Testi, E. Favarelli, and A. Giorgetti, “Reinforcement learning for connected autonomous vehicle localization via UAVs,” in IEEE Int. Workshop Metrol. Agric. For. (MetroAgriFor), Trento, Italy, Nov. 2020, pp. 13–17.
- A. Guerra, F. Guidi, D. Dardari, and P. M. Djuric, “Reinforcement learning for joint detection and mapping using dynamic UAV networks,” IEEE Trans. Aerosp. Electron. Syst., pp. 1–16, Aug. 2023.
- X. Wang and M. C. Gursoy, “Learning-based uav trajectory optimization with collision avoidance and connectivity constraints,” IEEE Transactions on Wireless Communications, vol. 21, no. 6, pp. 4350–4363, 2022.
- S. Xu, X. Zhang, C. Li, D. Wang, and L. Yang, “Deep reinforcement learning approach for joint trajectory design in multi-UAV IoT networks,” IEEE Transactions on Vehicular Technology, vol. 71, no. 3, pp. 3389–3394, Mar. 2022.
- N. Thumiger and M. Deghat, “A multi-agent deep reinforcement learning approach for practical decentralized UAV collision avoidance,” IEEE Control System Letters, vol. 6, pp. 2174–2179, Dec. 2022.
- H. Qie, D. Shi, T. Shen, X. Xu, Y. Li, and L. Wang, “Joint optimization of multi-UAV target assignment and path planning based on multi-agent reinforcement learning,” IEEE Access, vol. 7, pp. 146 264–146 272, Sep. 2019.
- P. A. L. et al., “Microscopic traffic simulation using SUMO,” 2019 IEEE ITSC, pp. 2575–2582, November 2018.
- 3GPP, “Technical specification group radio access network; study on channel model for frequencies from 0.5 to 100 GHz,” TR 38 901 version 16.1.0, Dec. 2019.
- Leonardo Spampinato (1 paper)
- Enrico Testi (5 papers)
- Chiara Buratti (5 papers)
- Riccardo Marini (4 papers)