Intent-Aware DRL-Based NOMA Uplink Dynamic Scheduler for IIoT (2403.18364v2)
Abstract: We investigate the problem of supporting Industrial Internet of Things user equipment (IIoT UEs) with intent (i.e., requested quality of service (QoS)) and random traffic arrival. A deep reinforcement learning (DRL) based centralized dynamic scheduler for time-frequency resources is proposed to learn how to schedule the available communication resources among the IIoT UEs. The proposed scheduler leverages an RL framework to adapt to the dynamic changes in the wireless communication system and traffic arrivals. Moreover, a graph-based reduction scheme is proposed to reduce the state and action space of the RL framework to allow fast convergence and a better learning strategy. Simulation results demonstrate the effectiveness of the proposed intelligent scheduler in guaranteeing the expressed intent of IIoT UEs compared to several traditional scheduling schemes, such as round-robin, semi-static, and heuristic approaches. The proposed scheduler also outperforms the contention-free and contention-based schemes in maximizing the number of successfully computed tasks.
- Academic Press, 2020.
- M. H. C. Garcia, A. Molina-Galan, M. Boban, J. Gozalvez, B. Coll-Perales, T. Şahin, and A. Kousaridas, “A tutorial on 5g nr v2x communications,” IEEE Communications Surveys & Tutorials, vol. 23, no. 3, pp. 1972–2026, 2021.
- J. Navarro-Ortiz, P. Romero-Diaz, S. Sendra, P. Ameigeiras, J. J. Ramos-Munoz, and J. M. Lopez-Soler, “A survey on 5g usage scenarios and traffic models,” IEEE Communications Surveys & Tutorials, vol. 22, no. 2, pp. 905–929, 2020.
- S. R. Pokhrel, J. Ding, J. Park, O.-S. Park, and J. Choi, “Towards enabling critical mmtc: A review of urllc within mmtc,” IEEE Access, vol. 8, pp. 131796–131813, 2020.
- A. Leivadeas and M. Falkner, “A survey on intent based networking,” IEEE Communications Surveys & Tutorials, 2022.
- A. Clemm, L. Ciavaglia, L. Z. Granville, and J. Tantsura, “Intent-based networking-concepts and definitions,” IRTF draft work-in-progress, 2020.
- K. Abbas, T. A. Khan, M. Afaq, J. J. D. Rivera, and W.-C. Song, “Network data analytics function for ibn-based network slice lifecycle management,” in 2021 22nd Asia-Pacific Network Operations and Management Symposium (APNOMS), pp. 148–153, IEEE, 2021.
- T. Qiu, J. Chi, X. Zhou, Z. Ning, M. Atiquzzaman, and D. O. Wu, “Edge computing in industrial internet of things: Architecture, advances and challenges,” IEEE Communications Surveys & Tutorials, vol. 22, no. 4, pp. 2462–2488, 2020.
- Y. Mao, C. You, J. Zhang, K. Huang, and K. B. Letaief, “A survey on mobile edge computing: The communication perspective,” IEEE communications surveys & tutorials, vol. 19, no. 4, pp. 2322–2358, 2017.
- Academic Press, 2014.
- W. Guan, X. Wen, L. Wang, Z. Lu, and Y. Shen, “A service-oriented deployment policy of end-to-end network slicing based on complex network theory,” IEEE access, vol. 6, pp. 19691–19701, 2018.
- H. Zhang, N. Liu, X. Chu, K. Long, A.-H. Aghvami, and V. C. Leung, “Network slicing based 5g and future mobile networks: mobility, resource management, and challenges,” IEEE communications magazine, vol. 55, no. 8, pp. 138–145, 2017.
- P. Popovski, K. F. Trillingsgaard, O. Simeone, and G. Durisi, “5g wireless network slicing for embb, urllc, and mmtc: A communication-theoretic view,” Ieee Access, vol. 6, pp. 55765–55779, 2018.
- S.-Y. Lien, S.-C. Hung, D.-J. Deng, and Y. J. Wang, “Efficient ultra-reliable and low latency communications and massive machine-type communications in 5g new radio,” in GLOBECOM 2017-2017 IEEE Global Communications Conference, pp. 1–7, IEEE, 2017.
- F. Blanquez-Casado, G. Gomez, M. d. C. Aguayo-Torres, and J. T. Entrambasaguas, “eolla: an enhanced outer loop link adaptation for cellular networks,” EURASIP Journal on Wireless Communications and Networking, vol. 2016, pp. 1–16, 2016.
- I.-W. Lai, C.-H. Lee, K.-C. Chen, and E. Biglieri, “Path-permutation codes for end-to-end transmission in ad hoc cognitive radio networks,” IEEE Transactions on Wireless Communications, vol. 14, no. 6, pp. 3309–3321, 2015.
- J. Park, S. Samarakoon, H. Shiri, M. K. Abdel-Aziz, T. Nishio, A. Elgabli, and M. Bennis, “Extreme urllc: Vision, challenges, and key enablers,” arXiv preprint arXiv:2001.09683, 2020.
- J. M. Meredith, “Study on downlink multiuser superposition transmission for LTE,” in TSG RAN Meeting, vol. 67, 2015.
- C. You, K. Huang, H. Chae, and B.-H. Kim, “Energy-efficient resource allocation for mobile-edge computation offloading,” IEEE Transactions on Wireless Communications, vol. 16, no. 3, pp. 1397–1411, 2016.
- A. Destounis, G. S. Paschos, J. Arnau, and M. Kountouris, “Scheduling urllc users with reliable latency guarantees,” in 2018 16th International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOpt), pp. 1–8, IEEE, 2018.
- A. Destounis and G. S. Paschos, “Complexity of urllc scheduling and efficient approximation schemes,” in 2019 International Symposium on Modeling and Optimization in Mobile, Ad Hoc, and Wireless Networks (WiOPT), pp. 1–8, IEEE, 2019.
- American Mathematical Soc., 2009.
- J. Schulman, F. Wolski, P. Dhariwal, A. Radford, and O. Klimov, “Proximal policy optimization algorithms,” arXiv preprint arXiv:1707.06347, 2017.
- C. Yu, A. Velu, E. Vinitsky, Y. Wang, A. M. Bayen, and Y. Wu, “The surprising effectiveness of MAPPO in cooperative, multi-agent games.,” arXiv preprint arXiv:2103.01955, 2021.
- T. Haarnoja, A. Zhou, P. Abbeel, and S. Levine, “Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor,” in International conference on machine learning, pp. 1861–1870, PMLR, 2018.
- J. Schulman, P. Moritz, S. Levine, M. Jordan, and P. Abbeel, “High-dimensional continuous control using generalized advantage estimation,” arXiv preprint arXiv:1506.02438, 2015.
- S. Sinha, H. Bharadhwaj, A. Srinivas, and A. Garg, “D2rl: Deep dense architectures in reinforcement learning,” arXiv preprint arXiv:2010.09163, 2020.
- “Further advancements for E-UTRA physical layer aspects (Release 9), 3GPP standard TS 36.814,” Mar. 2010.
- D. P. Kingma and J. Ba, “Adam: A method for stochastic optimization,” arXiv preprint arXiv:1412.6980, 2014.
- R. H. Arpaci-Dusseau and A. C. Arpaci-Dusseau, “Chapter: Scheduling introduction,” Operating Systems: Three Easy Pieces; Arpaci-Dusseau Books: WI, USA, 2014.