MemorAI: Energy-Efficient Last-Level Cache Memory Optimization for Virtualized RANs (2405.02029v1)
Abstract: The virtualization of Radio Access Networks (vRAN) is well on its way to become a reality, driven by its advantages such as flexibility and cost-effectiveness. However, virtualization comes at a high price - virtual Base Stations (vBSs) sharing the same computing platform incur a significant computing overhead due to in extremis consumption of shared cache memory resources. Consequently, vRAN suffers from increased energy consumption, which fuels the already high operational costs in 5G networks. This paper investigates cache memory allocation mechanisms' effectiveness in reducing total energy consumption. Using an experimental vRAN platform, we profile the energy consumption and CPU utilization of vBS as a function of the network state (e.g., traffic demand, modulation scheme). Then, we address the high dimensionality of the problem by decomposing it per vBS, which is possible thanks to the Last-Level Cache (LLC) isolation implemented in our system. Based on this, we train a vBS digital twin, which allows us to train offline a classifier, avoiding the performance degradation of the system during training. Our results show that our approach performs very closely to an offline optimal oracle, outperforming standard approaches used in today's deployments.
- M. Masoudi, S. S. Lisi, and C. Cavdar, “Cost-effective migration toward virtualized c-ran with scalable fronthaul design,” IEEE Systems Journal, vol. 14, no. 4, pp. 5100–5110, 2020.
- F. W. Murti et al., “An optimal deployment framework for multi-cloud virtualized radio access networks,” IEEE Transactions on Wireless Communications, vol. 20, no. 4, pp. 2251–2265, 2020.
- Samsung, “Virtualized Radio Access Network: Architecture, Key technologies and Benefits.” Technical Report, 2019, Link.
- O-RAN Alliance, “Cloud Architecture and Deployment Scenarios for O-RAN Virtualized RAN (O-RAN.WG6.CADS-v04.00) ,” Technical Report, Oct. 2022.
- A. Tootoonchian et al., “ResQ: Enabling SLOs in Network Function Virtualization,” in Proceedings of the 15th USENIX NSDI, 2018, pp. 283–297.
- G. Garcia-Aviles, A. Garcia-Saavedra, M. Gramaglia, X. Costa-Perez, P. Serrano, and A. Banchs, “Nuberu: Reliable ran virtualization in shared platforms,” in Proceedings of the 27th Annual International Conference on Mobile Computing and Networking, 2021, pp. 749–761.
- P. Kumar, N. Dukkipati, N. Lewis, Y. Cui, Y. Wang, C. Li, V. Valancius, J. Adriaens, S. Gribble, N. Foster et al., “Picnic: predictable virtualized nic,” in Proceedings of the ACM Special Interest Group on Data Communication, 2019, pp. 351–366.
- L. Subramanian et al., “The application slowdown model: Quantifying and controlling the impact of inter-application interference at shared caches and main memory,” in Proceedings of the 48th International Symposium on Microarchitecture, 2015, pp. 62–75.
- J. Park, S. Park, and W. Baek, “Copart: Coordinated partitioning of last-level cache and memory bandwidth for fairness-aware workload consolidation on commodity servers,” in Proceedings of the Fourteenth EuroSys Conference 2019, 2019, pp. 1–16.
- V. Selfa et al., “Application clustering policies to address system fairness with intel’s cache allocation technology,” in 2017 26th international conference on parallel architectures and compilation techniques (pact). IEEE, 2017, pp. 194–205.
- J. X. Salvat Lozano, A. Garcia-Saavedra, X. Li, and X. Costa Perez, “AIRIC: Orchestration of virtualized radio access networks with noisy neighbours,” Accepted for publication in IEEE Journal on Selected Areas in Communications, 2023.
- X. Fan, W.-D. Weber, and L. A. Barroso, “Power provisioning for a warehouse-sized computer,” ACM SIGARCH computer architecture news, vol. 35, no. 2, pp. 13–23, 2007.
- C. Lefurgy et al., “Server-level power control,” in Fourth International Conference on Autonomic Computing (ICAC’07). IEEE, 2007, pp. 4–4.
- J. A. Ayala-Romero et al., “vrain: Deep learning based orchestration for computing and radio resources in vrans,” IEEE Transactions on Mobile Computing, vol. 21, no. 7, pp. 2652–2670, 2022.
- J. Patterson, “Modern microprocessors: A 90 minute guide!” Cortex, vol. 15, p. A57, 2003.
- U. Drepper, “What every programmer should know about memory,” Red Hat, Inc, vol. 11, p. 2007, 2007, Link.
- L. Prechelt, “Early stopping-but when?” in Neural Networks: Tricks of the trade. Springer, 2002, pp. 55–69.
- O-RAN Alliance, “O-RAN Non-RT RIC Architecture 3.0 (O-RAN.WG2.Non-RT-RIC-ARCH-R003-v03.00),” Technical Report, Jun. 2023.
- C. Marquez et al., “How should i slice my network? a multi-service empirical evaluation of resource sharing efficiency,” in Proceedings of the 24th Annual International Conference on Mobile Computing and Networking, 2018, pp. 191–206.
- J. Khalid, E. Rozner, W. Felter, C. Xu, K. Rajamani, A. Ferreira, and A. Akella, “Iron: Isolating network-based {{\{{CPU}}\}} in container environments,” in 15th USENIX Symposium on Networked Systems Design and Implementation (NSDI 18), 2018, pp. 313–328.
- X. Foukas and B. Radunovic, “Concordia: Teaching the 5g vran to share compute,” in Proceedings of the 2021 ACM SIGCOMM 2021 Conference, 2021, pp. 580–596.