Collaborative Edge AI Inference over Cloud-RAN (2404.06007v1)
Abstract: In this paper, a cloud radio access network (Cloud-RAN) based collaborative edge AI inference architecture is proposed. Specifically, geographically distributed devices capture real-time noise-corrupted sensory data samples and extract the noisy local feature vectors, which are then aggregated at each remote radio head (RRH) to suppress sensing noise. To realize efficient uplink feature aggregation, we allow each RRH receives local feature vectors from all devices over the same resource blocks simultaneously by leveraging an over-the-air computation (AirComp) technique. Thereafter, these aggregated feature vectors are quantized and transmitted to a central processor (CP) for further aggregation and downstream inference tasks. Our aim in this work is to maximize the inference accuracy via a surrogate accuracy metric called discriminant gain, which measures the discernibility of different classes in the feature space. The key challenges lie on simultaneously suppressing the coupled sensing noise, AirComp distortion caused by hostile wireless channels, and the quantization error resulting from the limited capacity of fronthaul links. To address these challenges, this work proposes a joint transmit precoding, receive beamforming, and quantization error control scheme to enhance the inference accuracy. Extensive numerical experiments demonstrate the effectiveness and superiority of our proposed optimization algorithm compared to various baselines.
- K. B. Letaief, Y. Shi, J. Lu, and J. Lu, “Edge artificial intelligence for 6G: Vision, enabling technologies, and applications,” IEEE J. Sel. Areas Commun., vol. 40, no. 1, pp. 5–36, 2022.
- K. B. Letaief, W. Chen, Y. Shi, J. Zhang, and Y. A. Zhang, “The roadmap to 6G: AI empowered wireless networks,” IEEE Commun. Mag., vol. 57, no. 8, pp. 84–90, 2019.
- G. Zhu, D. Liu, Y. Du, C. You, J. Zhang, and K. Huang, “Toward an intelligent edge: Wireless communication meets machine learning,” IEEE Commun. Mag., vol. 58, no. 1, pp. 19–25, 2020.
- D. Wen, X. Li, Q. Zeng, J. Ren, and K. Huang, “An overview of data-importance aware radio resource management for edge machine learning,” J. Commun. Inf. Netw., vol. 4, no. 4, pp. 1–14, 2019.
- D. Li, Y. Gu, H. Ma, Y. Li, L. Zhang, R. Li, R. Hao, and E.-P. Li, “Deep learning inverse analysis of higher order modes in monocone tem cell,” IEEE Transactions on Microwave Theory and Techniques, vol. 70, no. 12, pp. 5332–5339, 2022.
- Q. Lan, D. Wen, Z. Zhang, Q. Zeng, X. Chen, P. Popovski, and K. Huang, “What is semantic communication? A view on conveying meaning in the era of machine intelligence,” J. Commun. Inf. Networks, vol. 6, no. 4, pp. 336–371, 2021.
- Y. Shi, K. Yang, T. Jiang, J. Zhang, and K. B. Letaief, “Communication-efficient edge AI: algorithms and systems,” IEEE Commun. Surv. Tutorials, vol. 22, no. 4, pp. 2167–2191, 2020.
- D. Wen, X. Li, Y. Zhou, Y. Shi, S. Wu, and C. Jiang, “Integrated sensing-communication-computation for edge artificial intelligence,” CoRR, vol. abs/2306.01162, 2023.
- M. Lee, G. Yu, and H. Dai, “Decentralized inference with graph neural networks in wireless communication systems,” IEEE Trans. Mob. Comput., vol. 22, no. 5, pp. 2582–2598, 2023.
- S. F. Yilmaz, B. Hasircioglu, and D. Gündüz, “Over-the-air ensemble inference with model privacy,” in IEEE International Symposium on Information Theory, ISIT 2022, Espoo, Finland, June 26 - July 1, 2022, pp. 1265–1270, IEEE, 2022.
- G. Zhu, Z. Lyu, X. Jiao, P. Liu, M. Chen, J. Xu, S. Cui, and P. Zhang, “Pushing AI to wireless network edge: an overview on integrated sensing, communication, and computation towards 6G,” Sci. China Inf. Sci., vol. 66, no. 3, p. 130301, 2023.
- J. Shao and J. Zhang, “Communication-computation trade-off in resource-constrained edge inference,” IEEE Commun. Mag., vol. 58, no. 12, pp. 20–26, 2020.
- K. Yang, Y. Shi, W. Yu, and Z. Ding, “Energy-efficient processing and robust wireless cooperative transmission for edge inference,” IEEE Internet Things J., vol. 7, no. 10, pp. 9456–9470, 2020.
- X. Huang and S. Zhou, “Dynamic compression ratio selection for edge inference systems with hard deadlines,” IEEE Internet Things J., vol. 7, no. 9, pp. 8800–8810, 2020.
- S. Yun, J.-M. Kang, S. Choi, and I.-M. Kim, “Cooperative Inference of DNNs Over Noisy Wireless Channels,” IEEE Trans. Veh. Technol., vol. 70, no. 8, pp. 8298–8303, 2021.
- Z. He, T. Zhang, and R. B. Lee, “Attacking and protecting data privacy in edge–cloud collaborative inference systems,” IEEE Internet Things J., vol. 8, no. 12, pp. 9706–9716, 2020.
- Y. Kang, J. Hauswald, C. Gao, A. Rovinski, T. Mudge, J. Mars, and L. Tang, “Neurosurgeon: Collaborative intelligence between the cloud and mobile edge,” ACM SIGARCH Comput. Archit. News, vol. 45, no. 1, pp. 615–629, 2017.
- E. Li, L. Zeng, Z. Zhou, and X. Chen, “Edge AI: On-demand accelerating deep neural network inference via edge computing,” IEEE Trans. Wireless Commun., vol. 19, no. 1, pp. 447–457, 2019.
- Z. Liu, Q. Lan, and K. Huang, “Resource allocation for multiuser edge inference with batching and early exiting,” IEEE J. Sel. Areas Commun., vol. 41, no. 4, pp. 1186–1200, 2023.
- W. Shi, Y. Hou, S. Zhou, Z. Niu, Y. Zhang, and L. Geng, “Improving device-edge cooperative inference of deep learning via 2-step pruning,” in IEEE INFOCOM WKSHPS, pp. 1–6, IEEE, 2019.
- J. Shao, H. Zhang, Y. Mao, and J. Zhang, “Branchy-GNN: A device-edge co-inference framework for efficient point cloud processing,” in ICASSP 2021-2021 IEEE ICASSP, pp. 8488–8492, IEEE, 2021.
- J. Shao and J. Zhang, “Bottlenet++: An end-to-end approach for feature compression in device-edge co-inference systems,” in 2020 IEEE ICC Workshops, pp. 1–6, IEEE, 2020.
- Q. Lan, Q. Zeng, P. POPOVSKI, D. GÜNDÜZ, and K. Huang, “Progressive feature transmission for split inference at the wireless edge,” IEEE Trans. on Wireless Commun., 2021.
- J. Shao, Y. Mao, and J. Zhang, “Task-oriented communication for multi-device cooperative edge inference,” IEEE Trans. Wireless Commun., 2022.
- H. Lee and S.-W. Kim, “Task-oriented edge networks: Decentralized learning over wireless fronthaul,” arXiv preprint arXiv:2312.01288, 2023.
- D. Wen, X. Jiao, P. Liu, G. Zhu, Y. Shi, and K. Huang, “Task-oriented Over-the-Air computation for multi-device Edge AI,” IEEE Trans. on Wireless Commun., 2023.
- Z. Zhuang, D. Wen, Y. Shi, G. Zhu, S. Wu, and D. Niyato, “Integrated Sensing-Communication-Computation for Over-the-Air Edge AI Inference,” IEEE Trans. on Wireless Commun., 2023.
- L. Liu and R. Zhang, “Optimized uplink transmission in multi-antenna C-RAN with spatial compression and forward,” IEEE Trans. Signal Process., vol. 63, no. 19, pp. 5083–5095, 2015.
- A. W. Dawson, M. K. Marina, and F. J. Garcia, “On the benefits of RAN virtualisation in C-RAN based mobile networks,” in Third European Workshop on Software Defined Networks, EWSDN 2014, Budapest, Hungary, September 1-3, 2014, pp. 103–108, IEEE Computer Society, 2014.
- Y. Shi, J. Zhang, K. B. Letaief, B. Bai, and W. Chen, “Large-scale convex optimization for ultra-dense cloud-RAN,” IEEE Wireless Commun., vol. 22, no. 3, pp. 84–91, 2015.
- H. Ma, X. Yuan, and Z. Ding, “Over-the-air federated learning in mimo cloud-ran systems,” arXiv preprint arXiv:2305.10000, 2023.
- Y. Shi, S. Xia, Y. Zhou, Y. Mao, C. Jiang, and M. Tao, “Vertical federated learning over cloud-ran: Convergence analysis and system optimization,” IEEE Trans. on Wireless Commun., pp. 1–1, 2023.
- R. G. Stephen and R. Zhang, “Joint millimeter-wave fronthaul and OFDMA resource allocation in ultra-dense CRAN,” IEEE Trans. Commun., vol. 65, no. 3, pp. 1411–1423, 2017.
- Y. Zhou and W. Yu, “Optimized backhaul compression for uplink cloud radio access network,” IEEE J. Sel. Areas Commun., vol. 32, no. 6, pp. 1295–1307, 2014.
- Y. Shi, Y. Zhou, D. Wen, Y. Wu, C. Jiang, and K. B. Letaief, “Task-Oriented Communications for 6G: Vision, Principles, and Technologies,” accepted to IEEE Wireless Commun. Mag., 2023.
- D. Wen, P. Liu, G. Zhu, Y. Shi, J. Xu, Y. C. Eldar, and S. Cui, “Task-oriented sensing, computation, and communication integration for multi-device edge ai,” IEEE Trans. Wireless Commun., 2023.
- J. Xiao, S. Cui, Z. Luo, and A. J. Goldsmith, “Power scheduling of universal decentralized estimation in sensor networks,” IEEE Trans. Signal Process., vol. 54, no. 2, pp. 413–422, 2006.
- J. Xiao and Z. Luo, “Decentralized estimation in an inhomogeneous sensing environment,” IEEE Trans. Inf. Theory, vol. 51, no. 10, pp. 3564–3575, 2005.
- G. Yang, J. Li, S. G. Zhou, and Y. Qi, “A wide-angle e-plane scanning linear array antenna with wide beam elements,” IEEE Antennas Wireless Propag. Lett., vol. 16, pp. 2923–2926, 2017.
- J. J. Xiao, S. Cui, Z. Q. Luo, and A. J. Goldsmith, “Power scheduling of universal decentralized estimation in sensor networks,” IEEE Trans. Signal Process., vol. 54, no. 2, pp. 413–422, 2006.
- G. J. McLachlan and S. I. Rathnayake, “On the number of components in a Gaussian mixture model,” WIREs Data Mining Knowl. Discov., vol. 4, no. 5, pp. 341–355, 2014.
- G. J. McLachlan, S. X. Lee, and S. I. Rathnayake, “Finite mixture models,” Annu. Rev. Statist. Its Appl., vol. 6, pp. 355–378, 2019.
- K. Yang, T. Jiang, Y. Shi, and Z. Ding, “Federated learning via Over-the-Air computation,” IEEE Trans. Wireless Commun., vol. 19, no. 3, pp. 2022–2035, 2020.
- G. Zhu, Y. Wang, and K. Huang, “Broadband analog aggregation for low-latency federated edge learning,” IEEE Trans. Wireless Commun., vol. 19, no. 1, pp. 491–506, 2019.
- X. Cao, G. Zhu, J. Xu, and K. Huang, “Optimized power control for over-the-air computation in fading channels,” IEEE Trans. Wirel. Commun., vol. 19, no. 11, pp. 7498–7513, 2020.
- W. Liu, X. Zang, Y. Li, and B. Vucetic, “Over-the-Air computation systems: Optimization, analysis and scaling laws,” IEEE Trans. on Wireless Commun., vol. 19, no. 8, pp. 5488–5502, 2020.
- A. Şahin and R. Yang, “A survey on over-the-air computation,” IEEE Commun. Surv. Tutorials, 2023.
- M. Peng, C. Wang, V. Lau, and H. V. Poor, “Fronthaul-Constrained Cloud Radio Access Networks: Insights and Challenges,” IEEE Wireless Commun., vol. 22, no. 2, pp. 152–160, 2015.
- Cambridge University Press, 2017.
- Courier Corporation, 1997.
- D. Wen, G. Zhu, and K. Huang, “Reduced-Dimension Design of MIMO Over-the-Air Computing for Data Aggregation in Clustered IoT Networks,” IEEE Trans. Wireless Commun., vol. 18, no. 11, pp. 5255–5268, 2019.
- A. Wiesel, Y. C. Eldar, and S. Shamai, “Zero-forcing precoding and generalized inverses,” IEEE Trans. Signal Process., vol. 56, no. 9, pp. 4409–4418, 2008.
- X. Li, G. Zhu, Y. Gong, and K. Huang, “Wirelessly powered data aggregation for IoT via over-the-air function computation: Beamforming and power control,” IEEE Trans. Wireless Commun., vol. 18, no. 7, pp. 3437–3452, 2019.
- M. Grant and S. Boyd, “CVX: Matlab software for disciplined convex programming, version 2.1.” http://cvxr.com/cvx, Mar. 2014.
- B. R. Marks and G. P. Wright, “A general inner approximation algorithm for nonconvex mathematical programs,” Operations research, vol. 26, no. 4, pp. 681–683, 1978.
- C. Sun, W. Ni, and X. Wang, “Joint computation offloading and trajectory planning for uav-assisted edge computing,” IEEE Trans. Wirel. Commun., vol. 20, no. 8, pp. 5343–5358, 2021.
- W. Lyu, Y. Xiu, J. Zhao, and Z. Zhang, “Optimizing the age of information in ris-aided SWIPT networks,” IEEE Trans. Veh. Technol., vol. 72, no. 2, pp. 2615–2619, 2023.
- G. Li, S. Wang, J. Li, R. Wang, X. Peng, and T. X. Han, “Wireless sensing with deep spectrogram network and primitive based autoregressive hybrid channel model,” in 2021 IEEE 22nd International Workshop on SPAWC, pp. 481–485, IEEE, 2021.
- H. Xiao, K. Rasul, and R. Vollgraf, “Fashion-mnist: a novel image dataset for benchmarking machine learning algorithms,” CoRR, vol. abs/1708.07747, 2017.
- A. Rea and W. Rea, “How many components should be retained from a multivariate time series PCA?,” arXiv preprint arXiv:1610.03588, 2016.
- H. Khalilian and I. V. Bajic, “Video Watermarking With Empirical PCA-Based Decoding,” IEEE Trans. Image Process., vol. 22, no. 12, pp. 4825–4840, 2013.