Delay-Optimal Forwarding and Computation Offloading for Service Chain Tasks (2403.15936v1)
Abstract: Emerging edge computing paradigms enable heterogeneous devices to collaborate on complex computation applications. However, for congestible links and computing units, delay-optimal forwarding and offloading for service chain tasks (e.g., DNN with vertical split) in edge computing networks remains an open problem. In this paper, we formulate the service chain forwarding and offloading problem with arbitrary topology and heterogeneous transmission/computation capability, and aim to minimize the aggregated network cost. We consider congestion-aware nonlinear cost functions that cover various performance metrics and constraints, such as average queueing delay with limited processor capacity. We solve the non-convex optimization problem globally by analyzing the KKT condition and proposing a sufficient condition for optimality. We then propose a distributed algorithm that converges to the global optimum. The algorithm adapts to changes in input rates and network topology, and can be implemented as an online algorithm. Numerical evaluation shows that our method significantly outperforms baselines in multiple network instances, especially in congested scenarios.
- Ericsson. Ericsson mobility report (2021, Nov.). [Online]. Available: https://www.ericsson.com/en/reports-and-papers/mobility-report
- Y. Sahni, J. Cao, L. Yang, and Y. Ji, “Multi-hop multi-task partial computation offloading in collaborative edge computing,” IEEE Transactions on Parallel and Distributed Systems, vol. 32, no. 5, pp. 1133–1145, 2020.
- Y. Sahni, J. Cao, S. Zhang, and L. Yang, “Edge mesh: A new paradigm to enable distributed intelligence in internet of things,” IEEE access, vol. 5, pp. 16 441–16 458, 2017.
- K. Zhu, W. Zhi, X. Chen, and L. Zhang, “Socially motivated data caching in ultra-dense small cell networks,” IEEE Network, vol. 31, no. 4, pp. 42–48, 2017.
- Z. Hong, W. Chen, H. Huang, S. Guo, and Z. Zheng, “Multi-hop cooperative computation offloading for industrial iot–edge–cloud computing environments,” IEEE Transactions on Parallel and Distributed Systems, vol. 30, no. 12, pp. 2759–2774, 2019.
- J. Zhang, Y. Liu, and E. Yeh, “Optimal congestion-aware routing and offloading in collaborative edge computing,” in 2022 20th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt). IEEE, 2022, pp. 121–128.
- B. Han, V. Gopalakrishnan, L. Ji, and S. Lee, “Network function virtualization: Challenges and opportunities for innovations,” IEEE communications magazine, vol. 53, no. 2, pp. 90–97, 2015.
- J. Zhang, A. Sinha, J. Llorca, A. M. Tulino, and E. Modiano, “Optimal control of distributed computing networks with mixed-cast traffic flows,” IEEE/ACM Transactions on Networking, vol. 29, no. 4, pp. 1760–1773, 2021.
- Y. Wang, M. Sheng, X. Wang, L. Wang, and J. Li, “Mobile-edge computing: Partial computation offloading using dynamic voltage scaling,” IEEE Transactions on Communications, vol. 64, no. 10, 2016.
- R. Gallager, “A minimum delay routing algorithm using distributed computation,” IEEE transactions on communications, vol. 25, 1977.
- Y. Xi and E. M. Yeh, “Node-based optimal power control, routing, and congestion control in wireless networks,” IEEE Transactions on Information Theory, vol. 54, no. 9, pp. 4081–4106, 2008.
- M. Mahdian and E. Yeh, “Mindelay: Low-latency joint caching and forwarding for multi-hop networks,” in 2018 IEEE International Conference on Communications (ICC). IEEE, 2018, pp. 1–7.
- J. Zhang and E. Yeh, “Congestion-aware routing and content placement in elastic cache networks,” arXiv preprint arXiv:2303.01648, 2023.
- Y. Sahni, J. Cao, and L. Yang, “Data-aware task allocation for achieving low latency in collaborative edge computing,” IEEE Internet of Things Journal, vol. 6, no. 2, pp. 3512–3524, 2018.
- B. Liu, Y. Cao, Y. Zhang, and T. Jiang, “A distributed framework for task offloading in edge computing networks of arbitrary topology,” IEEE Transactions on Wireless Communications, vol. 19, no. 4, 2020.
- H. Al-Shatri, S. Müller, and A. Klein, “Distributed algorithm for energy efficient multi-hop computation offloading,” in 2016 IEEE International Conference on Communications (ICC). IEEE, 2016, pp. 1–6.
- Q. Luo, W. Shi, and P. Fan, “Qoe-driven computation offloading: Performance analysis and adaptive method,” in 2021 13th International Conference on Wireless Communications and Signal Processing (WCSP). IEEE, 2021, pp. 1–5.
- X. He, R. Jin, and H. Dai, “Multi-hop task offloading with on-the-fly computation for multi-uav remote edge computing,” IEEE Transactions on Communications, 2021.
- C. Funai, C. Tapparello, and W. Heinzelman, “Computational offloading for energy constrained devices in multi-hop cooperative networks,” IEEE Transactions on Mobile Computing, vol. 19, no. 1, pp. 60–73, 2019.
- B. McMahan, E. Moore, D. Ramage, S. Hampson, and B. A. y Arcas, “Communication-efficient learning of deep networks from decentralized data,” in Artificial intelligence and statistics. PMLR, 2017.
- Z. Hong, H. Huang, S. Guo, W. Chen, and Z. Zheng, “Qos-aware cooperative computation offloading for robot swarms in cloud robotics,” IEEE Transactions on Vehicular Technology, vol. 68, no. 4, 2019.
- B. Xiang, J. Elias, F. Martignon, and E. Di Nitto, “Joint planning of network slicing and mobile edge computing: Models and algorithms,” arXiv preprint arXiv:2005.07301, 2020.
- K. Kamran, E. Yeh, and Q. Ma, “Deco: Joint computation, caching and forwarding in data-centric computing networks,” in Proceedings of the Twentieth ACM International Symposium on Mobile Ad Hoc Networking and Computing, 2019, pp. 111–120.
- E. Yeh, T. Ho, Y. Cui, M. Burd, R. Liu, and D. Leong, “Vip: A framework for joint dynamic forwarding and caching in named data networks,” in Proceedings of the 1st ACM Conference on Information-Centric Networking, 2014, pp. 117–126.
- L. Tassiulas and A. Ephremides, “Stability properties of constrained queueing systems and scheduling policies for maximum throughput in multihop radio networks,” in 29th IEEE Conference on Decision and Control. IEEE, 1990, pp. 2130–2132.
- J. Halpern and C. Pignataro, “Service function chaining (sfc) architecture,” Tech. Rep., 2015.
- S. G. Kulkarni, W. Zhang, J. Hwang, S. Rajagopalan, K. Ramakrishnan, T. Wood, M. Arumaithurai, and X. Fu, “Nfvnice: Dynamic backpressure and scheduling for nfv service chains,” in Proceedings of the conference of the ACM special interest group on data communication, 2017, pp. 71–84.
- L. Qu, C. Assi, and K. Shaban, “Delay-aware scheduling and resource optimization with network function virtualization,” IEEE Transactions on communications, vol. 64, no. 9, pp. 3746–3758, 2016.
- F. Khoramnejad, R. Joda, and M. Erol-Kantarci, “Distributed multi-agent learning for service function chain partial offloading at the edge,” in 2021 IEEE International Conference on Communications Workshops (ICC Workshops). IEEE, 2021, pp. 1–6.
- T. Wang, J. Zu, G. Hu, and D. Peng, “Adaptive service function chain scheduling in mobile edge computing via deep reinforcement learning,” IEEE Access, vol. 8, pp. 164 922–164 935, 2020.
- C.-S. Yang, R. Pedarsani, and A. S. Avestimehr, “Communication-aware scheduling of serial tasks for dispersed computing,” IEEE/ACM Transactions on Networking, vol. 27, no. 4, pp. 1330–1343, 2019.
- S. Li, M. A. Maddah-Ali, and A. S. Avestimehr, “Coding for distributed fog computing,” IEEE Communications Magazine, vol. 55, no. 4, pp. 34–40, 2017.
- P. Ghosh, Q. Nguyen, P. K. Sakulkar, J. A. Tran, A. Knezevic, J. Wang, Z. Lin, B. Krishnamachari, M. Annavaram, and S. Avestimehr, “Jupiter: a networked computing architecture,” in Proceedings of the 14th IEEE/ACM International Conference on Utility and Cloud Computing Companion, 2021, pp. 1–8.
- J. Zhang, A. Sinha, J. Llorca, A. Tulino, and E. Modiano, “Optimal control of distributed computing networks with mixed-cast traffic flows,” in IEEE INFOCOM 2018-IEEE Conference on Computer Communications. IEEE, 2018, pp. 1880–1888.
- S. Ioannidis and E. Yeh, “Jointly optimal routing and caching for arbitrary network topologies,” IEEE Journal on Selected Areas in Communications, vol. 36, no. 6, pp. 1258–1275, 2018.
- B. Liu, K. Poularakis, L. Tassiulas, and T. Jiang, “Joint caching and routing in congestible networks of arbitrary topology,” IEEE Internet of Things Journal, vol. 6, no. 6, pp. 10 105–10 118, 2019.
- Z. Chen, Q. Ma, L. Gao, and X. Chen, “Edgeconomics: Price competition and selfish computation offloading in multi-server edge computing networks,” in 2021 19th International Symposium on Modeling and Optimization in Mobile, Ad hoc, and Wireless Networks (WiOpt).
- H. Zhang and S. Sra, “First-order methods for geodesically convex optimization,” in Conference on Learning Theory. PMLR, 2016, pp. 1617–1638.
- D. P. Bertsekas, “Nonlinear programming,” Journal of the Operational Research Society, vol. 48, no. 3, pp. 334–334, 1997.
- S. Jana and C. Nahak, “Convex optimization on riemannian manifolds,” 2020.
- J. Zhang, “Joint-Routing-and-Computation-2022.” [Online]. Available: https://github.com/JinkunZhang/Joint-Routing-and-Computation-2022
- D. Rossi and G. Rossini, “Caching performance of content centric networks under multi-path routing (and more),” Relatório técnico, Telecom ParisTech, vol. 2011, pp. 1–6, 2011.
- J. Kleinberg, “The small-world phenomenon: An algorithmic perspective,” in Proceedings of the thirty-second annual ACM symposium on Theory of computing, 2000, pp. 163–170.
- Y. Liu, Y. Li, Q. Ma, S. Ioannidis, and E. Yeh, “Fair caching networks,” ACM SIGMETRICS Performance Evaluation Review, vol. 48, no. 3, pp. 89–90, 2021.
- F. Kelly, “Charging and rate control for elastic traffic,” European transactions on Telecommunications, vol. 8, no. 1, pp. 33–37, 1997.
- D. Bertsekas, E. Gafni, and R. Gallager, “Second derivative algorithms for minimum delay distributed routing in networks,” IEEE Transactions on Communications, vol. 32, no. 8, pp. 911–919, 1984.