Edge Computing Enabled Real-Time Video Analysis via Adaptive Spatial-Temporal Semantic Filtering (2402.18927v1)
Abstract: This paper proposes a novel edge computing enabled real-time video analysis system for intelligent visual devices. The proposed system consists of a tracking-assisted object detection module (TAODM) and a region of interesting module (ROIM). TAODM adaptively determines the offloading decision to process each video frame locally with a tracking algorithm or to offload it to the edge server inferred by an object detection model. ROIM determines each offloading frame's resolution and detection model configuration to ensure that the analysis results can return in time. TAODM and ROIM interact jointly to filter the repetitive spatial-temporal semantic information to maximize the processing rate while ensuring high video analysis accuracy. Unlike most existing works, this paper investigates the real-time video analysis systems where the intelligent visual device connects to the edge server through a wireless network with fluctuating network conditions. We decompose the real-time video analysis problem into the offloading decision and configurations selection sub-problems. To solve these two sub-problems, we introduce a double deep Q network (DDQN) based offloading approach and a contextual multi-armed bandit (CMAB) based adaptive configurations selection approach, respectively. A DDQN-CMAB reinforcement learning (DCRL) training framework is further developed to integrate these two approaches to improve the overall video analyzing performance. Extensive simulations are conducted to evaluate the performance of the proposed solution, and demonstrate its superiority over counterparts.
- J. Chen, C. Yi, et al., “Networking architecture and key supporting technologies for human digital twin in personalized healthcare: A comprehensive survey,” IEEE Commun. Surv. Tutor., pp. 1–1, 2023.
- H. Liu and G. Cao, “Deep learning video analytics through online learning based edge computing,” IEEE Trans. Wirel. Commun., vol. 21, no. 10, pp. 8193–8204, 2022.
- Y. Shi, C. Yi, R. Wang et al., “Service migration or task rerouting: A two-timescale online resource optimization for mec,” IEEE Trans. Wirel. Commun., pp. 1–1, 2023.
- L. Dong, Z. Yang et al., “WAVE: Edge-device cooperated real-time object detection for open-air applications,” IEEE Trans. Mob. Comput., pp. 1–1, 2022.
- Y. Xu, X. Liu et al., “Cross-view people tracking by scene-centered spatio-temporal parsing,” in Proc. AAAI, vol. 31, no. 1, 2017.
- J. F. Henriques, R. Caseiro et al., “High-speed tracking with kernelized correlation filters,” IEEE Trans. Pattern Anal. Mach. Intell., vol. 37, no. 3, pp. 583–596, 2015.
- A. Lukežic, T. Vojír et al., “Discriminative correlation filter with channel and spatial reliability,” in Proc. IEEE CVPR, 2017, pp. 4847–4856.
- K. Zhao, Z. Zhou et al., “Edgeadaptor: Online configuration adaption, model selection and resource provisioning for edge DNN inference serving at scale,” IEEE Trans. Mob. Comput., pp. 1–16, 2022.
- J. Chen, C. Yi et al., “Learning aided joint sensor activation and mobile charging vehicle scheduling for energy-efficient wrsn-based industrial iot,” IEEE Trans. Veh. Technol., vol. 72, no. 4, pp. 5064–5078, 2023.
- R. Chen, C. Yi, K. Zhu et al., “A three-party hierarchical game for physical layer security aware wireless communications with dynamic trilateral coalitions,” IEEE Trans. Wirel. Commun., pp. 1–1, 2023.