Deep Learning-Based Real-Time Rate Control for Live Streaming on Wireless Networks (2310.06857v1)
Abstract: Providing wireless users with high-quality video content has become increasingly important. However, ensuring consistent video quality poses challenges due to variable encoded bitrate caused by dynamic video content and fluctuating channel bitrate caused by wireless fading effects. Suboptimal selection of encoder parameters can lead to video quality loss due to underutilized bandwidth or the introduction of video artifacts due to packet loss. To address this, a real-time deep learning based H.264 controller is proposed. This controller leverages instantaneous channel quality data driven from the physical layer, along with the video chunk, to dynamically estimate the optimal encoder parameters with a negligible delay in real-time. The objective is to maintain an encoded video bitrate slightly below the available channel bitrate. Experimental results, conducted on both QCIF dataset and a diverse selection of random videos from public datasets, validate the effectiveness of the approach. Remarkably, improvements of 10-20 dB in PSNR with repect to the state-of-the-art adaptive bitrate video streaming is achieved, with an average packet drop rate as low as 0.002.
- Overview of the h. 264/avc video coding standard. IEEE Transactions on Circ. and Sys. for Video Tech., 13(7):560–576, July 2003.
- T. Stockhammer. Dynamic adaptive streaming over http– standards and design principles. In Proceedings of the second annual ACM conference on Multimedia systems, pages 133–144, 2011.
- Apple HTTP Live Streaming. Accessed: Sep. 25, 2014. [Online]. Available: https://developer.apple.com/resources/http-streaming/.
- Microsoft Smooth Streaming. Accessed: Sep. 25, 2014. [Online]. Available: http://www.iis.net/downloads/microsoft/smooth-streaming.
- Adobe HTTP Dynamic Streaming. Accessed: Sep. 25, 2014. [Online]. Available: http://www.adobe.com/products/hds-dynamic-streaming.html.
- 3GPP TS36.213 V12.3.0. Evolved universal terrestrial radio access (e-utra); physical layer procedures.
- C. Feichtenhofer. X3d: Expanding architectures for efficient video recognition. In CVPR, pages 203–213, June 2020.
- Y. Wu and K. He. Group normalization. In ECCV, September 2018.
- D. Hendrycks and K. Gimpel. Gaussian error linear units (gelus). Available online at arXiv:1606.08415.
- Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on Circ. and Sys. for Video Tech., 22(12):1649–1668, September 2012.
- QCIF dataset. Available: http://vip.cs.nctu.edu.tw/resource_seq.html.