Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
91 tokens/sec
Gemini 2.5 Pro Premium
50 tokens/sec
GPT-5 Medium
27 tokens/sec
GPT-5 High Premium
19 tokens/sec
GPT-4o
103 tokens/sec
DeepSeek R1 via Azure Premium
82 tokens/sec
GPT OSS 120B via Groq Premium
458 tokens/sec
Kimi K2 via Groq Premium
209 tokens/sec
2000 character limit reached

DMVC: Multi-Camera Video Compression Network aimed at Improving Deep Learning Accuracy (2410.18400v1)

Published 24 Oct 2024 in cs.CV, cs.DC, and eess.IV

Abstract: We introduce a cutting-edge video compression framework tailored for the age of ubiquitous video data, uniquely designed to serve machine learning applications. Unlike traditional compression methods that prioritize human visual perception, our innovative approach focuses on preserving semantic information critical for deep learning accuracy, while efficiently reducing data size. The framework operates on a batch basis, capable of handling multiple video streams simultaneously, thereby enhancing scalability and processing efficiency. It features a dual reconstruction mode: lightweight for real-time applications requiring swift responses, and high-precision for scenarios where accuracy is crucial. Based on a designed deep learning algorithms, it adeptly segregates essential information from redundancy, ensuring machine learning tasks are fed with data of the highest relevance. Our experimental results, derived from diverse datasets including urban surveillance and autonomous vehicle navigation, showcase DMVC's superiority in maintaining or improving machine learning task accuracy, while achieving significant data compression. This breakthrough paves the way for smarter, scalable video analysis systems, promising immense potential across various applications from smart city infrastructure to autonomous systems, establishing a new benchmark for integrating video compression with machine learning.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (39)
  1. A perceptual quantization strategy for HEVC based on a convolutional neural network trained on natural images. In Applications of digital image processing XXXVIII, Vol. 9599. SPIE, 395–408.
  2. Overview of the versatile video coding (VVC) standard and its applications. IEEE Transactions on Circuits and Systems for Video Technology 31, 10 (2021), 3736–3764.
  3. Learning for video compression. IEEE Transactions on Circuits and Systems for Video Technology 30, 2 (2019), 566–576.
  4. Convolutional neural networks based intra prediction for HEVC. arXiv preprint arXiv:1808.05734 (2018).
  5. Low-rate image compression with super-resolution learning. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition workshops. 154–155.
  6. CrossRoI: cross-camera region of interest optimization for efficient real time video analytics at scale. In Proceedings of the 12th ACM Multimedia Systems Conference. 186–199.
  7. Video compression with rate-distortion autoencoders. In Proceedings of the IEEE/CVF International Conference on Computer Vision. 7033–7042.
  8. Hmfvc: A human-machine friendly video compression scheme. IEEE Transactions on Circuits and Systems for Video Technology (2022).
  9. Rexcam: Resource-efficient, cross-camera video analytics at scale. arXiv preprint arXiv:1811.01268 (2018).
  10. Chameleon: scalable adaptation of video analytics. In Proceedings of the 2018 Conference of the ACM Special Interest Group on Data Communication. 253–266.
  11. Fire and smoke detection in video with optimal mass transport based optical flow and neural networks. In 2010 IEEE International Conference on Image Processing. 761–764. https://doi.org/10.1109/ICIP.2010.5652119
  12. Handwritten zip code recognition with multilayer networks. In [1990] Proceedings. 10th International Conference on Pattern Recognition, Vol. ii. 35–40 vol.2. https://doi.org/10.1109/ICPR.1990.119325
  13. Convolutional neural network-based block up-sampling for intra frame coding. IEEE Transactions on Circuits and Systems for Video Technology 28, 9 (2017), 2316–2330.
  14. TAPU: A Transmission-Analytics Processing Unit for Accelerating Multifunctions in IoT Gateways. IEEE Internet of Things Journal 10, 20 (2023), 18181–18197. https://doi.org/10.1109/JIOT.2023.3279892
  15. M-LVC: Multiple frames prediction for learned video compression. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 3546–3554.
  16. Convolutional Neural Network-Based Block Up-Sampling for HEVC. IEEE Transactions on Circuits and Systems for Video Technology 29, 12 (2019), 3701–3715. https://doi.org/10.1109/TCSVT.2018.2884203
  17. Rt-mdl: Supporting real-time mixed deep learning tasks on edge platforms. In Proceedings of the 19th ACM conference on embedded networked sensor systems. 1–14.
  18. Caesar: cross-camera complex activity recognition. In Proceedings of the 17th Conference on Embedded Networked Sensor Systems. 232–244.
  19. Dvc: An end-to-end deep video compression framework. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition. 11006–11015.
  20. An end-to-end learning framework for video compression. IEEE transactions on pattern analysis and machine intelligence 43, 10 (2020), 3292–3308.
  21. A Feedback-Driven DNN Inference Acceleration System for Edge-Assisted Video Analytics. IEEE Trans. Comput. 72, 10 (2023), 2902–2912. https://doi.org/10.1109/TC.2023.3275094
  22. Supervised compression for resource-constrained edge computing systems. In Proceedings of the IEEE/CVF Winter Conference on Applications of Computer Vision. 2685–2695.
  23. David Minnen and Saurabh Singh. 2020. Channel-wise autoregressive entropy models for learned image compression. In 2020 IEEE International Conference on Image Processing (ICIP). IEEE, 3339–3343.
  24. A. N. Netravali and J. A. Stuller. 1979. Motion-compensated transform coding. The Bell System Technical Journal 58, 7 (1979), 1703–1718. https://doi.org/10.1002/j.1538-7305.1979.tb02277.x
  25. Neural network based intra prediction for video coding. In Applications of Digital Image Processing XLI, Vol. 10752. SPIE, 359–365.
  26. Video analytics for retail. In 2007 IEEE Conference on Advanced Video and Signal Based Surveillance. 423–428. https://doi.org/10.1109/AVSS.2007.4425348
  27. Temporal context mining for learned video compression. IEEE Transactions on Multimedia (2022).
  28. Unsupervised learning of video representations using lstms. In International conference on machine learning. PMLR, 843–852.
  29. Overview of the high efficiency video coding (HEVC) standard. IEEE Transactions on circuits and systems for video technology 22, 12 (2012), 1649–1668.
  30. Interframe coding that follows the motion. Proc. Institute of Electronics and Communication Engineers Jpn. Annu. Conv.(IECEJ) (1974), 1263.
  31. Attention is all you need. Advances in neural information processing systems 30 (2017).
  32. Yuri Vatis and Joern Ostermann. 2009. Adaptive Interpolation Filter for H.264/AVC. IEEE Transactions on Circuits and Systems for Video Technology 19, 2 (2009), 179–192. https://doi.org/10.1109/TCSVT.2008.2009259
  33. Overview of the H.264/AVC video coding standard. IEEE Transactions on Circuits and Systems for Video Technology 13, 7 (2003), 560–576. https://doi.org/10.1109/TCSVT.2003.815165
  34. Convolutional neural network-based fractional-pixel motion compensation. IEEE Transactions on Circuits and Systems for Video Technology 29, 3 (2018), 840–853.
  35. Learning for video compression with recurrent auto-encoder and recurrent probability model. IEEE Journal of Selected Topics in Signal Processing 15, 2 (2020), 388–401.
  36. Deeprt: A soft real time scheduler for computer vision applications on the edge. In 2021 IEEE/ACM Symposium on Edge Computing (SEC). IEEE, 271–284.
  37. Task-driven video compression for humans and machines: Framework design and optimization. IEEE Transactions on Multimedia (2022).
  38. AccDecoder: Accelerated Decoding for Neural-enhanced Video Analytics. In IEEE INFOCOM 2023 - IEEE Conference on Computer Communications. 1–10. https://doi.org/10.1109/INFOCOM53939.2023.10228933
  39. The design and implementation of a wireless video surveillance system. In Proceedings of the 21st annual international conference on mobile computing and networking. 426–438.

Summary

We haven't generated a summary for this paper yet.

Dice Question Streamline Icon: https://streamlinehq.com

Follow-up Questions

We haven't generated follow-up questions for this paper yet.

X Twitter Logo Streamline Icon: https://streamlinehq.com