Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Leveraging Bitstream Metadata for Fast, Accurate, Generalized Compressed Video Quality Enhancement (2202.00011v3)

Published 31 Jan 2022 in eess.IV, cs.CV, and cs.LG

Abstract: Video compression is a central feature of the modern internet powering technologies from social media to video conferencing. While video compression continues to mature, for many compression settings, quality loss is still noticeable. These settings nevertheless have important applications to the efficient transmission of videos over bandwidth constrained or otherwise unstable connections. In this work, we develop a deep learning architecture capable of restoring detail to compressed videos which leverages the underlying structure and motion information embedded in the video bitstream. We show that this improves restoration accuracy compared to prior compression correction methods and is competitive when compared with recent deep-learning-based video compression methods on rate-distortion while achieving higher throughput. Furthermore, we condition our model on quantization data which is readily available in the bitstream. This allows our single model to handle a variety of different compression quality settings which required an ensemble of models in prior work.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (7)
  1. Max Ehrlich (14 papers)
  2. Jon Barker (26 papers)
  3. Namitha Padmanabhan (5 papers)
  4. Larry Davis (41 papers)
  5. Andrew Tao (40 papers)
  6. Bryan Catanzaro (123 papers)
  7. Abhinav Shrivastava (120 papers)
Citations (1)

Summary

We haven't generated a summary for this paper yet.