
DVC-P: Deep Video Compression with Perceptual Optimizations (2109.10849v2)

Published 22 Sep 2021 in eess.IV, cs.CV, and cs.MM

Abstract: Recent years have witnessed significant development of learning-based video compression methods, which aim at optimizing objective or perceptual quality together with bit rate. In this paper, we introduce deep video compression with perceptual optimizations (DVC-P), which aims at increasing the perceptual quality of decoded videos. Our proposed DVC-P is based on the Deep Video Compression (DVC) network, but improves it with perceptual optimizations. Specifically, a discriminator network and a mixed loss are employed to help our network trade off among distortion, perception, and rate. Furthermore, nearest-neighbor interpolation is used to eliminate checkerboard artifacts, which can appear in sequences encoded with DVC frameworks. Thanks to these two improvements, the perceptual quality of decoded sequences is improved. Experimental results demonstrate that, compared with the baseline DVC, our proposed method generates videos with higher perceptual quality, achieving a 12.27% reduction in a perceptual BD-rate equivalent, on average.
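The two ideas named in the abstract can be illustrated with a minimal sketch. The code below is not the authors' implementation: the function names, the loss form (a weighted sum of rate, distortion, and a non-saturating adversarial term), and the weights `lam`, `alpha`, and `beta` are all assumptions chosen for illustration. It shows nearest-neighbor upsampling, which simply repeats each sample rather than overlapping learned kernels, and a scalar mixed loss that balances rate, distortion, and a discriminator-based perception term.

```python
import math


def nearest_neighbor_upsample(frame, scale=2):
    """Upsample a 2D list of samples by repeating each one scale x scale times.

    Every output sample is copied from exactly one input sample, so the
    uneven kernel overlap that tends to cause checkerboard patterns in
    strided transposed convolutions does not occur.
    """
    out = []
    for row in frame:
        # Repeat each sample horizontally.
        wide = [value for value in row for _ in range(scale)]
        # Repeat the widened row vertically.
        for _ in range(scale):
            out.append(list(wide))
    return out


def mixed_loss(rate_bpp, distortion, disc_prob_real,
               lam=1.0, alpha=1.0, beta=0.1):
    """Hypothetical rate-distortion-perception loss (weights are illustrative).

    rate_bpp:       estimated bits per pixel of the compressed frame
    distortion:     e.g. mean squared error between input and decoded frame
    disc_prob_real: discriminator's probability that the decoded frame is real
    """
    # Non-saturating generator term: small when the discriminator is fooled.
    adversarial = -math.log(max(disc_prob_real, 1e-12))
    return lam * rate_bpp + alpha * distortion + beta * adversarial


if __name__ == "__main__":
    print(nearest_neighbor_upsample([[1, 2], [3, 4]]))
    print(mixed_loss(rate_bpp=0.5, distortion=0.2, disc_prob_real=0.5))
```

In a real codec the upsampling step would be followed by an ordinary convolution, and the three loss terms would be computed on tensors by a deep learning framework. The scalar version here only shows how the trade-off is weighted.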

Authors (6)
  1. Saiping Zhang (6 papers)
  2. Marta Mrak (25 papers)
  3. Luis Herranz (46 papers)
  4. Marc Górriz (7 papers)
  5. Shuai Wan (16 papers)
  6. Fuzheng Yang (8 papers)
Citations (5)