Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
97 tokens/sec
GPT-4o
53 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
5 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Learned Video Compression with Feature-level Residuals (2004.08283v2)

Published 17 Apr 2020 in eess.IV

Abstract: In this paper, we present an end-to-end video compression network for P-frame challenge on CLIC. We focus on deep neural network (DNN) based video compression, and improve the current frameworks from three aspects. First, we notice that pixel space residuals is sensitive to the prediction errors of optical flow based motion compensation. To suppress the relative influence, we propose to compress the residuals of image feature rather than the residuals of image pixels. Furthermore, we combine the advantages of both pixel-level and feature-level residual compression methods by model ensembling. Finally, we propose a step-by-step training strategy to improve the training efficiency of the whole framework. Experiment results indicate that our proposed method achieves 0.9968 MS-SSIM on CLIC validation set and 0.9967 MS-SSIM on test set.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (6)
  1. Runsen Feng (15 papers)
  2. Yaojun Wu (11 papers)
  3. Zongyu Guo (19 papers)
  4. Zhizheng Zhang (60 papers)
  5. Xin Jin (285 papers)
  6. Zhibo Chen (176 papers)
Citations (26)

Summary

We haven't generated a summary for this paper yet.