Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
110 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
44 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

End-to-end Optimized Video Compression with MV-Residual Prediction (2005.12945v1)

Published 26 May 2020 in eess.IV and cs.CV

Abstract: We present an end-to-end trainable framework for P-frame compression in this paper. A joint motion vector (MV) and residual prediction network MV-Residual is designed to extract the ensembled features of motion representations and residual information by treating the two successive frames as inputs. The prior probability of the latent representations is modeled by a hyperprior autoencoder and trained jointly with the MV-Residual network. Specially, the spatially-displaced convolution is applied for video frame prediction, in which a motion kernel for each pixel is learned to generate predicted pixel by applying the kernel at a displaced location in the source image. Finally, novel rate allocation and post-processing strategies are used to produce the final compressed bits, considering the bits constraint of the challenge. The experimental results on validation set show that the proposed optimized framework can generate the highest MS-SSIM for P-frame compression competition.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. XiangJi Wu (2 papers)
  2. Ziwen Zhang (11 papers)
  3. Jie Feng (103 papers)
  4. Lei Zhou (126 papers)
  5. Junmin Wu (4 papers)
Citations (2)

Summary

We haven't generated a summary for this paper yet.