Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
119 tokens/sec
GPT-4o
56 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
47 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

Predictive Coding For Animation-Based Video Compression (2307.04187v1)

Published 9 Jul 2023 in cs.CV and cs.MM

Abstract: We address the problem of efficiently compressing video for conferencing-type applications. We build on recent approaches based on image animation, which can achieve good reconstruction quality at very low bitrate by representing face motions with a compact set of sparse keypoints. However, these methods encode video in a frame-by-frame fashion, i.e. each frame is reconstructed from a reference frame, which limits the reconstruction quality when the bandwidth is larger. Instead, we propose a predictive coding scheme which uses image animation as a predictor, and codes the residual with respect to the actual target frame. The residuals can be in turn coded in a predictive manner, thus removing efficiently temporal dependencies. Our experiments indicate a significant bitrate gain, in excess of 70% compared to the HEVC video standard and over 30% compared to VVC, on a datasetof talking-head videos

Definition Search Book Streamline Icon: https://streamlinehq.com
References (18)
  1. “Ultra-low bitrate video conferencing using deep image animation,” in IEEE ICASSP, 2020.
  2. “A hybrid animation codec for low bitrate video conferencing,” in IEEE ICIP, 2022.
  3. “One-shot free-view neural talking-head synthesis for video conferencing,” in CVPR, 2021.
  4. “Low bandwidth video-chat compression using deep generative models,” arXiv preprint arXiv:2012.00328, 2020.
  5. “Beyond keypoint coding: Temporal evolution inference with compact feature representation for talking face video compression,” in DCC, 2022.
  6. “Robust ultralow bitrate video conferencing with second order motion coherency,” in 2022 IEEE 24th International Workshop on Multimedia Signal Processing (MMSP), 2022, pp. 1–6.
  7. “First order motion model for image animation,” in Neurips, 2019.
  8. “Real-time adaptive image compression,” arXiv preprint arXiv:1705.05823, 2017.
  9. “Variational image compression with a scale hyperprior,” arXiv preprint arXiv:1802.01436, 2018.
  10. “Generative adversarial networks for extreme learned image compression,” in IEEE ICCV, 2019.
  11. “Fvc: A new frame- work towards deep video compression in feature space,” in IEEE Conference on Computer Vision and Pattern Recognition(CVPR), 2021.
  12. “End-to-end optimized image compression,” Neurips, 2016.
  13. “Dvc: An end-to-end deep video compression framework,” in IEEE CVPR, 2019.
  14. “Vct: A video compression transformer,” in NeurIPS, 2022.
  15. J. Cleary and I. Witten, “Data compression using adaptive coding and partial string matching,” IEEE Transactions on Communications, vol. 32, no. 4, pp. 396–402, 1984.
  16. “The unreasonable effectiveness of deep features as a perceptual metric,” in CVPR, 2018.
  17. “Very deep convolutional networks for large scale image recognition,” in ICLR, 2015.
  18. “Image quality assessment: Unifying structure and texture similarity,” in IEEE Transactions on Pattern Analysis and Machine Intelligence, 2022, vol. 44, pp. 2567–2581.
User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (3)
  1. Goluck Konuko (5 papers)
  2. Stéphane Lathuilière (79 papers)
  3. Giuseppe Valenzise (23 papers)
Citations (6)

Summary

We haven't generated a summary for this paper yet.