Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
102 tokens/sec
GPT-4o
59 tokens/sec
Gemini 2.5 Pro Pro
43 tokens/sec
o3 Pro
6 tokens/sec
GPT-4.1 Pro
50 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

A Residual Diffusion Model for High Perceptual Quality Codec Augmentation (2301.05489v3)

Published 13 Jan 2023 in cs.CV and eess.IV

Abstract: Diffusion probabilistic models have recently achieved remarkable success in generating high quality image and video data. In this work, we build on this class of generative models and introduce a method for lossy compression of high resolution images. The resulting codec, which we call DIffuson-based Residual Augmentation Codec (DIRAC), is the first neural codec to allow smooth traversal of the rate-distortion-perception tradeoff at test time, while obtaining competitive performance with GAN-based methods in perceptual quality. Furthermore, while sampling from diffusion probabilistic models is notoriously expensive, we show that in the compression setting the number of steps can be drastically reduced.

User Edit Pencil Streamline Icon: https://streamlinehq.com
Authors (5)
  1. Noor Fathima Ghouse (1 paper)
  2. Jens Petersen (46 papers)
  3. Auke Wiggers (13 papers)
  4. Tianlin Xu (7 papers)
  5. Guillaume Sautière (4 papers)
Citations (17)
X Twitter Logo Streamline Icon: https://streamlinehq.com