
Learned Variable-Rate Multi-Frequency Image Compression using Modulated Generalized Octave Convolution (2009.13074v1)

Published 25 Sep 2020 in eess.IV

Abstract: In this proposal, we design a learned multi-frequency image compression approach that uses generalized octave convolutions to factorize the latent representations into high-frequency (HF) and low-frequency (LF) components. The LF components have lower resolution than the HF components, which can improve rate-distortion performance, similar to a wavelet transform. Moreover, compared to the original octave convolution, the proposed generalized octave convolution (GoConv) and octave transposed-convolution (GoTConv) with internal activation layers preserve more of the spatial structure of the information and enable more effective filtering between the HF and LF components, which further improves performance. In addition, we develop a variable-rate scheme that uses the Lagrangian parameter to modulate all the internal feature maps in the auto-encoder, which allows the scheme to cover the large bitrate range of JPEG AI with only three models. Experiments show that the proposed scheme achieves much better Y MS-SSIM than VVC. In terms of YUV PSNR, our scheme is very similar to HEVC.
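
As a rough illustration of the multi-frequency idea described in the abstract, the NumPy sketch below splits a feature tensor into an HF part at full resolution and an LF part at half resolution, then performs one exchange between the two. It is not the authors' GoConv or GoTConv: fixed average pooling, nearest-neighbour upsampling, and random 1x1 mixing weights stand in for the learned layers, and the names `split_frequencies`, `octave_exchange`, and `alpha` are hypothetical.

```python
import numpy as np

def avg_pool2(x):
    """Halve the spatial resolution of a (C, H, W) array with 2x2 average pooling."""
    c, h, w = x.shape
    return x.reshape(c, h // 2, 2, w // 2, 2).mean(axis=(2, 4))

def upsample2(x):
    """Double the spatial resolution of a (C, H, W) array by nearest-neighbour repetition."""
    return x.repeat(2, axis=1).repeat(2, axis=2)

def split_frequencies(features, alpha=0.5):
    """Split channels into HF (full resolution) and LF (half resolution).

    alpha is an assumed ratio of channels assigned to the LF branch.
    """
    c = features.shape[0]
    c_lf = int(alpha * c)
    hf = features[: c - c_lf]
    lf = avg_pool2(features[c - c_lf:])
    return hf, lf

def mix(x, weight):
    """1x1 convolution: mix channels with a (C_out, C_in) weight matrix."""
    return np.einsum("oc,chw->ohw", weight, x)

def octave_exchange(hf, lf, w_hh, w_hl, w_lh, w_ll):
    """One update in which each branch combines an intra- and an inter-frequency path."""
    hf_out = mix(hf, w_hh) + upsample2(mix(lf, w_lh))  # LF -> HF requires upsampling
    lf_out = mix(lf, w_ll) + mix(avg_pool2(hf), w_hl)  # HF -> LF requires downsampling
    return hf_out, lf_out

rng = np.random.default_rng(0)
features = rng.standard_normal((8, 16, 16))            # toy (C, H, W) feature map
hf, lf = split_frequencies(features, alpha=0.5)
w_hh, w_hl, w_lh, w_ll = (rng.standard_normal((4, 4)) for _ in range(4))
hf_out, lf_out = octave_exchange(hf, lf, w_hh, w_hl, w_lh, w_ll)
print(hf_out.shape, lf_out.shape)  # (4, 16, 16) (4, 8, 8)
```

The LF branch holds a quarter as many spatial positions per channel as the HF branch, which is where the saving in representation size comes from in this toy setup.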

Authors (10)
  1. Jianping Lin (7 papers)
  2. Mohammad Akbari (43 papers)
  3. Haisheng Fu (15 papers)
  4. Qian Zhang (308 papers)
  5. Shang Wang (25 papers)
  6. Jie Liang (82 papers)
  7. Dong Liu (267 papers)
  8. Feng Liang (61 papers)
  9. Guohe Zhang (7 papers)
  10. Chengjie Tu (6 papers)
Citations (19)
