Image Compression with Recurrent Neural Network and Generalized Divisive Normalization (2109.01999v1)

Published 5 Sep 2021 in eess.IV, cs.CV, and cs.MM

Abstract: Image compression is a method to remove spatial redundancy between adjacent pixels and reconstruct a high-quality image. In the past few years, deep learning has gained huge attention from the research community and produced promising image reconstruction results. Therefore, recent methods focused on developing deeper and more complex networks, which significantly increased network complexity. In this paper, two effective novel blocks are developed: analysis and synthesis block that employs the convolution layer and Generalized Divisive Normalization (GDN) in the variable-rate encoder and decoder side. Our network utilizes a pixel RNN approach for quantization. Furthermore, to improve the whole network, we encode a residual image using LSTM cells to reduce unnecessary information. Experimental results demonstrated that the proposed variable-rate framework with novel blocks outperforms existing methods and standard image codecs, such as George's ~\cite{002} and JPEG in terms of image similarity. The project page along with code and models are available at https://khawar512.github.io/cvpr/

Authors (4)

Khawar Islam (9 papers)
L. Minh Dang (1 paper)
Sujin Lee (14 papers)
Hyeonjoon Moon (3 papers)

Citations (27)

View on Semantic Scholar

Summary

We haven't generated a summary for this paper yet.

Summarize Now

Image Compression with Recurrent Neural Network and Generalized Divisive Normalization (2109.01999v1)

Summary

Related Papers