- The paper presents a novel approach that replaces quantization with integer matrix factorization in lossy image compression.
- It employs an iterative Block Coordinate Descent algorithm with closed-form updates to achieve low-rank approximation using integer matrices.
- Experimental results demonstrate higher PSNR, SSIM, and classification accuracy at low bit rates compared to traditional JPEG and SVD techniques.
Quantization-free Lossy Image Compression Using Integer Matrix Factorization
The paper "Quantization-free Lossy Image Compression Using Integer Matrix Factorization" introduces a novel approach to lossy image compression that bypasses the quantization step traditionally required by methods such as JPEG and those based on Singular Value Decomposition (SVD). The authors propose a methodology founded on Integer Matrix Factorization (IMF), which inherently operates without the need for quantization. This is achieved by representing image data as low-rank approximations involving integer values, thus potentially circumventing the quantization errors typically observed in other methods.
Introduction
In the context of image compression, traditional methods, including JPEG and JPEG 2000, use orthogonal linear transformations like Discrete Cosine Transform (DCT) and Discrete Wavelet Transform (DWT) to represent image data in a continuous domain. As a result, these methods require a quantization step prior to byte-level coding, which can introduce significant quantization errors. While SVD-based methods find application due to their energy compaction properties, they again face an acute sensitivity to quantization errors, particularly at low bit rates.
The paper addresses these limitations by proposing IMF as an alternative that maintains integer-only data representations. Through this novel approach, the researchers aim to deliver effective compression performance while minimizing reconstruction errors.
Method
Overall Encoding Framework
The proposed method consists of several key steps:
- Color Space Transformation: Images are first transformed to the YC\textsubscript{B}C\textsubscript{R} color space.
- Chroma Downsampling: The chroma components are downsampled.
- Patchification: The image is split into non-overlapping patches, which are then flattened and stacked to form matrices ready for factorization.
- Low-rank Approximation Using IMF: These matrices are then low-rank approximated using IMF, which factors them into products of smaller integer matrices.
- Lossless Compression: The integer matrices are directly encoded using a lossless compression standard.
Each of these steps is carefully designed to leverage integer matrix operations, ensuring that the entire pipeline remains quantization-free.
Integer Matrix Factorization (IMF)
IMF aims to approximate a given matrix X∈RM×N as a product UVT where U and V are integer matrices within specified bounds. The formulation is crucial because it eliminates the need for quantization, thus significantly reducing errors.
A unique iterative algorithm based on Block Coordinate Descent (BCD) is employed for solving the IMF problem. The authors derive closed-form solutions for updating each factor in this setting, ensuring the convergence and efficiency of the algorithm.
Experiments and Results
The proposed IMF method was extensively evaluated against JPEG and SVD-based compression methods using datasets such as Kodak, CLIC 2024, and the ImageNet validation set. Evaluation metrics included PSNR, SSIM, and classification accuracy for a pre-trained ImageNet classifier on compressed images.
The results indicated that IMF consistently outperformed JPEG and SVD, particularly at low bit rates. For instance, on the Kodak and CLIC 2024 datasets, IMF achieved higher PSNR and SSIM values at bit rates below 0.25 bpp, while remaining competitive at higher bit rates. Additionally, IMF demonstrated superior performance compared to SVD across all evaluated bit rates.
Qualitative analysis further highlighted the advantages of IMF compression. When compared at similar bit rates, IMF provided better preservation of color fidelity and texture, avoiding the typical artifacts seen in JPEG such as blocking and color bleeding.
IMF's ability to maintain semantic integrity in images was also verified. The ResNet-50 classifier achieved over 5% higher top-1 accuracy on IMF-compressed images compared to those compressed using JPEG, particularly noticeable at bit rates below 0.25 bpp.
Discussion
The research reveals that IMF provides a significant improvement over traditional methods by effectively balancing compression efficiency with image quality. The quantization-free nature of IMF addresses the inherent limitations of SVD and JPEG related to quantization errors. Moreover, the methodology's compatibility with existing lossless compression standards adds to its practical applicability.
Auxiliary ablation studies underscored the impact of various parameters such as factor bounds, the number of BCD iterations, and patch size on the performance, suggesting that optimal configuration is crucial for maximizing IMF's benefits.
Conclusion
The proposed IMF method sets a promising direction in the domain of lossy image compression by eliminating the need for quantization, thus enhancing reconstruction accuracy. The efficient iterative algorithm ensures practical feasibility, and empirical results affirm IMF's superior performance in maintaining visual and semantic quality at lower bit rates. Future developments may incorporate entropy-aware regularization to further refine compression efficiency and mitigate any artifacts associated with patchification.