Variable-Rate Learned Image Compression with Multi-Objective Optimization and Quantization-Reconstruction Offsets (2402.18930v1)
Abstract: Achieving successful variable bitrate compression with computationally simple algorithms from a single end-to-end learned image or video compression model remains a challenge. Many approaches have been proposed, including conditional auto-encoders, channel-adaptive gains for the latent tensor or uniformly quantizing all elements of the latent tensor. This paper follows the traditional approach to vary a single quantization step size to perform uniform quantization of all latent tensor elements. However, three modifications are proposed to improve the variable rate compression performance. First, multi objective optimization is used for (post) training. Second, a quantization-reconstruction offset is introduced into the quantization operation. Third, variable rate quantization is used also for the hyper latent. All these modifications can be made on a pre-trained single-rate compression model by performing post training. The algorithms are implemented into three well-known image compression models and the achieved variable rate compression results indicate negligible or minimal compression performance loss compared to training multiple models. (Codes will be shared at https://github.com/InterDigitalInc/CompressAI)
- “Elic: Efficient learned image compression with unevenly grouped space-channel contextual adaptive coding,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 5718–5727.
- “Neural video compression with diverse contexts,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 22616–22626.
- “An overview of core coding tools in the av1 video codec,” in 2018 picture coding symposium (PCS). IEEE, 2018, pp. 41–45.
- “End-to-end optimized image compression,” in 5th International Conference on Learning Representations, ICLR 2017, 2017.
- “Variational image compression with a scale hyperprior,” in International Conference on Learning Representations, 2018.
- “Joint autoregressive and hierarchical priors for learned image compression,” Advances in neural information processing systems, vol. 31, 2018.
- “Variable rate deep image compression with a conditional autoencoder,” in Proceedings of the IEEE/CVF International Conference on Computer Vision, 2019, pp. 3146–3154.
- “A deeply modulated scheme for variable-rate video compression,” in 2021 IEEE International Conference on Image Processing (ICIP). IEEE, 2021, pp. 3722–3726.
- “Asymmetric gained deep image compression with continuous rate adaptation,” in Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2021, pp. 10532–10541.
- “Qvrf: A quantization-error-aware variable rate framework for learned image compression,” arXiv preprint arXiv:2303.05744, 2023.
- “Multi-task learning as multi-objective optimization,” Advances in neural information processing systems, vol. 31, 2018.
- Jean-Antoine Désidéri, “Multiple-gradient descent algorithm (mgda) for multiobjective optimization,” Comptes Rendus Mathematique, vol. 350, no. 5-6, pp. 313–318, 2012.
- “Nonlinear transform coding,” IEEE Journal of Selected Topics in Signal Processing, vol. 15, no. 2, pp. 339–353, 2020.
- “Compressai: a pytorch library and evaluation platform for end-to-end compression research,” arXiv preprint arXiv:2011.03029, 2020.
- “Video enhancement with task-oriented flow,” International Journal of Computer Vision (IJCV), vol. 127, no. 8, pp. 1106–1125, 2019.
- Fatih Kamisli (10 papers)
- Hyomin Choi (20 papers)
- Fabien Racape (3 papers)