Fidelity-preserving Learning-Based Image Compression: Loss Function and Subjective Evaluation Methodology (2403.11241v1)
Abstract: Learning-based image compression methods have emerged as state-of-the-art, showcasing higher performance compared to conventional compression solutions. These data-driven approaches aim to learn the parameters of a neural network model through iterative training on large amounts of data. The optimization process typically involves minimizing the distortion between the decoded and the original ground truth images. This paper focuses on perceptual optimization of learning-based image compression solutions and proposes: i) novel loss function to be used during training and ii) novel subjective test methodology that aims to evaluate the decoded image fidelity. According to experimental results from the subjective test taken with the new methodology, the optimization procedure can enhance image quality for low-rates while offering no advantage for high-rates.
- A. Creswell, T. White, V. Dumoulin, K. Arulkumaran, B. Sengupta, and A. A. Bharath, “Generative adversarial networks: an overview,” IEEE Signal Processing Magazine, vol. 35, no. 1, pp. 53–65, 2018.
- Y. Blau and T. Michaeli, “Rethinking lossy compression: The rate-distortion-perception tradeoff,” in International Conference on Machine Learning, pp. 675–685, PMLR, 2019.
- F. Mentzer, G. D. Toderici, M. Tschannen, and E. Agustsson, “High-fidelity generative image compression,” Advances in Neural Information Processing Systems, vol. 33, pp. 11913–11924, 2020.
- S. Lao, Y. Gong, S. Shi, S. Yang, T. Wu, J. Wang, W. Xia, and Y. Yang, “Attentions help CNNs see better: attention-based hybrid image quality assessment network,” in IEEE International Conference on Computer Vision and Pattern Recognition, (Orleans, LA, USA), 2022.
- ISO/IEC JTC 1/SC29/WG1 M96053, “Bytedance’s response to the JPEG AI call for proposals,” in 96th WG1 Meeting, Online, July 2022.
- R. Zhang, P. Isola, A. A. Efros, E. Shechtman, and O. Wang, “The unreasonable effectiveness of deep features as a perceptual metric,” in International Conference on Computer Vision and Pattern Recognition, (Salt Lake City, UT, USA), 2018.
- ISO/IEC WG1, “Information technology — advanced image coding and evaluation — part 2: Evaluation procedure for nearly lossless coding,” ISO/IEC 29170-2, 2015.
- Z. Wang, E. Simoncelli, and A. Bovik, “Multiscale structural similarity for image quality assessment,” in Asilomar Conference on Signals, Systems & Computers, (Pacific Grove, CA, USA), 2003.
- O. Rippel and L. Bourdev, “Real-time adaptive image compression,” in International Conference on Machine Learning, vol. 70, (Sydney, Australia), Aug 2017.
- E. Agustsson, M. Tschannen, F. Mentzer, R. Timofte, and L. V. Gool, “Generative adversarial networks for extreme learned image compression,” in International Conference on Computer Vision, (Seoul, Korea), 2019.
- D. He, Z. Yang, H. Yu, T. Xu, J. Luo, Y. Chen, C. Gao, X. Shi, H. Qin, and Y. Wang, “PO-ELIC: Perception-oriented efficient learned image coding,” in IEEE International Conference on Computer Vision and Pattern Recognition, (New Orleans, LA, USA), 2022.
- S. Mohammadi and J. Ascenso, “Perceptual impact of the loss function on deep-learning image coding performance,” in Picture Coding Symposium, (San Jose, CA, USA), 2022.
- B. Sun, M. Gu, D. He, T. Xu, Y. Wang, and H. Qin, “HLIC: Harmonizing optimization metrics in learned image compression by reinforcement learning,” arXiv preprint arXiv:2109.14863, 2021.
- L.-H. Chen, C. G. Bampis, Z. Li, A. Norkin, and A. C. Bovik, “ProxIQA: a proxy approach to perceptual optimization of learned image compression,” IEEE Transactions on Image Processing, vol. 30, p. 360–373, 2021.
- R. Rassool, “VMAF reproducibility: validating a perceptual practical video quality metric,” in IEEE International Symposium on Broadband Multimedia Systems and Broadcasting, (Cagliari, Italy), 2017.
- J. Bégaint, F. Racapé, S. Feltman, and A. Pushparaja, “CompressAI: a pytorch library and evaluation platform for end-to-end compression research,” arXiv preprint arXiv:2011.03029, 2020.
- H. Men, H. Lin, M. Jenadeleh, and D. Saupe, “Subjective image quality assessment with boosted triplet comparisons,” IEEE Access, vol. 9, pp. 138939–138975, 2021.
- M. Filipa, “Subjective test.” https://github.com/martafilipa/subjective_test/.
- ISO/IEC JTC 1/SC29/WG1 N100106, “JPEG AI common training and test conditions,” in 94th WG1 Meeting, Online, January 2022.
- Shima Mohammadi (8 papers)
- Yaojun Wu (11 papers)
- João Ascenso (66 papers)