Color Learning for Image Compression (2306.17460v1)
Abstract: Deep-learning-based image compression has gained considerable momentum in recent years. To obtain a method that is suitable for image compression and can subsequently be extended to video compression, we propose a novel deep learning model architecture in which the task of image compression is divided into two sub-tasks: learning structural information from the luminance channel and learning color from the chrominance channels. The model has two separate branches that process the luminance and chrominance components. The CIEDE2000 color difference metric is used in the loss function to optimize the model for color fidelity. We demonstrate the benefits of our approach and compare its performance to that of other codecs. Additionally, we visualize and analyze the impulse responses of the latent channels.
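As a rough illustration of the luminance/chrominance split described in the abstract, the sketch below separates an RGB image into one luma plane and one chroma plane, which is the kind of input each of the two branches would receive. The abstract does not say which color transform the model uses, so the full-range BT.601 (JFIF-style) YCbCr conversion here is an assumption, and the function names `rgb_to_ycbcr` and `split_luma_chroma` are hypothetical.

```python
# Illustrative sketch only. The BT.601 full-range YCbCr transform is an
# assumption; the abstract does not specify the color space used by the model.

def rgb_to_ycbcr(r, g, b):
    """Convert one 8-bit RGB pixel to full-range BT.601 YCbCr (floats)."""
    y = 0.299 * r + 0.587 * g + 0.114 * b
    cb = 128.0 - 0.168736 * r - 0.331264 * g + 0.5 * b
    cr = 128.0 + 0.5 * r - 0.418688 * g - 0.081312 * b
    return y, cb, cr


def split_luma_chroma(image):
    """Split an image (rows of (r, g, b) tuples) into luma and chroma planes.

    Returns (luma, chroma), where luma holds one Y value per pixel and
    chroma holds one (Cb, Cr) pair per pixel.
    """
    luma, chroma = [], []
    for row in image:
        converted = [rgb_to_ycbcr(*pixel) for pixel in row]
        luma.append([y for y, _, _ in converted])
        chroma.append([(cb, cr) for _, cb, cr in converted])
    return luma, chroma


if __name__ == "__main__":
    # A 1 x 2 toy image: one mid-gray pixel and one pure-red pixel.
    toy_image = [[(128, 128, 128), (255, 0, 0)]]
    luma_plane, chroma_plane = split_luma_chroma(toy_image)
    print("luma:", luma_plane)
    print("chroma:", chroma_plane)
```

In a two-branch design, the luma plane would feed the structure branch and the chroma plane the color branch; how the branches are built and combined is not covered by this sketch.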