ED: Perceptually tuned Enhanced Compression Model (2401.02145v1)
Abstract: This paper summarises the design of the candidate ED for the Challenge on Learned Image Compression 2024. This candidate provides an anchor, based on conventional coding technologies, for the learning-based approaches mostly targeted in the challenge. The proposed candidate is based on the Enhanced Compression Model (ECM) developed at JVET, the Joint Video Experts Team of ITU-T VCEG and ISO/IEC MPEG. Here, ECM is adapted to the challenge objective: to maximise the perceived quality, the encoding is performed according to a perceptual metric, and the sequence selection is also performed in a perceptual manner to fit the target bits-per-pixel objectives. The primary objective of this candidate is to assess the recent developments in video coding standardisation and, in parallel, to evaluate the progress made by learning-based techniques. To this end, this paper explains how to generate coded images that fulfil the challenge requirements in a reproducible way, targeting maximum performance.
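As an illustration of what fitting a bits-per-pixel target involves, the following minimal sketch computes the rate of a coded image and, among several candidate encodes, keeps the one with the best perceptual score that still respects the budget. It is not the procedure used by the candidate, which the abstract does not detail: the `Candidate` type, the `pick_candidate` helper, the score values, and the 0.15 bpp target are all hypothetical and chosen only for the example.

```python
from dataclasses import dataclass
from typing import Optional, Sequence


@dataclass(frozen=True)
class Candidate:
    """One hypothetical encode of an image (illustrative, not from the paper)."""
    qp: int            # quantisation parameter used for this encode
    size_bytes: int    # size of the coded bitstream in bytes
    score: float       # hypothetical perceptual score, higher is better


def bits_per_pixel(size_bytes: int, width: int, height: int) -> float:
    """Rate of a coded image: total bits divided by the number of pixels."""
    return 8.0 * size_bytes / (width * height)


def pick_candidate(
    candidates: Sequence[Candidate],
    width: int,
    height: int,
    target_bpp: float,
) -> Optional[Candidate]:
    """Return the best-scoring candidate whose rate does not exceed target_bpp.

    Returns None when no candidate fits the budget.
    """
    feasible = [
        c for c in candidates
        if bits_per_pixel(c.size_bytes, width, height) <= target_bpp
    ]
    if not feasible:
        return None
    return max(feasible, key=lambda c: c.score)


# Illustrative numbers only: a 1000x800 image and an assumed 0.15 bpp target.
example = [
    Candidate(qp=30, size_bytes=20000, score=0.9),  # 0.20 bpp, over budget
    Candidate(qp=34, size_bytes=14000, score=0.8),  # 0.14 bpp, fits
    Candidate(qp=38, size_bytes=9000, score=0.7),   # 0.09 bpp, fits
]
chosen = pick_candidate(example, width=1000, height=800, target_bpp=0.15)
```

In this sketch the budget is checked per image. A challenge that constrains the average rate over a whole test set would instead need a joint selection across images, which is beyond this example.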
- “Overview of the versatile video coding (VVC) standard and its applications,” IEEE Transactions on Circuits and Systems for Video Technology, vol. 31, no. 10, pp. 3736–3764, 2021.
- JVET-AF0006, “JVET AHG report: ECM software development,” 32nd JVET Meeting, Hannover, DE, 13–20 October 2023.
- JVET, “https://vcgit.hhi.fraunhofer.de/jvet/vvcsoftware_vtm,” 2021.
- JVET-AE2025, “Algorithm description of enhanced compression model 10 (ECM 10),” 31st JVET Meeting, Geneva, CH, 11–19 July 2023.
- International Telecommunication Union, “Recommendation ITU-R BT.709-6: Parameter values for the HDTV standards for production and international programme exchange,” 2015.
- Christian Helmrich et al., “AHG10: Improved perceptually optimized QP adaptation and associated distortion measure,” doc. JVET-K0206, Ljubljana, July 2018.
- Christian Helmrich et al., “XPSNR: A low-complexity extension of the perceptually weighted peak signal-to-noise ratio for high-resolution video quality assessment,” in ICASSP 2020 - 2020 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP), 2020, pp. 2727–2731.
- Netflix, “VMAF - video multi-method assessment fusion,” 2018.