NTIRE 2021 Challenge on Quality Enhancement of Compressed Video: Methods and Results (2104.10781v6)

Published 21 Apr 2021 in eess.IV and cs.CV

Abstract: This paper reviews the first NTIRE challenge on quality enhancement of compressed video, with a focus on the proposed methods and results. In this challenge, the new Large-scale Diverse Video (LDV) dataset is employed. The challenge has three tracks. Tracks 1 and 2 aim at enhancing the videos compressed by HEVC at a fixed QP, while Track 3 is designed for enhancing the videos compressed by x265 at a fixed bit-rate. Besides, the quality enhancement of Tracks 1 and 3 targets at improving the fidelity (PSNR), and Track 2 targets at enhancing the perceptual quality. The three tracks totally attract 482 registrations. In the test phase, 12 teams, 8 teams and 11 teams submitted the final results of Tracks 1, 2 and 3, respectively. The proposed methods and solutions gauge the state-of-the-art of video quality enhancement. The homepage of the challenge: https://github.com/RenYang-home/NTIRE21_VEnh

Authors (72)

Ren Yang (25 papers)
Radu Timofte (299 papers)
Jing Liu (526 papers)
Yi Xu (304 papers)
Xinjian Zhang (10 papers)
Minyi Zhao (11 papers)
Shuigeng Zhou (81 papers)
Kelvin C. K. Chan (34 papers)
Shangchen Zhou (58 papers)
Xiangyu Xu (48 papers)
Chen Change Loy (288 papers)
Xin Li (980 papers)
Fanglong Liu (4 papers)
He Zheng (7 papers)
Lielin Jiang (2 papers)
Qi Zhang (785 papers)
Dongliang He (46 papers)
Fu Li (86 papers)
Qingqing Dang (15 papers)
Yibin Huang (7 papers)

Citations (37)

Summary

The paper reviews the NTIRE 2021 Challenge on quality enhancement of compressed video, which was built on the newly introduced Large-scale Diverse Video (LDV) dataset. This summary outlines the key methods and results, emphasizing the technical approaches the participating teams took across three tracks (Tracks 1, 2, and 3) that differ in evaluation metric and compression setting.

Challenge Overview

The NTIRE 2021 Challenge consisted of three tracks. Tracks 1 and 2 addressed videos compressed by HEVC at a fixed QP, while Track 3 addressed videos compressed by x265 at a fixed bit-rate. Tracks 1 and 3 targeted fidelity, with evaluations based on metrics such as PSNR and MS-SSIM. Track 2 targeted perceptual quality, judged by MOS scores and other perceptual metrics like LPIPS and FID.
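Since PSNR is the fidelity metric named above, a minimal sketch of how it can be computed may help. The function names (`mse`, `psnr`, `video_psnr`), the 8-bit peak value of 255, and the per-frame averaging are assumptions made for illustration; the challenge's exact evaluation protocol (color space, averaging order) is not specified in this summary.

```python
import math


def mse(ref, out):
    """Mean squared error between two equally sized frames (lists of pixel rows)."""
    total = 0.0
    count = 0
    for ref_row, out_row in zip(ref, out):
        for r, o in zip(ref_row, out_row):
            total += (r - o) ** 2
            count += 1
    return total / count


def psnr(ref, out, peak=255.0):
    """PSNR in dB: 10 * log10(peak^2 / MSE). Identical frames give infinity."""
    err = mse(ref, out)
    if err == 0:
        return math.inf
    return 10.0 * math.log10(peak ** 2 / err)


def video_psnr(ref_frames, out_frames, peak=255.0):
    """Average of per-frame PSNR values (an assumed aggregation, see lead-in)."""
    scores = [psnr(r, o, peak) for r, o in zip(ref_frames, out_frames)]
    return sum(scores) / len(scores)
```

An enhancement method improves fidelity in this sense when `video_psnr(raw, enhanced)` is higher than `video_psnr(raw, compressed)` for the same uncompressed reference frames.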

Methods and Techniques

A variety of approaches were presented by the participating teams to tackle the challenges set forth in each track. Some key highlights include:

BILIBILI AI & FDU Team: They proposed a Spatiotemporal Model with Gated Fusion for fidelity tracks and a perceptual extension for the perceptual track. Their architecture employed deformable convolutions and channel attention mechanisms enhanced by gated fusion.
NTU-SLab Team: They introduced the BasicVSR++ method, which improved upon BasicVSR through grid propagation and flow-guided deformable alignment to efficiently capture and align spatiotemporal features.
VUE Team: They leveraged BasicVSR with multi-stage training and ensemble techniques for fidelity tracks, and proposed an adaptive spatial-temporal fusion for perceptual quality enhancement.
NOAHTCV Team: Implemented a multi-scale network with a deformable temporal fusion mechanism, using a shared U-Net model for feature extraction and alignment, optimized through tailored loss functions for each track.
MT.MaxClear Team: Their work focused on stability by introducing regularization techniques in deformable convolutions, enhancing existing EDVR frameworks by offset stabilization for continuity across frames.
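Several of the methods above combine features from more than one branch. As a generic illustration of the gated fusion idea, the sketch below blends two feature vectors with a per-element sigmoid gate. It is not any team's architecture: the names `sigmoid` and `gated_fusion` are hypothetical, and in a real network the gate logits would be produced by learned layers rather than passed in by hand.

```python
import math


def sigmoid(x):
    """Numerically stable logistic function."""
    if x >= 0:
        return 1.0 / (1.0 + math.exp(-x))
    e = math.exp(x)
    return e / (1.0 + e)


def gated_fusion(feat_a, feat_b, gate_logits):
    """Blend two feature vectors element-wise: g * a + (1 - g) * b.

    g = sigmoid(gate_logit), so each gate lies in (0, 1) and decides how much
    of branch A versus branch B reaches the fused output.
    """
    if not (len(feat_a) == len(feat_b) == len(gate_logits)):
        raise ValueError("all inputs must have the same length")
    fused = []
    for a, b, z in zip(feat_a, feat_b, gate_logits):
        g = sigmoid(z)
        fused.append(g * a + (1.0 - g) * b)
    return fused
```

With all gate logits at zero, the result is the plain average of the two branches; strongly positive logits pass branch A through and strongly negative logits pass branch B through.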

Results and Implications

The results indicated varied success across different methodologies, with the BILIBILI AI & FDU and NTU-SLab teams consistently achieving top results across multiple tracks. The challenge facilitated a deeper understanding of effective methodologies for video compression artifact reduction, with emphasis on trade-offs between computational efficiency and video fidelity. The contributions of grid propagation and constrained deformable alignment have shown promise in advancing state-of-the-art techniques in video quality enhancement.

Future Directions

The NTIRE 2021 Challenge highlights the increasing complexity and demands in providing high-quality video content under resource constraints. Future research directions may include:

Developing models that balance speed and quality for real-time applications.
Applying novel architectures like transformers for video enhancement tasks.
Improving robustness to diverse video content, so that enhancement remains reliable regardless of scene complexity or motion characteristics.

Conclusion

The NTIRE 2021 challenge successfully showcased the breadth of methods applicable to video quality enhancement, from traditional convolutional networks to advanced methods employing optical flow and deformable convolutions. Continued advancements in this domain will likely leverage the collective insights gained through challenges like NTIRE, driving innovations in efficient video streaming and consumption.

GitHub

GitHub - RenYang-home/NTIRE21_VEnh (71 stars)