NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-World Video Super-Resolution (2305.14669v3)
Abstract: The capability of video super-resolution (VSR) to synthesize high-resolution (HR) video from ideal datasets has been demonstrated in many works. However, applying the VSR model to real-world video with unknown and complex degradation remains a challenging task. First, existing degradation metrics in most VSR methods are not able to effectively simulate real-world noise and blur. On the contrary, simple combinations of classical degradation are used for real-world noise modeling, which led to the VSR model often being violated by out-of-distribution noise. Second, many SR models focus on noise simulation and transfer. Nevertheless, the sampled noise is monotonous and limited. To address the aforementioned problems, we propose a Negatives augmentation strategy for generalized noise modeling in Video Super-Resolution (NegVSR) task. Specifically, we first propose sequential noise generation toward real-world data to extract practical noise sequences. Then, the degeneration domain is widely expanded by negative augmentation to build up various yet challenging real-world noise sets. We further propose the augmented negative guidance loss to learn robust features among augmented negatives effectively. Extensive experiments on real-world datasets (e.g., VideoLQ and FLIR) show that our method outperforms state-of-the-art methods with clear margins, especially in visual quality. Project page is available at: https://negvsr.github.io/.
- The 2018 PIRM challenge on perceptual image super-resolution. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 0–0.
- BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR), 4947–4956.
- BasicVSR++: Improving video super-resolution with enhanced propagation and alignment. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 5972–5981.
- Investigating tradeoffs in real-world video super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5962–5971.
- Image blind denoising with generative adversarial network based noise modeling. In Proceedings of the IEEE conference on computer vision and pattern recognition, 3155–3164.
- Learning temporal coherence via self-supervision for GAN-based video generation. ACM Transactions on Graphics (TOG), 39(4): 75–1.
- Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552.
- Image super-resolution using deep convolutional networks. IEEE transactions on pattern analysis and machine intelligence, 38(2): 295–307.
- Deep Unpaired Blind Image Super-Resolution Using Self-supervised Learning and Exemplar Distillation. International Journal of Computer Vision, 1–13.
- Generative adversarial networks. Communications of the ACM, 63(11): 139–144.
- Fmix: Enhancing mixed sample data augmentation. arXiv preprint arXiv:2002.12047.
- Real-World Super-Resolution via Kernel Estimation and Noise Injection. In The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
- Perceptual losses for real-time style transfer and super-resolution. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, 694–711. Springer.
- Real-world image super-resolution by exclusionary dual-learning. IEEE Transactions on Multimedia.
- Decoupled Mixup for Generalized Visual Recognition. arXiv preprint arXiv:2210.14783.
- Unfolding the Alternating Optimization for Blind Super Resolution. Advances in Neural Information Processing Systems (NeurIPS), 33.
- Learning a no-reference quality metric for single-image super-resolution. Computer Vision and Image Understanding, 158: 1–16.
- Blind/referenceless image spatial quality evaluator. In 2011 conference record of the forty fifth asilomar conference on signals, systems and computers (ASILOMAR), 723–727. IEEE.
- Making a “completely blind” image quality analyzer. IEEE Signal processing letters, 20(3): 209–212.
- NTIRE 2019 Challenge on Video Deblurring and Super-Resolution: Dataset and Study. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
- Deep blind video super-resolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 4811–4820.
- Deep Discriminative Spatial and Temporal Network for Efficient Video Deblurring. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 22191–22200.
- Optical Flow Estimation Using a Spatial Pyramid Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
- DDet: Dual-path dynamic enhancement network for real-world image super-resolution. IEEE Signal Processing Letters, 27: 481–485.
- Tdan: Temporally-deformable alignment network for video super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 3360–3369.
- EDVR: Video Restoration With Enhanced Deformable Convolutional Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) Workshops.
- Real-esrgan: Training real-world blind super-resolution with pure synthetic data. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 1905–1914.
- Component divide-and-conquer for real-world image super-resolution. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VIII 16, 101–117. Springer.
- AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos. arXiv preprint arXiv:2206.07038.
- Mitigating Artifacts in Real-World Video Super-Resolution Models. arXiv preprint arXiv:2212.07339.
- Real-world video super-resolution: A benchmark dataset and a decomposition based learning scheme. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 4781–4790.
- Cutmix: Regularization strategy to train strong classifiers with localizable features. In Proceedings of the IEEE/CVF international conference on computer vision, 6023–6032.
- mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412.
- Designing a Practical Degradation Model for Deep Blind Image Super-Resolution. In IEEE International Conference on Computer Vision, 4791–4800.