Papers
Topics
Authors
Recent
Gemini 2.5 Flash
Gemini 2.5 Flash
184 tokens/sec
GPT-4o
7 tokens/sec
Gemini 2.5 Pro Pro
45 tokens/sec
o3 Pro
4 tokens/sec
GPT-4.1 Pro
38 tokens/sec
DeepSeek R1 via Azure Pro
28 tokens/sec
2000 character limit reached

NegVSR: Augmenting Negatives for Generalized Noise Modeling in Real-World Video Super-Resolution (2305.14669v3)

Published 24 May 2023 in cs.CV

Abstract: The capability of video super-resolution (VSR) to synthesize high-resolution (HR) video from ideal datasets has been demonstrated in many works. However, applying the VSR model to real-world video with unknown and complex degradation remains a challenging task. First, existing degradation metrics in most VSR methods are not able to effectively simulate real-world noise and blur. On the contrary, simple combinations of classical degradation are used for real-world noise modeling, which led to the VSR model often being violated by out-of-distribution noise. Second, many SR models focus on noise simulation and transfer. Nevertheless, the sampled noise is monotonous and limited. To address the aforementioned problems, we propose a Negatives augmentation strategy for generalized noise modeling in Video Super-Resolution (NegVSR) task. Specifically, we first propose sequential noise generation toward real-world data to extract practical noise sequences. Then, the degeneration domain is widely expanded by negative augmentation to build up various yet challenging real-world noise sets. We further propose the augmented negative guidance loss to learn robust features among augmented negatives effectively. Extensive experiments on real-world datasets (e.g., VideoLQ and FLIR) show that our method outperforms state-of-the-art methods with clear margins, especially in visual quality. Project page is available at: https://negvsr.github.io/.

Definition Search Book Streamline Icon: https://streamlinehq.com
References (34)
  1. The 2018 PIRM challenge on perceptual image super-resolution. In Proceedings of the European Conference on Computer Vision (ECCV) Workshops, 0–0.
  2. BasicVSR: The Search for Essential Components in Video Super-Resolution and Beyond. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR), 4947–4956.
  3. BasicVSR++: Improving video super-resolution with enhanced propagation and alignment. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 5972–5981.
  4. Investigating tradeoffs in real-world video super-resolution. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 5962–5971.
  5. Image blind denoising with generative adversarial network based noise modeling. In Proceedings of the IEEE conference on computer vision and pattern recognition, 3155–3164.
  6. Learning temporal coherence via self-supervision for GAN-based video generation. ACM Transactions on Graphics (TOG), 39(4): 75–1.
  7. Improved regularization of convolutional neural networks with cutout. arXiv preprint arXiv:1708.04552.
  8. Image super-resolution using deep convolutional networks. IEEE transactions on pattern analysis and machine intelligence, 38(2): 295–307.
  9. Deep Unpaired Blind Image Super-Resolution Using Self-supervised Learning and Exemplar Distillation. International Journal of Computer Vision, 1–13.
  10. Generative adversarial networks. Communications of the ACM, 63(11): 139–144.
  11. Fmix: Enhancing mixed sample data augmentation. arXiv preprint arXiv:2002.12047.
  12. Real-World Super-Resolution via Kernel Estimation and Noise Injection. In The IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
  13. Perceptual losses for real-time style transfer and super-resolution. In Computer Vision–ECCV 2016: 14th European Conference, Amsterdam, The Netherlands, October 11-14, 2016, Proceedings, Part II 14, 694–711. Springer.
  14. Real-world image super-resolution by exclusionary dual-learning. IEEE Transactions on Multimedia.
  15. Decoupled Mixup for Generalized Visual Recognition. arXiv preprint arXiv:2210.14783.
  16. Unfolding the Alternating Optimization for Blind Super Resolution. Advances in Neural Information Processing Systems (NeurIPS), 33.
  17. Learning a no-reference quality metric for single-image super-resolution. Computer Vision and Image Understanding, 158: 1–16.
  18. Blind/referenceless image spatial quality evaluator. In 2011 conference record of the forty fifth asilomar conference on signals, systems and computers (ASILOMAR), 723–727. IEEE.
  19. Making a “completely blind” image quality analyzer. IEEE Signal processing letters, 20(3): 209–212.
  20. NTIRE 2019 Challenge on Video Deblurring and Super-Resolution: Dataset and Study. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops.
  21. Deep blind video super-resolution. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 4811–4820.
  22. Deep Discriminative Spatial and Temporal Network for Efficient Video Deblurring. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 22191–22200.
  23. Optical Flow Estimation Using a Spatial Pyramid Network. In Proceedings of the IEEE Conference on Computer Vision and Pattern Recognition (CVPR).
  24. DDet: Dual-path dynamic enhancement network for real-world image super-resolution. IEEE Signal Processing Letters, 27: 481–485.
  25. Tdan: Temporally-deformable alignment network for video super-resolution. In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 3360–3369.
  26. EDVR: Video Restoration With Enhanced Deformable Convolutional Networks. In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition(CVPR) Workshops.
  27. Real-esrgan: Training real-world blind super-resolution with pure synthetic data. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 1905–1914.
  28. Component divide-and-conquer for real-world image super-resolution. In Computer Vision–ECCV 2020: 16th European Conference, Glasgow, UK, August 23–28, 2020, Proceedings, Part VIII 16, 101–117. Springer.
  29. AnimeSR: Learning Real-World Super-Resolution Models for Animation Videos. arXiv preprint arXiv:2206.07038.
  30. Mitigating Artifacts in Real-World Video Super-Resolution Models. arXiv preprint arXiv:2212.07339.
  31. Real-world video super-resolution: A benchmark dataset and a decomposition based learning scheme. In Proceedings of the IEEE/CVF International Conference on Computer Vision, 4781–4790.
  32. Cutmix: Regularization strategy to train strong classifiers with localizable features. In Proceedings of the IEEE/CVF international conference on computer vision, 6023–6032.
  33. mixup: Beyond empirical risk minimization. arXiv preprint arXiv:1710.09412.
  34. Designing a Practical Degradation Model for Deep Blind Image Super-Resolution. In IEEE International Conference on Computer Vision, 4791–4800.
Citations (2)

Summary

We haven't generated a summary for this paper yet.