DiffVSR: Enhancing Real-World Video Super-Resolution with Diffusion Models for Advanced Visual Quality and Temporal Consistency (2501.10110v2)

Published 17 Jan 2025 in cs.CV

Abstract: Diffusion models have demonstrated exceptional capabilities in image generation and restoration, yet their application to video super-resolution faces significant challenges in maintaining both high fidelity and temporal consistency. We present DiffVSR, a diffusion-based framework for real-world video super-resolution that effectively addresses these challenges through key innovations. For intra-sequence coherence, we develop a multi-scale temporal attention module and temporal-enhanced VAE decoder that capture fine-grained motion details. To ensure inter-sequence stability, we introduce a noise rescheduling mechanism with an interweaved latent transition approach, which enhances temporal consistency without additional training overhead. We propose a progressive learning strategy that transitions from simple to complex degradations, enabling robust optimization despite limited high-quality video data. Extensive experiments demonstrate that DiffVSR delivers superior results in both visual quality and temporal consistency, setting a new performance standard in real-world video super-resolution.

Authors (9)
  1. Xiaohui Li (26 papers)
  2. Yihao Liu (85 papers)
  3. Shuo Cao (121 papers)
  4. Ziyan Chen (17 papers)
  5. Shaobin Zhuang (12 papers)
  6. Xiangyu Chen (84 papers)
  7. Yinan He (34 papers)
  8. Yi Wang (1038 papers)
  9. Yu Qiao (563 papers)