Rethinking Video Super-Resolution: Towards Diffusion-Based Methods without Motion Alignment (2503.03355v4)
Abstract: In this work, we rethink the approach to video super-resolution by introducing a method based on the Diffusion Posterior Sampling framework, combined with an unconditional video diffusion transformer operating in latent space. The video generation model, a diffusion transformer, functions as a space-time model. We argue that a powerful model, which learns the physics of the real world, can easily handle various kinds of motion patterns as prior knowledge, thus eliminating the need for explicit estimation of optical flows or motion parameters for pixel alignment. Furthermore, a single instance of the proposed video diffusion transformer model can adapt to different sampling conditions without re-training. Empirical results on synthetic and real-world datasets illustrate the feasibility of diffusion-based, alignment-free video super-resolution.
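
The abstract describes guiding an unconditional latent video diffusion model with Diffusion Posterior Sampling (DPS) so that no optical-flow or motion-parameter alignment is needed. Below is a minimal, illustrative sketch of a DPS-style guided sampling loop for video super-resolution under that reading; it is not the authors' implementation. The denoiser, decoder, degradation operator, noise schedule, and guidance weight `zeta` are all stand-in assumptions chosen so the script runs end to end.

```python
# Minimal sketch (not the paper's code) of DPS-guided sampling for
# alignment-free video super-resolution with a latent diffusion model.
# All components below are illustrative placeholders.

import torch
import torch.nn.functional as F

torch.manual_seed(0)

# Hypothetical latent shape for a short video clip of T frames.
T, C_lat, H_lat, W_lat = 8, 4, 32, 32
sr_factor = 4  # assumed super-resolution factor

def denoiser(z_t, t_idx):
    # Stand-in for the unconditional space-time diffusion transformer
    # (epsilon prediction). Returns zeros so the sketch is runnable.
    return torch.zeros_like(z_t)

def decode(z):
    # Stand-in for the latent-to-pixel decoder (e.g. a VAE decoder):
    # maps (T, C_lat, H_lat, W_lat) latents to (T, 3, H, W) frames.
    x = F.interpolate(z, scale_factor=8, mode="nearest")
    return x[:, :3]

def degrade(x):
    # Known degradation operator A: bicubic downsampling of each frame.
    return F.interpolate(x, scale_factor=1 / sr_factor,
                         mode="bicubic", align_corners=False)

# Assumed linear noise schedule (the abstract does not specify one).
num_steps = 50
betas = torch.linspace(1e-4, 2e-2, num_steps)
alphas = 1.0 - betas
alpha_bars = torch.cumprod(alphas, dim=0)

# Low-resolution observation y = A(x); synthetic here for illustration.
y = degrade(decode(torch.randn(T, C_lat, H_lat, W_lat)))

zeta = 1.0  # DPS guidance step size (assumed)
z_t = torch.randn(T, C_lat, H_lat, W_lat)  # start from pure noise

for i in reversed(range(num_steps)):
    z_t = z_t.detach().requires_grad_(True)
    eps = denoiser(z_t, i)

    # Tweedie estimate of the clean latent z0 given z_t.
    a_bar = alpha_bars[i]
    z0_hat = (z_t - (1 - a_bar).sqrt() * eps) / a_bar.sqrt()

    # Data-fidelity term: degraded decoded estimate vs. the LR observation.
    loss = (y - degrade(decode(z0_hat))).norm()

    # DPS measurement-likelihood gradient with respect to z_t.
    grad = torch.autograd.grad(loss, z_t)[0]

    # Ancestral DDPM-style update followed by the DPS correction.
    with torch.no_grad():
        coef = betas[i] / (1 - a_bar).sqrt()
        mean = (z_t - coef * eps) / alphas[i].sqrt()
        noise = torch.randn_like(z_t) if i > 0 else torch.zeros_like(z_t)
        z_t = mean + betas[i].sqrt() * noise - zeta * grad

print("final latent stats:", z_t.mean().item(), z_t.std().item())
```

Because the guidance acts on the whole space-time latent at once, temporal consistency comes from the video prior itself rather than from any per-frame motion alignment, which is the point the abstract argues.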