Consistency of 4DiM outputs across multiple views
Determine whether 4DiM (Controlling Space and Time with Diffusion Models) can produce multi-view-consistent outputs when synthesizing images under specified viewpoints and timestamps, i.e., ascertain if generations from multiple cameras at given times are mutually consistent so they can be used as multi-view videos for reconstruction.
Sponsor
References
4DiM trained a diffusion model for synthesizing images under novel views and timestamps, but it's unclear whether their model can produce consistent multi-view videos.
— CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models
(2411.18613 - Wu et al., 2024) in Section 2 (Related Work) – Video Generation Models with Camera Control