
SpotDiffusion: A Fast Approach For Seamless Panorama Generation Over Time (2407.15507v1)

Published 22 Jul 2024 in cs.CV

Abstract: Generating high-resolution images with generative models has recently been made widely accessible by leveraging diffusion models pre-trained on large-scale datasets. Various techniques, such as MultiDiffusion and SyncDiffusion, have further pushed image generation beyond training resolutions, i.e., from square images to panoramas, by merging multiple overlapping diffusion paths or employing gradient descent to maintain perceptual coherence. However, these methods suffer from significant computational inefficiencies due to generating and averaging numerous predictions, which is required in practice to produce high-quality and seamless images. This work addresses this limitation and presents a novel approach that eliminates the need to generate and average numerous overlapping denoising predictions. Our method shifts non-overlapping denoising windows over time, ensuring that seams in one timestep are corrected in the next. This results in coherent, high-resolution images with fewer overall steps. We demonstrate the effectiveness of our approach through qualitative and quantitative evaluations, comparing it with MultiDiffusion, SyncDiffusion, and StitchDiffusion. Our method offers several key benefits, including improved computational efficiency and faster inference times while producing comparable or better image quality.
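
The window-shifting idea in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`shifted_windows`, `seam_positions`, `denoise_panorama`), the 1-D list standing in for a latent, the wrap-around at the panorama edge, and the choice of a random offset per timestep are all assumptions made for illustration. The abstract states only that non-overlapping windows are shifted over time so that seams from one timestep are corrected in the next.

```python
import random


def shifted_windows(width, window, offset):
    """Split `width` columns into non-overlapping windows of size `window`,
    with the whole tiling shifted by `offset` columns (wrapping around)."""
    if width % window != 0:
        raise ValueError("this sketch assumes width is a multiple of window")
    shifted = [(c + offset) % width for c in range(width)]
    return [shifted[i:i + window] for i in range(0, width, window)]


def seam_positions(width, window, offset):
    """Columns where a window starts, i.e. where a seam could appear."""
    return sorted((offset + k * window) % width for k in range(width // window))


def denoise_panorama(latent, window, steps, denoise_window, rng):
    """Run `steps` denoising steps over a 1-D toy latent.

    Each step denoises every column exactly once (no overlap, no averaging),
    but the window boundaries move between steps (hypothetical random shift).
    `denoise_window(patch, t)` is a placeholder for a real diffusion model call.
    """
    width = len(latent)
    for t in reversed(range(steps)):
        offset = rng.randrange(window)  # assumed shift schedule
        for cols in shifted_windows(width, window, offset):
            patch = [latent[c] for c in cols]
            for c, value in zip(cols, denoise_window(patch, t)):
                latent[c] = value
    return latent


def halve(patch, t):
    """Toy stand-in for a denoiser: shrink every value by half."""
    return [0.5 * v for v in patch]


if __name__ == "__main__":
    print(seam_positions(16, 4, 0))  # seams with no shift
    print(seam_positions(16, 4, 1))  # seams after shifting by one column
    print(denoise_panorama([8.0] * 8, 4, 3, halve, random.Random(0)))
```

Because the windows tile the panorama without overlap, each step makes one model call per window rather than one per overlapping crop, and a column that sits on a window boundary in one step usually falls inside a window in a later step.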

References (25)
  1. Multidiffusion: Fusing diffusion paths for controlled image generation. ICML, 2023.
  2. Diffusion models in vision: A survey. TPAMI, 2023.
  3. Diffusion models beat gans on image synthesis. NeurIPS, 2021.
  4. Fast timing-conditioned latent audio diffusion. arXiv:2402.04825, 2024.
  5. Adversarial text-to-image synthesis: A review. Neural Networks, 2021.
  6. Generative adversarial networks. Communications of the ACM, 2020.
  7. Clipscore: A reference-free evaluation metric for image captioning. EMNLP, 2021.
  8. Gans trained by a two time-scale update rule converge to a local nash equilibrium. NeurIPS, 2017.
  9. Denoising diffusion probabilistic models. NeurIPS, 2020.
  10. Lora: Low-rank adaptation of large language models. ICLR, 2022.
  11. The role of imagenet classes in Fréchet inception distance. ICLR, 2023.
  12. Syncdiffusion: Coherent montage via synchronized joint diffusions. NeurIPS, 2023.
  13. Repaint: Inpainting using denoising diffusion probabilistic models. CVPR, 2022.
  14. Diffusion models, image super-resolution and everything: A survey. arXiv:2401.00736, 2024.
  15. On aliased resizing and surprising subtleties in gan evaluation. CVPR, 2022.
  16. Learning transferable visual models from natural language supervision. ICML, 2021.
  17. Zero-shot text-to-image generation. ICML, 2021.
  18. High-resolution image synthesis with latent diffusion models. CVPR, 2022.
  19. Deep unsupervised learning using nonequilibrium thermodynamics. ICML, 2015.
  20. Generative modeling by estimating gradients of the data distribution. NeurIPS, 2019.
  21. Score-based generative modeling through stochastic differential equations. ICLR, 2021.
  22. Customizing 360-degree panoramas through text-to-image diffusion models. WACV, 2024.
  23. Imagereward: Learning and evaluating human preferences for text-to-image generation. NeurIPS, 2024.
  24. Diffusion models: A comprehensive survey of methods and applications. ACM Computing Surveys, 2023.
  25. Text-to-image diffusion models in generative ai: A survey. arXiv:2303.07909, 2023.
Authors (3)
  1. Stanislav Frolov (28 papers)
  2. Brian B. Moser (16 papers)
  3. Andreas Dengel (188 papers)
