2000 character limit reached
Solving Linear Inverse Problems Provably via Posterior Sampling with Latent Diffusion Models (2307.00619v1)
Published 2 Jul 2023 in cs.LG, cs.AI, and stat.ML
Abstract: We present the first framework to solve linear inverse problems leveraging pre-trained latent diffusion models. Previously proposed algorithms (such as DPS and DDRM) only apply to pixel-space diffusion models. We theoretically analyze our algorithm showing provable sample recovery in a linear model setting. The algorithmic insight obtained from our analysis extends to more general settings often considered in practice. Experimentally, we outperform previously proposed posterior sampling algorithms in a wide variety of problems including random inpainting, block inpainting, denoising, deblurring, destriping, and super-resolution.
- Brian D.O. Anderson “Reverse-time diffusion equation models” In Stochastic Processes and their Applications 12.3 Elsevier, 1982, pp. 313–326
- “Single-Shot Adaptation using Score-Based Models for MRI Reconstruction” In International Society for Magnetic Resonance in Medicine, Annual Meeting, 2022
- “Cold Diffusion: Inverting arbitrary image transforms without noise” In arXiv preprint arXiv:2208.09392, 2022
- “Align your latents: High-resolution video synthesis with latent diffusion models” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 22563–22575
- “Compressed sensing using generative models” In International Conference on Machine Learning, 2017, pp. 537–546 PMLR
- Stanley H Chan, Xiran Wang and Omar A Elgendy “Plug-and-play ADMM for image restoration: Fixed-point convergence and applications” In IEEE Transactions on Computational Imaging 3.1 IEEE, 2016, pp. 84–98
- “Score Approximation, Estimation and Distribution Recovery of Diffusion Models on Low-Dimensional Data” In arXiv preprint arXiv:2302.07194, 2023
- “Sampling is as easy as learning the score: theory for diffusion models with minimal data assumptions” In arXiv preprint arXiv:2209.11215, 2022
- Sitan Chen, Giannis Daras and Alexandros G Dimakis “Restoration-Degradation Beyond Linear Diffusions: A Non-Asymptotic Analysis For DDIM-Type Samplers” In arXiv preprint arXiv:2303.03384, 2023
- “Ilvr: Conditioning method for denoising diffusion probabilistic models” In arXiv preprint arXiv:2108.02938, 2021
- “Diffusion Posterior Sampling for General Noisy Inverse Problems” In The Eleventh International Conference on Learning Representations, 2023 URL: https://openreview.net/forum?id=OnD9zGAGT0k
- Hyungjin Chung, Jeongsol Kim and Jong Chul Ye “Direct Diffusion Bridge using Data Consistency for Inverse Problems” In arXiv preprint arXiv:2305.19809, 2023
- “Improving Diffusion Models for Inverse Problems using Manifold Constraints” In Advances in Neural Information Processing Systems, 2022 URL: https://openreview.net/forum?id=nJJjv0JDJju
- “Score-guided intermediate layer optimization: Fast langevin mixing for inverse problem” In arXiv preprint arXiv:2206.09104, 2022
- “Soft diffusion: Score matching for general corruptions” In arXiv preprint arXiv:2209.05442, 2022
- “Inversion by direct iteration: An alternative to denoising diffusion for image restoration” In arXiv preprint arXiv:2303.11435, 2023
- “Imagenet: A large-scale hierarchical image database” In 2009 IEEE conference on computer vision and pattern recognition, 2009, pp. 248–255 Ieee
- “Diffusion models beat gans on image synthesis” In Advances in Neural Information Processing Systems 34, 2021, pp. 8780–8794
- “DataComp: In search of the next generation of multimodal datasets” In arXiv preprint arXiv:2304.14108, 2023
- Jonathan Ho, Ajay Jain and Pieter Abbeel “Denoising diffusion probabilistic models” In Advances in Neural Information Processing Systems 33, 2020, pp. 6840–6851
- “Estimation of non-normalized statistical models by score matching.” In Journal of Machine Learning Research 6.4, 2005
- “Robust compressed sensing mri with deep generative priors” In Advances in Neural Information Processing Systems 34, 2021, pp. 14938–14954
- “Instance-optimal compressed sensing via posterior sampling” In arXiv preprint arXiv:2106.11438, 2021
- “Fairness for Image Generation with Uncertain Sensitive Attributes” In Proceedings of the 38th International Conference on Machine Learning 139, Proceedings of Machine Learning Research PMLR, 2021, pp. 4721–4732 URL: https://proceedings.mlr.press/v139/jalal21b.html
- Tero Karras, Samuli Laine and Timo Aila “A style-based generator architecture for generative adversarial networks” In Proceedings of the IEEE/CVF conference on computer vision and pattern recognition, 2019, pp. 4401–4410
- “Denoising Diffusion Restoration Models” In Advances in Neural Information Processing Systems
- “GSURE-Based Diffusion Model Training with Corrupted Data” In arXiv preprint arXiv:2305.13128, 2023
- “Soft truncation: A universal training technique of score-based diffusion model for high precision score estimation” In International Conference on Machine Learning, 2022, pp. 11201–11228 PMLR
- “Audioldm: Text-to-audio generation with latent diffusion models” In arXiv preprint arXiv:2301.12503, 2023
- “Coherent Semantic Attention for Image Inpainting” In 2019 IEEE/CVF International Conference on Computer Vision (ICCV) IEEE, 2019 DOI: 10.1109/iccv.2019.00427
- “Repaint: Inpainting using denoising diffusion probabilistic models” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 11461–11471
- Gary Mataev, Peyman Milanfar and Michael Elad “DeepRED: Deep image prior powered by RED” In Proceedings of the IEEE/CVF International Conference on Computer Vision Workshops, 2019, pp. 0–0
- “Pulse: Self-supervised photo upsampling via latent space exploration of generative models” In Proceedings of the ieee/cvf conference on computer vision and pattern recognition, 2020, pp. 2437–2445
- “Deep learning techniques for inverse problems in imaging” In IEEE Journal on Selected Areas in Information Theory 1.1 IEEE, 2020, pp. 39–56
- “Context encoders: Feature learning by inpainting” In Proceedings of the IEEE conference on computer vision and pattern recognition, 2016, pp. 2536–2544
- “Brain imaging generation with latent diffusion models” In Deep Generative Models: Second MICCAI Workshop, DGM4MICCAI 2022, Held in Conjunction with MICCAI 2022, Singapore, September 22, 2022, Proceedings, 2022, pp. 117–126 Springer
- “Encoding in Style: a StyleGAN Encoder for Image-to-Image Translation” In arXiv preprint arXiv:2008.00951, 2020
- Yaniv Romano, Michael Elad and Peyman Milanfar “The little engine that could: Regularization by denoising (RED)” In SIAM Journal on Imaging Sciences 10.4 SIAM, 2017, pp. 1804–1844
- “High-resolution image synthesis with latent diffusion models” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2022, pp. 10684–10695
- “A Theoretical Justification for Image Inpainting using Denoising Diffusion Probabilistic Models” In arXiv preprint arXiv:2302.01217, 2023
- “LAION-400M: Open Dataset of CLIP-Filtered 400 Million Image-Text Pairs”, 2021 arXiv:2111.02114 [cs.CV]
- “LAION-5B: An open large-scale dataset for training next generation image-text models”, 2022 arXiv:2210.08402 [cs.CV]
- “Pseudoinverse-guided diffusion models for inverse problems” In International Conference on Learning Representations, 2023
- “Generative modeling by estimating gradients of the data distribution” In Advances in Neural Information Processing Systems 32, 2019
- “Improved techniques for training score-based generative models” In Advances in neural information processing systems 33, 2020, pp. 12438–12448
- “Score-Based Generative Modeling through Stochastic Differential Equations” In International Conference on Learning Representations, 2021
- “Score-Based Generative Modeling through Stochastic Differential Equations” In International Conference on Learning Representations
- “High-resolution image reconstruction with latent diffusion models from human brain activity” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 14453–14463
- Singanallur V Venkatakrishnan, Charles A Bouman and Brendt Wohlberg “Plug-and-play priors for model based reconstruction” In 2013 IEEE Global Conference on Signal and Information Processing, 2013, pp. 945–948 IEEE
- Pascal Vincent “A connection between score matching and denoising autoencoders” In Neural computation 23.7 MIT Press, 2011, pp. 1661–1674
- “Imagen editor and editbench: Advancing and evaluating text-guided image inpainting” In Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition, 2023, pp. 18359–18369
- “Free-Form Image Inpainting With Gated Convolution” In 2019 IEEE/CVF International Conference on Computer Vision (ICCV) IEEE, 2019 DOI: 10.1109/iccv.2019.00457