A Survey on Generative Diffusion Model
Abstract: Deep generative models have opened a new frontier for machine-assisted creativity. By capturing and generalizing patterns within data, they have ushered in the era of Artificial Intelligence Generated Content (AIGC). Diffusion models, now among the most prominent generative models, translate human ideas into concrete samples across diverse domains, including images, text, speech, biology, and healthcare. To provide an advanced and comprehensive account of diffusion, this survey traces its development and future directions from three angles: the fundamental formulation of diffusion, algorithmic enhancements, and the applications of diffusion. Each angle is examined in depth to give a thorough picture of the field's evolution. A structured summary of the surveyed methods is available at https://github.com/chq1155/A-Survey-on-Generative-Diffusion-Model.
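As a brief illustration of the "fundamental formulation" angle referenced above, the following is a minimal sketch written in the standard denoising diffusion probabilistic model (DDPM) notation of Ho et al. (2020); the symbols (noise schedule β_t, cumulative product ᾱ_t, noise predictor ε_θ) are the conventional ones and are not defined elsewhere in this abstract.

```latex
% Forward (noising) process: Gaussian corruption with schedule \beta_t,
% and its closed-form marginal at step t given the clean sample x_0.
q(x_t \mid x_{t-1}) = \mathcal{N}\!\bigl(x_t;\ \sqrt{1-\beta_t}\,x_{t-1},\ \beta_t I\bigr),
\qquad
q(x_t \mid x_0) = \mathcal{N}\!\bigl(x_t;\ \sqrt{\bar\alpha_t}\,x_0,\ (1-\bar\alpha_t) I\bigr),
\quad
\bar\alpha_t = \prod_{s=1}^{t} (1-\beta_s).

% Reverse (generative) process: a learned Gaussian transition, trained with the
% simplified noise-prediction objective of Ho et al. (2020).
p_\theta(x_{t-1} \mid x_t) = \mathcal{N}\!\bigl(x_{t-1};\ \mu_\theta(x_t, t),\ \Sigma_\theta(x_t, t)\bigr),
\qquad
L_{\mathrm{simple}} = \mathbb{E}_{t,\,x_0,\,\epsilon \sim \mathcal{N}(0, I)}
\Bigl[\bigl\|\epsilon - \epsilon_\theta\bigl(\sqrt{\bar\alpha_t}\,x_0 + \sqrt{1-\bar\alpha_t}\,\epsilon,\ t\bigr)\bigr\|^2\Bigr].
```

Sampling then runs the learned reverse chain from pure noise x_T back to x_0; the algorithmic-enhancement angle of the survey largely concerns making this reverse process faster and more flexible.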