Understanding Diffusion Models: A Unified Perspective

08/25/2022
by   Calvin Luo, et al.
0

Diffusion models have shown incredible capabilities as generative models; indeed, they power the current state-of-the-art models on text-conditioned image generation such as Imagen and DALL-E 2. In this work we review, demystify, and unify the understanding of diffusion models across both variational and score-based perspectives. We first derive Variational Diffusion Models (VDM) as a special case of a Markovian Hierarchical Variational Autoencoder, where three key assumptions enable tractable computation and scalable optimization of the ELBO. We then prove that optimizing a VDM boils down to learning a neural network to predict one of three potential objectives: the original source input from any arbitrary noisification of it, the original source noise from any arbitrarily noisified input, or the score function of a noisified input at any arbitrary noise level. We then dive deeper into what it means to learn the score function, and connect the variational perspective of a diffusion model explicitly with the Score-based Generative Modeling perspective through Tweedie's Formula. Lastly, we cover how to learn a conditional distribution using diffusion models via guidance.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2021

A Variational Perspective on Diffusion-Based Generative Models and Score Matching

Discrete-time diffusion-based generative models and score matching metho...
research
11/26/2021

Conditional Image Generation with Score-Based Diffusion Models

Score-based diffusion models have emerged as one of the most promising f...
research
06/12/2023

VillanDiffusion: A Unified Backdoor Attack Framework for Diffusion Models

Diffusion Models (DMs) are state-of-the-art generative models that learn...
research
06/24/2022

Source Localization of Graph Diffusion via Variational Autoencoders for Graph Inverse Problems

Graph diffusion problems such as the propagation of rumors, computer vir...
research
08/04/2023

Diffusion probabilistic models enhance variational autoencoder for crystal structure generative modeling

The crystal diffusion variational autoencoder (CDVAE) is a machine learn...
research
09/29/2022

Creative Painting with Latent Diffusion Models

Artistic painting has achieved significant progress during recent years....
research
06/27/2023

Easing Color Shifts in Score-Based Diffusion Models

Generated images of score-based models can suffer from errors in their s...

Please sign up or login with your details

Forgot password? Click here to reset