f-DM: A Multi-stage Diffusion Model via Progressive Signal Transformation

10/10/2022
by   Jiatao Gu, et al.
7

Diffusion models (DMs) have recently emerged as SoTA tools for generative modeling in various domains. Standard DMs can be viewed as an instantiation of hierarchical variational autoencoders (VAEs) where the latent variables are inferred from input-centered Gaussian distributions with fixed scales and variances. Unlike VAEs, this formulation limits DMs from changing the latent spaces and learning abstract representations. In this work, we propose f-DM, a generalized family of DMs which allows progressive signal transformation. More precisely, we extend DMs to incorporate a set of (hand-designed or learned) transformations, where the transformed input is the mean of each diffusion step. We propose a generalized formulation and derive the corresponding de-noising objective with a modified sampling algorithm. As a demonstration, we apply f-DM in image generation tasks with a range of functions, including down-sampling, blurring, and learned transformations based on the encoder of pretrained VAEs. In addition, we identify the importance of adjusting the noise levels whenever the signal is sub-sampled and propose a simple rescaling recipe. f-DM can produce high-quality samples on standard image generation benchmarks like FFHQ, AFHQ, LSUN, and ImageNet with better efficiency and semantic interpretation.

READ FULL TEXT

page 20

page 22

page 23

page 24

page 25

page 26

page 27

page 28

research
08/06/2021

ILVR: Conditioning Method for Denoising Diffusion Probabilistic Models

Denoising diffusion probabilistic models (DDPM) have shown remarkable pe...
research
02/01/2022

Progressive Distillation for Fast Sampling of Diffusion Models

Diffusion models have recently shown great promise for generative modeli...
research
04/10/2023

Binary Latent Diffusion

In this paper, we show that a binary latent space can be explored for co...
research
07/19/2023

Text2Layer: Layered Image Generation using Latent Diffusion Model

Layer compositing is one of the most popular image editing workflows amo...
research
07/09/2022

Improving Diffusion Model Efficiency Through Patching

Diffusion models are a powerful class of generative models that iterativ...
research
11/30/2021

Diffusion Autoencoders: Toward a Meaningful and Decodable Representation

Diffusion probabilistic models (DPMs) have achieved remarkable quality i...
research
01/27/2021

Learning Non-linear Wavelet Transformation via Normalizing Flow

Wavelet transformation stands as a cornerstone in modern data analysis a...

Please sign up or login with your details

Forgot password? Click here to reset