Masked Diffusion Models are Fast Learners

06/20/2023
by   Jiachen Lei, et al.
0

Diffusion models have emerged as the de-facto technique for image generation, yet they entail significant computational overhead, hindering the technique's broader application in the research community. We propose a prior-based denoising training framework, the first to incorporate the pre-train and fine-tune paradigm into the diffusion model training process, which substantially improves training efficiency and shows potential in facilitating various downstream tasks. Our approach centers on masking a high proportion (e.g., up to 90 denoise the visible areas, thereby guiding the diffusion model to learn more salient features from training data as prior knowledge. By utilizing this masked learning process in a pre-training stage, we efficiently train the ViT-based diffusion model on CelebA-HQ 256x256 in the pixel space, achieving a 4x acceleration and enhancing the quality of generated images compared to DDPM. Moreover, our masked pre-training technique is universally applicable to various diffusion models that directly generate images in the pixel space and facilitates learning pre-trained models with excellent generalizability: a diffusion model pre-trained on VGGFace2 attains a 46 through fine-tuning with merely 10 https://github.com/jiachenlei/maskdm.

READ FULL TEXT

page 2

page 7

page 15

page 16

research
03/29/2023

When to Pre-Train Graph Neural Networks? An Answer from Data Generation Perspective!

Recently, graph pre-training has attracted wide research attention, whic...
research
10/05/2022

clip2latent: Text driven sampling of a pre-trained StyleGAN using denoising diffusion and CLIP

We introduce a new method to efficiently create text-to-image models fro...
research
04/25/2023

Patch Diffusion: Faster and More Data-Efficient Training of Diffusion Models

Diffusion models are powerful, but they require a lot of time and data t...
research
11/17/2022

Conffusion: Confidence Intervals for Diffusion Models

Diffusion models have become the go-to method for many generative tasks,...
research
07/26/2023

Pre-Training with Diffusion models for Dental Radiography segmentation

Medical radiography segmentation, and specifically dental radiography, i...
research
01/18/2023

Targeted Image Reconstruction by Sampling Pre-trained Diffusion Model

A trained neural network model contains information on the training data...
research
05/18/2023

Democratized Diffusion Language Model

Despite the potential benefits of Diffusion Models for NLP applications,...

Please sign up or login with your details

Forgot password? Click here to reset