GLIDE: Towards Photorealistic Image Generation and Editing with Text-Guided Diffusion Models

12/20/2021
by Alex Nichol, et al.

Diffusion models have recently been shown to generate high-quality synthetic images, especially when paired with a guidance technique to trade off diversity for fidelity. We explore diffusion models for the problem of text-conditional image synthesis and compare two different guidance strategies: CLIP guidance and classifier-free guidance. We find that the latter is preferred by human evaluators for both photorealism and caption similarity, and often produces photorealistic samples. Samples from a 3.5 billion parameter text-conditional diffusion model using classifier-free guidance are favored by human evaluators over those from DALL-E, even when the latter uses expensive CLIP reranking. Additionally, we find that our models can be fine-tuned to perform image inpainting, enabling powerful text-driven image editing. We train a smaller model on a filtered dataset and release the code and weights at https://github.com/openai/glide-text2im.
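
To make the diversity-for-fidelity trade-off concrete, here is a minimal sketch of the classifier-free guidance step. It assumes the commonly used formulation in which the model's text-conditional and unconditional noise predictions are extrapolated with a guidance scale. The function name, the flat-list representation, and the toy values are hypothetical illustrations and are not taken from the released code.

```python
from typing import List, Sequence


def classifier_free_guidance(
    eps_cond: Sequence[float],
    eps_uncond: Sequence[float],
    scale: float,
) -> List[float]:
    """Combine conditional and unconditional noise predictions.

    Hypothetical helper: computes eps_uncond + scale * (eps_cond - eps_uncond)
    element-wise. scale = 1 recovers the conditional prediction; larger values
    push the sample further toward the caption at the cost of diversity.
    """
    if len(eps_cond) != len(eps_uncond):
        raise ValueError("predictions must have the same length")
    return [u + scale * (c - u) for c, u in zip(eps_cond, eps_uncond)]


# Toy predictions standing in for model outputs at a single denoising step.
guided = classifier_free_guidance([1.0, 2.0], [0.0, 1.0], scale=3.0)
```

In a real sampler both predictions would come from the same network, once with the caption and once with an empty caption, and the guided result would replace the plain prediction at each denoising step.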

research
07/26/2022

Classifier-Free Diffusion Guidance

Classifier guidance is a recently introduced method to trade off mode co...
research
05/01/2023

In-Context Learning Unlocked for Diffusion Models

We present Prompt Diffusion, a framework for enabling in-context learnin...
research
05/11/2023

Null-text Guidance in Diffusion Models is Secretly a Cartoon-style Creator

Classifier-free guidance is an effective sampling technique in diffusion...
research
04/02/2023

Textile Pattern Generation Using Diffusion Models

The problem of text-guided image generation is a complex task in Compute...
research
03/23/2023

End-to-End Diffusion Latent Optimization Improves Classifier Guidance

Classifier guidance – using the gradients of an image classifier to stee...
research
07/17/2023

Not All Steps are Created Equal: Selective Diffusion Distillation for Image Manipulation

Conditional diffusion models have demonstrated impressive performance in...
research
03/30/2023

Discriminative Class Tokens for Text-to-Image Diffusion Models

Recent advances in text-to-image diffusion models have enabled the gener...