PRedItOR: Text Guided Image Editing with Diffusion Prior

02/15/2023
by Hareesh Ravi, et al.

Diffusion models have shown remarkable capabilities in generating high-quality and creative images conditioned on text. An interesting application of such models is structure-preserving, text-guided image editing. Existing approaches rely on text-conditioned diffusion models such as Stable Diffusion or Imagen and require compute-intensive optimization of text embeddings or fine-tuning of the model weights. We explore text-guided image editing with a Hybrid Diffusion Model (HDM) architecture similar to DALLE-2. Our architecture consists of a diffusion prior model that generates a CLIP image embedding conditioned on a text prompt and a custom Latent Diffusion Model trained to generate images conditioned on a CLIP image embedding. We discover that the diffusion prior model can be used to perform text-guided conceptual edits in the CLIP image embedding space without any fine-tuning or optimization. We combine this with structure-preserving edits on the image decoder, using existing approaches such as reverse DDIM, to perform text-guided image editing. Our approach, PRedItOR, does not require additional inputs, fine-tuning, optimization or objectives and shows on-par or better results than baselines, both qualitatively and quantitatively. We provide further analysis and understanding of the diffusion prior model and believe this opens up new possibilities in diffusion models research.
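The abstract names reverse DDIM as the structure-preserving component. As a rough illustration only, the sketch below shows the generic deterministic DDIM update (eta = 0) run in both directions on a plain vector. It is not the authors' implementation: the linear beta schedule, the function names (`make_alpha_bar`, `ddim_step`, `ddim_invert`, `ddim_sample`) and the `eps_model` callable are all assumptions made for this example, and a real decoder would pass a trained noise-prediction network conditioned on a CLIP image embedding.

```python
import numpy as np


def make_alpha_bar(num_steps=50, beta_start=1e-4, beta_end=2e-2):
    """Cumulative product of (1 - beta) for a linear beta schedule (assumed here)."""
    betas = np.linspace(beta_start, beta_end, num_steps)
    return np.cumprod(1.0 - betas)


def ddim_step(x, eps, a_from, a_to):
    """Deterministic DDIM update (eta = 0) from noise level a_from to a_to."""
    # Estimate the clean sample implied by the current latent and predicted noise.
    x0_pred = (x - np.sqrt(1.0 - a_from) * eps) / np.sqrt(a_from)
    # Re-noise that estimate to the target level using the same predicted noise.
    return np.sqrt(a_to) * x0_pred + np.sqrt(1.0 - a_to) * eps


def ddim_invert(x0, eps_model, alpha_bar):
    """Reverse DDIM: move a clean latent up the noise schedule, one level at a time."""
    x, a_prev = x0, 1.0  # alpha_bar = 1 corresponds to a noise-free latent
    for t, a_t in enumerate(alpha_bar):
        # Approximation: the noise predicted at the current latent is reused
        # to step to the next, noisier level.
        x = ddim_step(x, eps_model(x, t), a_prev, a_t)
        a_prev = a_t
    return x


def ddim_sample(x_T, eps_model, alpha_bar):
    """Deterministic DDIM sampling: move a noisy latent back down the schedule."""
    x = x_T
    for t in range(len(alpha_bar) - 1, -1, -1):
        a_prev = alpha_bar[t - 1] if t > 0 else 1.0
        x = ddim_step(x, eps_model(x, t), alpha_bar[t], a_prev)
    return x


# Toy usage: a hypothetical noise predictor that ignores its input.
rng = np.random.default_rng(0)
x0 = rng.normal(size=4)
fixed_noise = rng.normal(size=4)
toy_eps_model = lambda x, t: fixed_noise

alpha_bar = make_alpha_bar()
x_T = ddim_invert(x0, toy_eps_model, alpha_bar)
x0_rec = ddim_sample(x_T, toy_eps_model, alpha_bar)
print(np.max(np.abs(x0_rec - x0)))  # reconstruction error of the round trip
```

With this input-independent toy predictor the round trip recovers the starting vector up to floating-point error. With a trained network the inversion is only approximate, because the noise prediction changes between adjacent levels. In an editing setting, the inverted latent would be decoded under a changed condition rather than the original one.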


research
02/23/2023

Controlled and Conditional Text to Image Generation with Diffusion Prior

Denoising Diffusion models have shown remarkable performance in generati...
research
09/19/2023

Forgedit: Text Guided Image Editing via Learning and Forgetting

Text guided image editing on real images given only the image and the ta...
research
05/10/2023

iEdit: Localised Text-guided Image Editing with Weak Supervision

Diffusion models (DMs) can generate realistic images with text guidance ...
research
09/20/2023

Face Aging via Diffusion-based Editing

In this paper, we address the problem of face aging: generating past or ...
research
03/22/2023

Pix2Video: Video Editing using Image Diffusion

Image diffusion models, trained on massive image collections, have emerg...
research
05/31/2023

Diffusion Brush: A Latent Diffusion Model-based Editing Tool for AI-generated Images

Text-to-image generative models have made remarkable advancements in gen...
research
03/15/2023

Highly Personalized Text Embedding for Image Manipulation by Stable Diffusion

Diffusion models have shown superior performance in image generation and...
