Divide and Compose with Score Based Generative Models

02/05/2023
by   Sandesh Ghimire, et al.
0

While score based generative models, or diffusion models, have found success in image synthesis, they are often coupled with text data or image label to be able to manipulate and conditionally generate images. Even though manipulation of images by changing the text prompt is possible, our understanding of the text embedding and our ability to modify it to edit images is quite limited. Towards the direction of having more control over image manipulation and conditional generation, we propose to learn image components in an unsupervised manner so that we can compose those components to generate and manipulate images in informed manner. Taking inspiration from energy based models, we interpret different score components as the gradient of different energy functions. We show how score based learning allows us to learn interesting components and we can visualize them through generation. We also show how this novel decomposition allows us to compose, generate and modify images in interesting ways akin to dreaming. We make our code available at https://github.com/sandeshgh/Score-based-disentanglement

READ FULL TEXT

page 5

page 6

page 7

page 8

page 9

research
09/14/2022

Generative Visual Prompt: Unifying Distributional Control of Pre-Trained Generative Models

Generative models (e.g., GANs and diffusion models) learn the underlying...
research
03/30/2023

Forget-Me-Not: Learning to Forget in Text-to-Image Diffusion Models

The unlearning problem of deep learning models, once primarily an academ...
research
12/02/2022

QC-StyleGAN – Quality Controllable Image Generation and Manipulation

The introduction of high-quality image generation models, particularly t...
research
05/22/2023

If at First You Don't Succeed, Try, Try Again: Faithful Diffusion-based Text-to-Image Generation by Selection

Despite their impressive capabilities, diffusion-based text-to-image (T2...
research
03/11/2023

DeltaEdit: Exploring Text-free Training for Text-Driven Image Manipulation

Text-driven image manipulation remains challenging in training or infere...
research
02/22/2023

Entity-Level Text-Guided Image Manipulation

Existing text-guided image manipulation methods aim to modify the appear...
research
07/01/2020

Swapping Autoencoder for Deep Image Manipulation

Deep generative models have become increasingly effective at producing r...

Please sign up or login with your details

Forgot password? Click here to reset