DualVAE: Controlling Colours of Generated and Real Images

05/30/2023
by   Keerth Rathakumar, et al.
0

Colour controlled image generation and manipulation are of interest to artists and graphic designers. Vector Quantised Variational AutoEncoders (VQ-VAEs) with autoregressive (AR) prior are able to produce high quality images, but lack an explicit representation mechanism to control colour attributes. We introduce DualVAE, a hybrid representation model that provides such control by learning disentangled representations for colour and geometry. The geometry is represented by an image intensity mapping that identifies structural features. The disentangled representation is obtained by two novel mechanisms: (i) a dual branch architecture that separates image colour attributes from geometric attributes, and (ii) a new ELBO that trains the combined colour and geometry representations. DualVAE can control the colour of generated images, and recolour existing images by transferring the colour latent representation obtained from an exemplar image. We demonstrate that DualVAE generates images with FID nearly two times better than VQ-GAN on a diverse collection of datasets, including animated faces, logos and artistic landscapes.

READ FULL TEXT

page 7

page 16

page 17

page 18

page 19

page 20

page 22

page 23

research
01/07/2021

GAN-Control: Explicitly Controllable GANs

We present a framework for training GANs with explicit control over gene...
research
12/02/2015

Attribute2Image: Conditional Image Generation from Visual Attributes

This paper investigates a novel problem of generating images from visual...
research
11/25/2020

StyleSpace Analysis: Disentangled Controls for StyleGAN Image Generation

We explore and analyze the latent style space of StyleGAN2, a state-of-t...
research
01/28/2019

PuppetGAN: Transferring Disentangled Properties from Synthetic to Real Images

In this work we propose a model that enables controlled manipulation of ...
research
04/30/2023

Learning Structured Output Representations from Attributes using Deep Conditional Generative Models

Structured output representation is a generative task explored in comput...
research
12/25/2019

Learning Controllable Disentangled Representations with Decorrelation Regularization

A crucial problem in learning disentangled image representations is cont...
research
03/09/2023

StyleDiff: Attribute Comparison Between Unlabeled Datasets in Latent Disentangled Space

One major challenge in machine learning applications is coping with mism...

Please sign up or login with your details

Forgot password? Click here to reset