MixNMatch: Multifactor Disentanglement and Encodingfor Conditional Image Generation

11/26/2019
by   Yuheng Li, et al.
12

We present MixNMatch, a conditional generative model that learns to disentangle and encode background, object pose, shape, and texture from real images with minimal supervision, for mix-and-match image generation. We build upon FineGAN, an unconditional generative model, to learn the desired disentanglement and image generator, and leverage adversarial joint image-code distribution matching to learn the latent factor encoders. MixNMatch requires bounding boxes during training to model background, but requires no other supervision. Through extensive experiments, we demonstrate MixNMatch's ability to accurately disentangle, encode, and combine multiple factors for mix-and-match image generation, including sketch2color, cartoon2img, and img2gif applications. Our code/models/demo can be found at https://github.com/Yuheng-Li/MixNMatch

READ FULL TEXT

page 1

page 5

page 6

page 7

page 8

page 14

page 15

page 16

research
11/26/2019

MixNMatch: Multifactor Disentanglement and Encoding for Conditional Image Generation

We present MixNMatch, a conditional generative model that learns to dise...
research
05/12/2023

Better speech synthesis through scaling

In recent years, the field of image generation has been revolutionized b...
research
11/19/2018

SEIGAN: Towards Compositional Image Generation by Simultaneously Learning to Segment, Enhance, and Inpaint

We present a novel approach to image manipulation and understanding by s...
research
01/30/2020

Adversarial Code Learning for Image Generation

We introduce the "adversarial code learning" (ACL) module that improves ...
research
05/14/2017

GeneGAN: Learning Object Transfiguration and Attribute Subspace from Unpaired Data

Object Transfiguration replaces an object in an image with another objec...
research
05/08/2017

Generative Cooperative Net for Image Generation and Data Augmentation

How to build a good model for image generation given an abstract concept...
research
04/05/2021

Generating Furry Cars: Disentangling Object Shape Appearance across Multiple Domains

We consider the novel task of learning disentangled representations of o...

Please sign up or login with your details

Forgot password? Click here to reset