Multi-attribute Pizza Generator: Cross-domain Attribute Control with Conditional StyleGAN

10/22/2021
by   Fangda Han, et al.
6

Multi-attribute conditional image generation is a challenging problem in computervision. We propose Multi-attribute Pizza Generator (MPG), a conditional Generative Neural Network (GAN) framework for synthesizing images from a trichotomy of attributes: content, view-geometry, and implicit visual style. We design MPG by extending the state-of-the-art StyleGAN2, using a new conditioning technique that guides the intermediate feature maps to learn multi-scale multi-attribute entangled representationsof controlling attributes. Because of the complex nature of the multi-attribute image generation problem, we regularize the image generation by predicting the explicit conditioning attributes (ingredients and view). To synthesize a pizza image with view attributesoutside the range of natural training images, we design a CGI pizza dataset PizzaView using 3D pizza models and employ it to train a view attribute regressor to regularize the generation process, bridging the real and CGI training datasets. To verify the efficacy of MPG, we test it on Pizza10, a carefully annotated multi-ingredient pizza image dataset. MPG can successfully generate photo-realistic pizza images with desired ingredients and view attributes, beyond the range of those observed in real-world training data.

READ FULL TEXT

page 6

page 8

page 9

page 14

page 15

page 16

page 17

page 19

research
12/04/2020

MPG: A Multi-ingredient Pizza Image Generator with Conditional StyleGANs

Multilabel conditional image generation is a challenging problem in comp...
research
04/26/2023

Ray Conditioning: Trading Photo-consistency for Photo-realism in Multi-view Image Generation

Multi-view image generation attracts particular attention these days due...
research
02/17/2019

Fully-Featured Attribute Transfer

Image attribute transfer aims to change an input image to a target one w...
research
09/28/2022

Adma-GAN: Attribute-Driven Memory Augmented GANs for Text-to-Image Generation

As a challenging task, text-to-image generation aims to generate photo-r...
research
08/02/2023

AutoPoster: A Highly Automatic and Content-aware Design System for Advertising Poster Generation

Advertising posters, a form of information presentation, combine visual ...
research
01/04/2023

Attribute-Centric Compositional Text-to-Image Generation

Despite the recent impressive breakthroughs in text-to-image generation,...
research
11/09/2018

Changing the Image Memorability: From Basic Photo Editing to GANs

Memorability is considered to be an important characteristic of visual c...

Please sign up or login with your details

Forgot password? Click here to reset