Interactive Image Synthesis with Panoptic Layout Generation

03/04/2022
by   Bo Wang, et al.
11

Interactive image synthesis from user-guided input is a challenging task when users wish to control the scene structure of a generated image with ease.Although remarkable progress has been made on layout-based image synthesis approaches, in order to get realistic fake image in interactive scene, existing methods require high-precision inputs, which probably need adjustment several times and are unfriendly to novice users. When placement of bounding boxes is subject to perturbation, layout-based models suffer from "missing regions" in the constructed semantic layouts and hence undesirable artifacts in the generated images. In this work, we propose Panoptic Layout Generative Adversarial Networks (PLGAN) to address this challenge. The PLGAN employs panoptic theory which distinguishes object categories between "stuff" with amorphous boundaries and "things" with well-defined shapes, such that stuff and instance layouts are constructed through separate branches and later fused into panoptic layouts. In particular, the stuff layouts can take amorphous shapes and fill up the missing regions left out by the instance layouts. We experimentally compare our PLGAN with state-of-the-art layout-based models on the COCO-Stuff, Visual Genome, and Landscape datasets. The advantages of PLGAN are not only visually demonstrated but quantitatively verified in terms of inception score, Fréchet inception distance, classification accuracy score, and coverage.

READ FULL TEXT

page 1

page 3

page 4

page 6

page 7

research
03/25/2020

Learning Layout and Style Reconfigurable GANs for Controllable Image Synthesis

With the remarkable recent progress on learning deep generative models, ...
research
08/27/2020

Attribute-guided image generation from layout

Recent approaches have achieved great success in image generation from s...
research
11/26/2019

Semantic Bottleneck Scene Generation

Coupling the high-fidelity generation capabilities of label-conditional ...
research
08/20/2019

Image Synthesis From Reconfigurable Layout and Style

Despite remarkable recent progress on both unconditional and conditional...
research
06/03/2021

Semantic Palette: Guiding Scene Generation with Class Proportions

Despite the recent progress of generative adversarial networks (GANs) at...
research
05/25/2023

Towards Language-guided Interactive 3D Generation: LLMs as Layout Interpreter with Generative Feedback

Generating and editing a 3D scene guided by natural language poses a cha...
research
08/28/2020

Person-in-Context Synthesiswith Compositional Structural Space

Despite significant progress, controlled generation of complex images wi...

Please sign up or login with your details

Forgot password? Click here to reset