PasteGAN: A Semi-Parametric Method to Generate Image from Scene Graph

05/05/2019
by Yikang Li, et al.

Despite exciting progress on high-quality image generation from structured (scene graph) or free-form (sentence) descriptions, most existing methods guarantee only image-level semantic consistency, i.e., that the generated image matches the semantic meaning of the description. There has been little investigation into synthesizing images in a more controllable way, such as finely manipulating the visual appearance of every object. To generate images with preferred objects and rich interactions, we propose a semi-parametric method, denoted PasteGAN, for generating an image from a scene graph: the scene graph defines the spatial arrangement of the objects and their pairwise relationships, while given object crops determine the object appearances. To enhance the interactions among the objects in the output, we design a Crop Refining Network that embeds the objects and their relationships into one map. Multiple losses work together to ensure that the generated images respect the crops and comply with the scene graphs while maintaining high image quality. When crops are not provided, a crop selector picks the most compatible crops from our external object tank by encoding the interactions around each object in the scene graph. Evaluated on Visual Genome and COCO-Stuff, our method outperforms state-of-the-art methods on both Inception Score and Diversity Score by a large margin. Extensive experiments also demonstrate our method's ability to generate complex and diverse images with given objects.
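
To make the crop-selection idea concrete, the sketch below represents a scene graph as objects plus (subject, predicate, object) relations and picks, for one object, the same-category crop from an object tank whose stored context is most similar. It is an illustration only, not the paper's implementation: the names `SceneGraph`, `Relation`, `context_signature`, and `select_crop` are hypothetical, and a hand-built bag-of-relations signature with cosine similarity stands in for whatever learned encoding the method actually uses.

```python
import math
from dataclasses import dataclass


@dataclass(frozen=True)
class Relation:
    subject: int    # index into SceneGraph.objects
    predicate: str
    obj: int        # index into SceneGraph.objects


@dataclass
class SceneGraph:
    objects: list    # object category names, e.g. ["man", "horse"]
    relations: list  # list of Relation


def context_signature(graph, idx):
    """Count the (role, predicate, other category) triples touching object idx.

    Assumption: this simple bag of relations is a stand-in for a learned
    encoding of the interactions around the object.
    """
    sig = {}
    for r in graph.relations:
        if r.subject == idx:
            key = ("subj", r.predicate, graph.objects[r.obj])
        elif r.obj == idx:
            key = ("obj", r.predicate, graph.objects[r.subject])
        else:
            continue
        sig[key] = sig.get(key, 0) + 1
    return sig


def cosine(a, b):
    """Cosine similarity between two sparse count dictionaries."""
    dot = sum(v * b.get(k, 0) for k, v in a.items())
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0


def select_crop(graph, idx, tank):
    """Return the id of the best-matching crop for object idx, or None.

    tank is a list of (crop_id, category, signature) tuples (hypothetical
    layout). Only crops of the same category are considered.
    """
    category = graph.objects[idx]
    query = context_signature(graph, idx)
    scored = [(cosine(query, sig), crop_id)
              for crop_id, cat, sig in tank if cat == category]
    if not scored:
        return None
    return max(scored, key=lambda s: s[0])[1]


graph = SceneGraph(
    objects=["man", "horse", "grass"],
    relations=[Relation(0, "riding", 1), Relation(1, "standing on", 2)],
)
tank = [
    ("horse_a", "horse", {("obj", "riding", "man"): 1,
                          ("subj", "standing on", "grass"): 1}),
    ("horse_b", "horse", {("subj", "eating", "hay"): 1}),
    ("man_a", "man", {("subj", "riding", "horse"): 1}),
]
best_horse = select_crop(graph, 1, tank)
```

Here `best_horse` is `"horse_a"`, because that crop was stored with the same surrounding relations as the horse in the query graph. A real system would compare learned embeddings rather than exact relation counts, so that similar but non-identical contexts still score well.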

research
04/01/2021

Exploiting Relationship for Complex-scene Image Generation

The significant progress on Generative Adversarial Networks (GANs) has f...
research
04/04/2018

Image Generation from Scene Graphs

To truly understand the visual world our models should be able not only ...
research
06/06/2021

MOC-GAN: Mixing Objects and Captions to Generate Realistic Images

Generating images with conditional descriptions gains increasing interes...
research
05/09/2019

Interactive Image Generation Using Scene Graphs

Recent years have witnessed some exciting developments in the domain of ...
research
03/08/2023

Transformer-based Image Generation from Scene Graphs

Graph-structured scene descriptions can be efficiently used in generativ...
research
07/01/2022

Transforming Image Generation from Scene Graphs

Generating images from semantic visual knowledge is a challenging task, ...
research
10/31/2022

Intelligent Painter: Picture Composition With Resampling Diffusion Model

Have you ever thought that you can be an intelligent painter? This means...
