Inferring Semantic Layout for Hierarchical Text-to-Image Synthesis

01/16/2018
by Seunghoon Hong, et al.

We propose a novel hierarchical approach for text-to-image synthesis by inferring semantic layout. Instead of learning a direct mapping from text to image, our algorithm decomposes the generation process into multiple steps: it first constructs a semantic layout from the text using a layout generator, then converts the layout to an image using an image generator. The layout generator progressively constructs a semantic layout in a coarse-to-fine manner by generating object bounding boxes and then refining each box by estimating the object shape inside it. The image generator synthesizes an image conditioned on the inferred semantic layout, which provides a useful semantic structure of an image matching the text description. Our model not only generates semantically more meaningful images, but also allows automatic annotation of generated images and a user-controlled generation process through modification of the generated scene layout. We demonstrate the capability of the proposed model on the challenging MS-COCO dataset and show that the model can substantially improve image quality, interpretability of output, and semantic alignment to input text over existing approaches.
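The data flow described in the abstract (text to bounding boxes, boxes to object shapes, shapes to a semantic layout, layout to an image) can be illustrated with a toy sketch. This is not the authors' model: in the paper each stage is a learned generator, whereas every function below (`generate_boxes`, `generate_shape`, `build_layout`, `render_image`) is a hypothetical hand-written stand-in, and the vocabulary, canvas size, and palette are assumptions chosen only to make the example runnable.

```python
from dataclasses import dataclass
from typing import List, Tuple


@dataclass
class Box:
    label: str
    x: int  # left column of the box
    y: int  # top row of the box
    w: int  # width in pixels
    h: int  # height in pixels


def generate_boxes(text: str, vocabulary: List[str], size: int) -> List[Box]:
    """Coarse step (stand-in): one box per known object word, laid out left to right."""
    words = [w.strip(".,").lower() for w in text.split()]
    labels = [w for w in words if w in vocabulary]
    if not labels:
        return []
    width = size // len(labels)
    return [Box(lab, i * width, size // 4, width, size // 2)
            for i, lab in enumerate(labels)]


def generate_shape(box: Box) -> List[List[bool]]:
    """Fine step (stand-in): a binary mask inside the box, here an inscribed ellipse."""
    cy, cx = (box.h - 1) / 2, (box.w - 1) / 2
    ry, rx = box.h / 2, box.w / 2
    return [[((r - cy) / ry) ** 2 + ((c - cx) / rx) ** 2 <= 1.0
             for c in range(box.w)]
            for r in range(box.h)]


def build_layout(boxes: List[Box], vocabulary: List[str], size: int) -> List[List[int]]:
    """Paste each object mask into a class-id map (0 is background)."""
    layout = [[0] * size for _ in range(size)]
    for box in boxes:
        mask = generate_shape(box)
        class_id = vocabulary.index(box.label) + 1
        for r in range(box.h):
            for c in range(box.w):
                if mask[r][c]:
                    layout[box.y + r][box.x + c] = class_id
    return layout


def render_image(layout: List[List[int]],
                 palette: List[Tuple[int, int, int]]) -> List[List[Tuple[int, int, int]]]:
    """Image step (stand-in): colour each pixel by its class id."""
    return [[palette[class_id] for class_id in row] for row in layout]


# Example run with an assumed two-class vocabulary and a 16x16 canvas.
vocab = ["person", "horse"]
palette = [(0, 0, 0), (255, 0, 0), (0, 0, 255)]  # background, person, horse
boxes = generate_boxes("A person riding a horse.", vocab, 16)
layout = build_layout(boxes, vocab, 16)
image = render_image(layout, palette)
for row in layout:
    print("".join(str(v) for v in row))
```

Because the layout is an explicit intermediate result, it doubles as a per-pixel annotation of the output, and editing `boxes` before calling `build_layout` changes the final image. These are the two properties the abstract attributes to the hierarchical design.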


