ALR-GAN: Adaptive Layout Refinement for Text-to-Image Synthesis

04/13/2023
by Hongchen Tan, et al.

We propose a novel Text-to-Image Generation Network, Adaptive Layout Refinement Generative Adversarial Network (ALR-GAN), to adaptively refine the layout of synthesized images without any auxiliary information. ALR-GAN includes an Adaptive Layout Refinement (ALR) module and a Layout Visual Refinement (LVR) loss. The ALR module aligns the layout structure (the locations of objects and background) of a synthesized image with that of its corresponding real image. Within the ALR module, we propose an Adaptive Layout Refinement (ALR) loss that balances the matching of hard and easy features for more efficient layout structure matching. Based on the refined layout structure, the LVR loss further refines the visual representation within the layout area. Experimental results on two widely used datasets show that ALR-GAN performs competitively on the Text-to-Image generation task.
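
The abstract does not give the form of the ALR loss, so the following is only an illustrative sketch of what "balancing hard and easy features" could look like, not the authors' formulation. It assumes that each layout location has a feature vector for the synthesized and the real image, that a "hard" location is one with a large squared distance between the two, and that a softmax with a hypothetical temperature `gamma` sets the per-location weights. The function name and all parameters are assumptions made for this example.

```python
import math


def adaptive_matching_loss(fake_feats, real_feats, gamma=1.0):
    """Weighted feature-matching loss (illustrative only, not the paper's ALR loss).

    fake_feats, real_feats: equal-length lists of equal-length feature vectors,
    one vector per layout location (an assumed representation).
    gamma: hypothetical temperature; 0 weights every location equally,
    larger values put more weight on poorly matched ("hard") locations.
    Returns (loss, weights).
    """
    if len(fake_feats) != len(real_feats) or not fake_feats:
        raise ValueError("feature lists must be non-empty and of equal length")

    # Squared L2 distance at each location.
    dists = [
        sum((f - r) ** 2 for f, r in zip(fv, rv))
        for fv, rv in zip(fake_feats, real_feats)
    ]

    # Softmax over gamma * distance; subtracting the max keeps exp() stable.
    top = max(dists)
    exps = [math.exp(gamma * (d - top)) for d in dists]
    total = sum(exps)
    weights = [e / total for e in exps]

    loss = sum(w * d for w, d in zip(weights, dists))
    return loss, weights


# Three locations: perfectly matched, slightly off, far off.
fake = [[0.0, 0.0], [1.0, 1.0], [3.0, 0.0]]
real = [[0.0, 0.0], [1.0, 0.0], [0.0, 0.0]]

uniform_loss, uniform_w = adaptive_matching_loss(fake, real, gamma=0.0)
hard_loss, hard_w = adaptive_matching_loss(fake, real, gamma=1.0)
```

With `gamma=0.0` the sketch reduces to a plain mean of the per-location distances. Raising `gamma` shifts weight toward the worst-matched location, which is one simple way to trade off easy and hard features under the assumptions above.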

research
02/16/2023

LayoutDiffuse: Adapting Foundational Diffusion Models for Layout-to-Image Generation

Layout-to-image generation refers to the task of synthesizing photo-real...
research
01/10/2018

Instance Map based Image Synthesis with a Denoising Generative Adversarial Network

Semantic layouts based Image synthesizing, which has benefited from the ...
research
10/01/2017

Video Generation From Text

Generating videos from text has proven to be a significant challenge for...
research
08/05/2019

Visual-Relation Conscious Image Generation from Structured-Text

Generating realistic images from text descriptions is a challenging prob...
research
04/02/2019

DM-GAN: Dynamic Memory Generative Adversarial Networks for Text-to-Image Synthesis

In this paper, we focus on generating realistic images from text descrip...
research
02/27/2019

Object-driven Text-to-Image Synthesis via Adversarial Training

In this paper, we propose Object-driven Attentive Generative Adversarial...
research
09/03/2022

DSE-GAN: Dynamic Semantic Evolution Generative Adversarial Network for Text-to-Image Generation

Text-to-image generation aims at generating realistic images which are s...
