CRD-CGAN: Category-Consistent and Relativistic Constraints for Diverse Text-to-Image Generation

07/28/2021
by   Tao Hu, et al.
7

Generating photo-realistic images from a text description is a challenging problem in computer vision. Previous works have shown promising performance to generate synthetic images conditional on text by Generative Adversarial Networks (GANs). In this paper, we focus on the category-consistent and relativistic diverse constraints to optimize the diversity of synthetic images. Based on those constraints, a category-consistent and relativistic diverse conditional GAN (CRD-CGAN) is proposed to synthesize K photo-realistic images simultaneously. We use the attention loss and diversity loss to improve the sensitivity of the GAN to word attention and noises. Then, we employ the relativistic conditional loss to estimate the probability of relatively real or fake for synthetic images, which can improve the performance of basic conditional loss. Finally, we introduce a category-consistent loss to alleviate the over-category issues between K synthetic images. We evaluate our approach using the Birds-200-2011, Oxford-102 flower and MSCOCO 2014 datasets, and the extensive experiments demonstrate superiority of the proposed method in comparison with state-of-the-art methods in terms of photorealistic and diversity of the generated synthetic images.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

page 7

page 13

page 14

research
04/02/2019

Semantics Disentangling for Text-to-Image Generation

Synthesizing photo-realistic images from text descriptions is a challeng...
research
12/10/2016

StackGAN: Text to Photo-realistic Image Synthesis with Stacked Generative Adversarial Networks

Synthesizing high-quality images from text descriptions is a challenging...
research
05/30/2019

Towards Photo-Realistic Visible Watermark Removal with Conditional Generative Adversarial Networks

Visible watermark plays an important role in image copyright protection ...
research
06/28/2018

High Diversity Attribute Guided Face Generation with GANs

In this work we focused on GAN-based solution for the attribute guided f...
research
08/05/2020

F2GAN: Fusing-and-Filling GAN for Few-shot Image Generation

In order to generate images for a given category, existing deep generati...
research
07/27/2022

Lighting (In)consistency of Paint by Text

Whereas generative adversarial networks are capable of synthesizing high...
research
02/23/2022

Art Creation with Multi-Conditional StyleGANs

Creating meaningful art is often viewed as a uniquely human endeavor. A ...

Please sign up or login with your details

Forgot password? Click here to reset