Optimized latent-code selection for explainable conditional text-to-image GANs

04/27/2022
by   Zhenxing Zhang, et al.
6

The task of text-to-image generation has achieved remarkable progress due to the advances in the conditional generative adversarial networks (GANs). However, existing conditional text-to-image GANs approaches mostly concentrate on improving both image quality and semantic relevance but ignore the explainability of the model which plays a vital role in real-world applications. In this paper, we present a variety of techniques to take a deep look into the latent space and semantic space of the conditional text-to-image GANs model. We introduce pairwise linear interpolation of latent codes and `linguistic' linear interpolation to study what the model has learned within the latent space and `linguistic' embeddings. Subsequently, we extend linear interpolation to triangular interpolation conditioned on three corners to further analyze the model. After that, we build a Good/Bad data set containing unsuccessfully and successfully synthetic samples and corresponding latent codes for the image-quality research. Based on this data set, we propose a framework for finding good latent codes by utilizing a linear SVM. Experimental results on the recent DiverGAN generator trained on two benchmark data sets qualitatively prove the effectiveness of our presented techniques, with a better than 94% accuracy in predicting Good/Bad classes for latent vectors. The Good/Bad data set is publicly available at https://zenodo.org/record/5850224#.YeGMwP7MKUk.

READ FULL TEXT

page 1

page 4

page 5

page 6

page 7

page 8

research
02/25/2022

OptGAN: Optimizing and Interpreting the Latent Space of the Conditional Text-to-Image GANs

Text-to-image generation intends to automatically produce a photo-realis...
research
11/14/2022

Fast Text-Conditional Discrete Denoising on Vector-Quantized Latent Spaces

Conditional text-to-image generation has seen countless recent improveme...
research
10/28/2022

Latent Space is Feature Space: Regularization Term for GANs Training on Limited Dataset

Generative Adversarial Networks (GAN) is currently widely used as an uns...
research
12/12/2017

Conditional Generative Adversarial Networks for Emoji Synthesis with Word Embedding Manipulation

Emojis have become a very popular part of daily digital communication. T...
research
11/12/2018

Agent Embeddings: A Latent Representation for Pole-Balancing Networks

We show that it is possible to reduce a high-dimensional object like a n...
research
08/24/2022

GAN-based generative modelling for dermatological applications – comparative study

The lack of sufficiently large open medical databases is one of the bigg...
research
09/14/2016

Sampling Generative Networks

We introduce several techniques for sampling and visualizing the latent ...

Please sign up or login with your details

Forgot password? Click here to reset