StackGAN: Facial Image Generation Optimizations

08/30/2021
by Badr Belhiti, et al.

Current state-of-the-art photorealistic generators are computationally expensive, involve unstable training processes, and produce synthetic distributions that differ from the real distribution in higher-dimensional spaces. To address these issues, we propose a variant of the StackGAN architecture that uses conditional generators to construct an image in multiple stages. In our model, we generate grayscale facial images in two stages: noise to edges (stage one) and edges to grayscale (stage two). Trained on the CelebA facial image dataset, our model achieved a Fréchet Inception Distance (FID) score of 73 for edge images and 59 for grayscale images generated from the synthetic edge images. Although these results are subpar relative to state-of-the-art models, dropout layers could reduce the overfitting in our conditional mapping. Additionally, since most images can be broken down into important features, improvements to our model can generalize to other datasets. Therefore, our model can potentially serve as a superior alternative to traditional means of generating photorealistic images.
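
The FID scores quoted in the abstract measure the Fréchet distance between Gaussian fits of real and generated image features (lower is better). The sketch below is only an illustration of that distance, not the authors' evaluation code: it assumes diagonal covariances so that it runs with the standard library alone, whereas the standard FID uses Inception-v3 features and full covariance matrices. The function names `feature_stats` and `frechet_distance_diag` are hypothetical.

```python
import math


def feature_stats(features):
    """Per-dimension mean and population variance of a list of feature vectors."""
    n = len(features)
    dim = len(features[0])
    means = [sum(f[i] for f in features) / n for i in range(dim)]
    variances = [
        sum((f[i] - means[i]) ** 2 for f in features) / n for i in range(dim)
    ]
    return means, variances


def frechet_distance_diag(mu1, var1, mu2, var2):
    """Frechet distance between two Gaussians with diagonal covariances.

    Assumption: off-diagonal covariance terms are ignored, so the trace term
    reduces to a per-dimension sum. The full FID needs a matrix square root.
    """
    if not (len(mu1) == len(var1) == len(mu2) == len(var2)):
        raise ValueError("all inputs must have the same length")
    # Squared Euclidean distance between the two mean vectors.
    mean_term = sum((a - b) ** 2 for a, b in zip(mu1, mu2))
    # Diagonal special case of Tr(S1 + S2 - 2 * sqrt(S1 * S2)).
    cov_term = sum(a + b - 2.0 * math.sqrt(a * b) for a, b in zip(var1, var2))
    return mean_term + cov_term
```

In practice, `feature_stats` would be applied once to features of real images and once to features of generated images, and the two results passed to `frechet_distance_diag`. Identical statistics give a distance of zero.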


