Fair GANs through model rebalancing with synthetic data

08/16/2023
by   Anubhav Jain, et al.
0

Deep generative models require large amounts of training data. This often poses a problem as the collection of datasets can be expensive and difficult, in particular datasets that are representative of the appropriate underlying distribution (e.g. demographic). This introduces biases in datasets which are further propagated in the models. We present an approach to mitigate biases in an existing generative adversarial network by rebalancing the model distribution. We do so by generating balanced data from an existing unbalanced deep generative model using latent space exploration and using this data to train a balanced generative model. Further, we propose a bias mitigation loss function that shows improvements in the fairness metric even when trained with unbalanced datasets. We show results for the Stylegan2 models while training on the FFHQ dataset for racial fairness and see that the proposed approach improves on the fairness metric by almost 5 times, whilst maintaining image quality. We further validate our approach by applying it to an imbalanced Cifar-10 dataset. Lastly, we argue that the traditionally used image quality metrics such as Frechet inception distance (FID) are unsuitable for bias mitigation problems.

READ FULL TEXT

page 1

page 11

research
07/16/2021

Measuring Fairness in Generative Models

Deep generative models have made much progress in improving training sta...
research
03/09/2022

Downstream Fairness Caveats with Synthetic Healthcare Data

This paper evaluates synthetically generated healthcare data for biases ...
research
06/20/2022

Convex space learning improves deep-generative oversampling for tabular imbalanced classification on smaller datasets

Data is commonly stored in tabular format. Several fields of research (e...
research
12/09/2020

Improving the Fairness of Deep Generative Models without Retraining

Generative Adversarial Networks (GANs) have recently advanced face synth...
research
04/07/2021

Representative Fair Synthetic Data

Algorithms learn rules and associations based on the training data that ...
research
08/05/2021

Sketch Your Own GAN

Can a user create a deep generative model by sketching a single example?...
research
09/18/2023

What is a Fair Diffusion Model? Designing Generative Text-To-Image Models to Incorporate Various Worldviews

Generative text-to-image (GTI) models produce high-quality images from s...

Please sign up or login with your details

Forgot password? Click here to reset