Toward responsible face datasets: modeling the distribution of a disentangled latent space for sampling face images from demographic groups

09/15/2023
by   Parsa Rahimi, et al.
0

Recently, it has been exposed that some modern facial recognition systems could discriminate specific demographic groups and may lead to unfair attention with respect to various facial attributes such as gender and origin. The main reason are the biases inside datasets, unbalanced demographics, used to train theses models. Unfortunately, collecting a large-scale balanced dataset with respect to various demographics is impracticable. In this paper, we investigate as an alternative the generation of a balanced and possibly bias-free synthetic dataset that could be used to train, to regularize or to evaluate deep learning-based facial recognition models. We propose to use a simple method for modeling and sampling a disentangled projection of a StyleGAN latent space to generate any combination of demographic groups (e.g. hispanic-female). Our experiments show that we can synthesis any combination of demographic groups effectively and the identities are different from the original training dataset. We also released the source code.

READ FULL TEXT

page 1

page 5

page 7

page 8

research
04/17/2023

A Real Balanced Dataset For Understanding Bias? Factors That Impact Accuracy, Not Numbers of Identities and Images

The issue of disparities in face recognition accuracy across demographic...
research
08/07/2023

Balanced Face Dataset: Guiding StyleGAN to Generate Labeled Synthetic Face Image Dataset for Underrepresented Group

For a machine learning model to generalize effectively to unseen data wi...
research
05/12/2023

Zero-shot racially balanced dataset generation using an existing biased StyleGAN2

Facial recognition systems have made significant strides thanks to data-...
research
10/11/2022

Gender Stereotyping Impact in Facial Expression Recognition

Facial Expression Recognition (FER) uses images of faces to identify the...
research
03/16/2021

Balancing Biases and Preserving Privacy on Balanced Faces in the Wild

There are demographic biases in the SOTA CNN used for FR. Our BFW datase...
research
12/04/2019

Algorithmic Discrimination: Formulation and Exploration in Deep Learning-based Face Biometrics

The most popular face recognition benchmarks assume a distribution of su...
research
09/30/2018

Identifying Bias in AI using Simulation

Machine learned models exhibit bias, often because the datasets used to ...

Please sign up or login with your details

Forgot password? Click here to reset