FairFace: Face Attribute Dataset for Balanced Race, Gender, and Age

08/14/2019
by Kimmo Karkkainen, et al.

Existing public face datasets are strongly biased toward Caucasian faces, and other races (e.g., Latino) are significantly underrepresented. This can lead to inconsistent model accuracy, limit the applicability of face analytic systems to non-White race groups, and adversely affect research findings based on such skewed data. To mitigate the race bias in these datasets, we construct a novel face image dataset containing 108,501 images, with an emphasis on balanced race composition. We define 7 race groups: White, Black, Indian, East Asian, Southeast Asian, Middle East, and Latino. Images were collected from the YFCC-100M Flickr dataset and labeled with race, gender, and age groups. Evaluations were performed on existing face attribute datasets as well as novel image datasets to measure generalization performance. We find that the model trained on our dataset is substantially more accurate on novel datasets, and that its accuracy is consistent across race and gender groups.
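
One way to make the "consistent accuracy across groups" claim concrete is to compute accuracy separately for each group and look at the spread. The sketch below is illustrative only: the function names `per_group_accuracy` and `accuracy_gap` are hypothetical, the max-minus-min gap is one possible consistency measure rather than necessarily the one the authors used, and the toy labels are made up and are not results from the paper.

```python
from collections import defaultdict

# The seven race groups named in the abstract.
RACE_GROUPS = ["White", "Black", "Indian", "East Asian",
               "Southeast Asian", "Middle East", "Latino"]


def per_group_accuracy(y_true, y_pred, groups):
    """Return {group: accuracy} for predictions split by a group label."""
    correct = defaultdict(int)
    total = defaultdict(int)
    for t, p, g in zip(y_true, y_pred, groups):
        total[g] += 1
        correct[g] += int(t == p)
    return {g: correct[g] / total[g] for g in total}


def accuracy_gap(acc_by_group):
    """Largest minus smallest group accuracy; 0.0 means fully consistent."""
    values = list(acc_by_group.values())
    return max(values) - min(values)


# Toy, made-up gender predictions (assumption: NOT data from the paper).
y_true = ["F", "M", "F", "M", "F", "M"]
y_pred = ["F", "M", "M", "M", "F", "M"]
groups = ["White", "White", "Black", "Black", "Latino", "Latino"]

acc = per_group_accuracy(y_true, y_pred, groups)
gap = accuracy_gap(acc)
print(acc, gap)
```

A small gap alongside high overall accuracy would be in line with the kind of consistency the abstract describes; a large gap would point to the group-dependent behavior that a race-balanced dataset is meant to reduce.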

research
11/02/2022

Bias-Aware Face Mask Detection Dataset

In December 2019, a novel coronavirus (COVID-19) spread so quickly aroun...
research
12/01/2017

Improving Smiling Detection with Race and Gender Diversity

Recent progress in deep learning has been accompanied by a growing conce...
research
05/10/2023

Analyzing Bias in Diffusion-based Face Generation Models

Diffusion models are becoming increasingly popular in synthetic data gen...
research
03/22/2017

Can you tell where in India I am from? Comparing humans and computers on fine-grained race face classification

Faces form the basis for a rich variety of judgments in humans, yet the ...
research
06/06/2020

Enhancing Facial Data Diversity with Style-based Face Aging

A significant limiting factor in training fair classifiers relates to th...
research
10/11/2021

EDFace-Celeb-1M: Benchmarking Face Hallucination with a Million-scale Dataset

Recent deep face hallucination methods show stunning performance in supe...
research
09/06/2022

Studying Bias in GANs through the Lens of Race

In this work, we study how the performance and evaluation of generative ...
