One Label, One Billion Faces: Usage and Consistency of Racial Categories in Computer Vision

02/03/2021
by Zaid Khan, et al.

Computer vision is widely deployed, has highly visible, society-altering applications, and has documented problems with bias and representation. Datasets are critical for benchmarking progress in fair computer vision, and they often employ broad racial categories as population groups for measuring group fairness. Similarly, diversity in computer vision datasets is often measured by ascribing and counting categorical race labels. However, racial categories are ill-defined, unstable across time and geography, and have a problematic history of scientific use. Although the racial categories used across datasets are superficially similar, the complexity of human race perception suggests that the racial system encoded by one dataset may be substantially inconsistent with that of another. Using the insight that a classifier can learn the racial system encoded by a dataset, we conduct an empirical study of computer vision datasets that supply categorical race labels for face images, in order to determine the cross-dataset consistency and generalization of racial categories. We find that each dataset encodes a substantially unique racial system, despite nominally equivalent racial categories, and that some racial categories are systematically less consistent than others across datasets. We also find evidence that racial categories encode stereotypes and exclude ethnic groups from categories on the basis of nonconformity to stereotypes. Representing a billion humans under one racial category may obscure disparities and create new ones by encoding stereotypes of racial systems. The difficulty of adequately converting the abstract concept of race into a tool for measuring fairness underscores the need for a method more flexible and culturally aware than racial categories.
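The idea of comparing what classifiers trained on different datasets have learned can be illustrated with a small sketch. This is not the authors' experimental code, and the abstract does not specify their metrics. It assumes only that several classifiers, each trained on a different dataset, have labeled the same set of images. The function names, the classifier names, and the placeholder labels "A", "B", "C" are all hypothetical.

```python
from itertools import combinations


def pairwise_agreement(predictions):
    """Fraction of images on which each pair of classifiers gives the same label.

    predictions: dict mapping a (hypothetical) classifier name to the list of
    labels it assigned to one shared, identically ordered set of images.
    """
    if not predictions:
        raise ValueError("predictions must not be empty")
    names = sorted(predictions)
    n = len(predictions[names[0]])
    if n == 0 or any(len(predictions[k]) != n for k in names):
        raise ValueError("every classifier must label the same non-empty image set")
    agreement = {}
    for a, b in combinations(names, 2):
        matches = sum(x == y for x, y in zip(predictions[a], predictions[b]))
        agreement[(a, b)] = matches / n
    return agreement


def per_category_consistency(predictions):
    """For each label, the share of images that received it from at least one
    classifier and on which all classifiers were unanimous."""
    names = sorted(predictions)
    touched = {}    # label -> number of images where any classifier used it
    unanimous = {}  # label -> number of those images with full agreement
    for row in zip(*(predictions[k] for k in names)):
        labels = set(row)
        for label in labels:
            touched[label] = touched.get(label, 0) + 1
            if len(labels) == 1:
                unanimous[label] = unanimous.get(label, 0) + 1
    return {label: unanimous.get(label, 0) / touched[label] for label in touched}


# Toy input: three hypothetical classifiers labeling the same four images.
example = {
    "clf_dataset_1": ["A", "A", "B", "C"],
    "clf_dataset_2": ["A", "B", "B", "C"],
    "clf_dataset_3": ["A", "A", "B", "B"],
}
print(pairwise_agreement(example))
print(per_category_consistency(example))
```

A low pairwise agreement would indicate that two datasets encode different category boundaries even though the label names match, and a low per-category score would flag a category that is applied inconsistently across datasets. Raw agreement is used here only for simplicity; a chance-corrected statistic would be a reasonable alternative.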


