FACET: Fairness in Computer Vision Evaluation Benchmark

08/31/2023
by Laura Gustafson, et al.

Computer vision models have known performance disparities across attributes such as gender and skin tone. This means that during tasks such as classification and detection, model performance differs for certain classes based on the demographics of the people in the image. These disparities have been shown to exist, but until now there has not been a unified approach to measure these differences for common use cases of computer vision models. We present a new benchmark named FACET (FAirness in Computer Vision EvaluaTion), a large, publicly available evaluation set of 32k images for some of the most common vision tasks: image classification, object detection, and segmentation. For every image in FACET, we hired expert reviewers to manually annotate person-related attributes such as perceived skin tone and hair type, draw bounding boxes, and label fine-grained person-related classes such as disk jockey or guitarist. In addition, we use FACET to benchmark state-of-the-art vision models and present a deeper understanding of potential performance disparities and challenges across sensitive demographic attributes. With the exhaustive annotations collected, we probe models using single demographic attributes as well as multiple attributes using an intersectional approach (e.g. hair color and perceived skin tone). Our results show that classification, detection, segmentation, and visual grounding models exhibit performance disparities across demographic attributes and intersections of attributes. These harms suggest that not all people represented in datasets receive fair and equitable treatment in these vision tasks. We hope current and future results using our benchmark will contribute to fairer, more robust vision models. FACET is available publicly at https://facet.metademolab.com/
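
To make the idea of probing by single and intersecting attributes concrete, the following is a minimal sketch of a disaggregated evaluation. It is not taken from the FACET release: the record layout, the function names `recall_by_group` and `max_disparity`, the placeholder attribute values, and the choice of a max-minus-min recall gap as the disparity measure are all assumptions made for illustration, and the records are made-up toy data rather than benchmark results.

```python
from collections import defaultdict

def recall_by_group(records, group_keys):
    """Fraction of correct predictions per group (hypothetical helper).

    records: list of dicts, each with a boolean 'correct' field and one
             field per annotated attribute.
    group_keys: attribute names to group by; one key gives a
                single-attribute view, several give an intersectional one.
    """
    hits = defaultdict(int)
    totals = defaultdict(int)
    for record in records:
        group = tuple(record[key] for key in group_keys)
        totals[group] += 1
        hits[group] += int(record["correct"])
    return {group: hits[group] / totals[group] for group in totals}

def max_disparity(recalls):
    """Gap between the best- and worst-served groups (assumed measure)."""
    return max(recalls.values()) - min(recalls.values())

# Toy records with placeholder attribute values; NOT real FACET annotations.
records = [
    {"skin_tone": "a", "hair_color": "x", "correct": True},
    {"skin_tone": "a", "hair_color": "x", "correct": True},
    {"skin_tone": "a", "hair_color": "y", "correct": True},
    {"skin_tone": "a", "hair_color": "y", "correct": False},
    {"skin_tone": "b", "hair_color": "x", "correct": True},
    {"skin_tone": "b", "hair_color": "x", "correct": False},
    {"skin_tone": "b", "hair_color": "y", "correct": False},
    {"skin_tone": "b", "hair_color": "y", "correct": False},
]

# Single-attribute probe.
single = recall_by_group(records, ["skin_tone"])
# Intersectional probe over two attributes at once.
intersectional = recall_by_group(records, ["skin_tone", "hair_color"])

print(single, max_disparity(single))
print(intersectional, max_disparity(intersectional))
```

In this toy data the intersectional view shows a wider gap than the single-attribute view, which is the kind of effect an intersectional probe is meant to surface; a real evaluation would also need to account for group sizes and per-class breakdowns.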


