Markedness in Visual Semantic AI

05/23/2022
by   Robert Wolfe, et al.
0

We evaluate the state-of-the-art multimodal "visual semantic" model CLIP ("Contrastive Language Image Pretraining") for biases related to the marking of age, gender, and race or ethnicity. Given the option to label an image as "a photo of a person" or to select a label denoting race or ethnicity, CLIP chooses the "person" label 47.9 with 5.0 Indian, or Latino or Hispanic. The model is more likely to rank the unmarked "person" label higher than labels denoting gender for Male individuals (26.7 of the time) vs. Female individuals (15.2 individual is marked by the model: Female individuals under the age of 20 are more likely than Male individuals to be marked with a gender label, but less likely to be marked with an age label, while Female individuals over the age of 40 are more likely to be marked based on age than Male individuals. We also examine the self-similarity (mean pairwise cosine similarity) for each social group, where higher self-similarity denotes greater attention directed by CLIP to the shared characteristics (age, race, or gender) of the social group. As age increases, the self-similarity of representations of Female individuals increases at a higher rate than for Male individuals, with the disparity most pronounced at the "more than 70" age range. All ten of the most self-similar social groups are individuals under the age of 10 or over the age of 70, and six of the ten are Female individuals. Existing biases of self-similarity and markedness between Male and Female gender groups are further exacerbated when the groups compared are individuals who are White and Male and individuals who are Black and Female. Results indicate that CLIP reflects the biases of the language and society which produced its training data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/01/2022

American == White in Multimodal Language-and-Image AI

Three state-of-the-art language-and-image AI models, CLIP, SLIP, and BLI...
research
04/21/2021

Stormwater on the Margins: Influence of Race, Gender, and Education on Willingness to Participate in Stormwater Management

Stormwater has immense impacts on urban flooding and water quality, leav...
research
08/29/2022

Evolving Label Usage within Generation Z when Self-Describing Sexual Orientation

Evaluating change in ranked term importance in a growing corpus is a pow...
research
05/22/2022

Evidence for Hypodescent in Visual Semantic AI

We examine the state-of-the-art multimodal "visual semantic" model CLIP ...
research
10/08/2019

Stochastic modeling of hyposmotic lysis and characterization of different osmotic stability subgroups of human erythrocytes

This study proposes a novel stochastic model for the study of hyposmotic...
research
03/14/2022

Contrastive Visual Semantic Pretraining Magnifies the Semantics of Natural Language Representations

We examine the effects of contrastive visual semantic pretraining by com...
research
03/22/2021

Higher-order Homophily is Combinatorially Impossible

Homophily is the seemingly ubiquitous tendency for people to connect wit...

Please sign up or login with your details

Forgot password? Click here to reset