A Prompt Array Keeps the Bias Away: Debiasing Vision-Language Models with Adversarial Learning

03/22/2022
by   Hugo Elias Berg, et al.
2

Vision-language models can encode societal biases and stereotypes, but there are challenges to measuring and mitigating these harms. Prior proposed bias measurements lack robustness and feature degradation occurs when mitigating bias without access to pretraining data. We address both of these challenges in this paper: First, we evaluate different bias measures and propose the use of retrieval metrics to image-text representations via a bias measuring framework. Second, we investigate debiasing methods and show that optimizing for adversarial loss via learnable token embeddings minimizes various bias measures without substantially degrading feature representations.

READ FULL TEXT
research
04/30/2021

Mitigating Political Bias in Language Models Through Reinforced Calibration

Current large-scale language models can be politically biased as a resul...
research
01/25/2021

Diverse Adversaries for Mitigating Bias in Training

Adversarial learning can learn fairer and less biased models of language...
research
02/20/2020

Measuring Social Biases in Grounded Vision and Language Embeddings

We generalize the notion of social biases from language embeddings to gr...
research
03/20/2022

Mitigating Gender Bias in Machine Translation through Adversarial Learning

Machine translation and other NLP systems often contain significant bias...
research
12/20/2022

Understanding Stereotypes in Language Models: Towards Robust Measurement and Zero-Shot Debiasing

Generated texts from large pretrained language models have been shown to...
research
02/24/2023

In-Depth Look at Word Filling Societal Bias Measures

Many measures of societal bias in language models have been proposed in ...
research
04/06/2023

Uncurated Image-Text Datasets: Shedding Light on Demographic Bias

The increasing tendency to collect large and uncurated datasets to train...

Please sign up or login with your details

Forgot password? Click here to reset