Mitigating Gender Bias in Captioning Systems

06/15/2020
by Ruixiang Tang, et al.

Image captioning has made substantial progress thanks to large image collections sourced from the web. However, recent studies have pointed out that captioning datasets, such as COCO, inherit the gender bias found in web corpora. As a result, models may rely heavily on learned priors and image context for gender identification, leading to incorrect or even offensive predictions. To encourage models to learn correct gender features, we reorganize the COCO dataset and present two new splits, COCO-GB V1 and V2, in which the train and test sets have different gender-context joint distributions. Models that rely on contextual cues will suffer large gender prediction errors on the anti-stereotypical test data. Benchmarking experiments reveal that most captioning models learn gender bias, leading to high gender prediction errors, especially for women. To alleviate this unwanted bias, we propose a new Guided Attention Image Captioning model (GAIC), which provides self-guidance on visual attention to encourage the model to capture correct gender visual evidence. Experimental results validate that GAIC can significantly reduce gender prediction errors while maintaining competitive caption quality. Our code and the benchmark datasets are available at https://github.com/CaptionGenderBias2020.
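As a rough illustration of what a per-gender prediction error could look like, the sketch below counts captions whose gendered words contradict the annotated gender. It is not the paper's evaluation code: the word lists, the `caption_gender` and `gender_error_rates` helpers, and the choice to treat gender-neutral captions as non-errors are all assumptions made for this example.

```python
import re

# Hypothetical word lists chosen for illustration only; the paper's
# actual vocabulary is not given in this abstract.
FEMALE_WORDS = {"woman", "women", "girl", "girls", "lady", "she", "her"}
MALE_WORDS = {"man", "men", "boy", "boys", "guy", "he", "his", "him"}


def caption_gender(caption):
    """Return 'female', 'male', or 'neutral' based on gendered words."""
    tokens = set(re.findall(r"[a-z]+", caption.lower()))
    has_female = bool(tokens & FEMALE_WORDS)
    has_male = bool(tokens & MALE_WORDS)
    if has_female and not has_male:
        return "female"
    if has_male and not has_female:
        return "male"
    # No gendered word, or both kinds present: treat as neutral.
    return "neutral"


def gender_error_rates(captions, labels):
    """Fraction of captions per annotated gender that name the other gender.

    Assumption: a neutral caption (e.g. "a person ...") is not an error.
    """
    totals = {"female": 0, "male": 0}
    errors = {"female": 0, "male": 0}
    for caption, label in zip(captions, labels):
        predicted = caption_gender(caption)
        totals[label] += 1
        if predicted not in (label, "neutral"):
            errors[label] += 1
    return {g: (errors[g] / totals[g] if totals[g] else 0.0) for g in totals}


if __name__ == "__main__":
    captions = [
        "a man riding a skateboard",
        "a woman cooking in a kitchen",
        "a man holding a tennis racket",
        "a person sitting on a bench",
    ]
    labels = ["male", "female", "female", "male"]
    print(gender_error_rates(captions, labels))
```

Reporting the rate separately for each annotated gender makes an asymmetry, such as the higher error for women noted in the abstract, visible instead of averaging it away.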

research
12/02/2019

Exposing and Correcting the Gender Bias in Image Captioning Datasets and Models

The task of image captioning implicitly involves gender identification. ...
research
04/07/2023

Model-Agnostic Gender Debiased Image Captioning

Image captioning models are known to perpetuate and amplify harmful soci...
research
06/16/2021

Understanding and Evaluating Racial Biases in Image Captioning

Image captioning is an important task for benchmarking visual reasoning ...
research
04/10/2023

ImageCaptioner^2: Image Captioner for Image Captioning Bias Amplification Assessment

Most pre-trained learning systems are known to suffer from bias, which t...
research
03/09/2020

Deconfounded Image Captioning: A Causal Retrospect

The dataset bias in vision-language tasks is becoming one of the main pr...
research
07/29/2017

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

Language is increasingly being used to define rich visual recognition pr...
research
06/18/2022

Gender Artifacts in Visual Datasets

Gender biases are known to exist within large-scale visual datasets and ...
