Identifying and Reducing Gender Bias in Word-Level Language Models

04/05/2019
by   Shikha Bordia, et al.
0

Many text corpora exhibit socially problematic biases, which can be propagated or amplified in the models trained on such data. For example, doctor cooccurs more frequently with male pronouns than female pronouns. In this study we (i) propose a metric to measure gender bias; (ii) measure bias in a text corpus and the text generated from a recurrent neural network language model trained on the text corpus; (iii) propose a regularization loss term for the language model that minimizes the projection of encoder-trained embeddings onto an embedding subspace that encodes gender; (iv) finally, evaluate efficacy of our proposed method on reducing gender bias. We find this regularization method to be effective in reducing gender bias up to an optimal weight assigned to the loss term, beyond which the model becomes unstable as the perplexity increases. We replicate this study on three training corpora---Penn Treebank, WikiText-2, and CNN/Daily Mail---resulting in similar conclusions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/22/2018

Reducing Gender Bias in Abusive Language Detection

Abusive language detection models tend to have a problem of being biased...
research
07/29/2017

Men Also Like Shopping: Reducing Gender Bias Amplification using Corpus-level Constraints

Language is increasingly being used to define rich visual recognition pr...
research
07/16/2023

Analysing Gender Bias in Text-to-Image Models using Object Detection

This work presents a novel strategy to measure bias in text-to-image mod...
research
10/12/2021

Deep Learning for Bias Detection: From Inception to Deployment

To create a more inclusive workplace, enterprises are actively investing...
research
12/14/2022

Unsupervised Detection of Contextualized Embedding Bias with Application to Ideology

We propose a fully unsupervised method to detect bias in contextualized ...
research
09/07/2017

Cynical Selection of Language Model Training Data

The Moore-Lewis method of "intelligent selection of language model train...
research
11/21/2022

Validating Large Language Models with ReLM

Although large language models (LLMs) have been touted for their ability...

Please sign up or login with your details

Forgot password? Click here to reset