Reducing Gender Bias in Word-Level Language Models with a Gender-Equalizing Loss Function

05/30/2019
by   Yusu Qian, et al.
0

Gender bias exists in natural language datasets which neural language models tend to learn, resulting in biased text generation. In this research, we propose a debiasing approach based on the loss function modification. We introduce a new term to the loss function which attempts to equalize the probabilities of male and female words in the output. Using an array of bias evaluation metrics, we provide empirical evidence that our approach successfully mitigates gender bias in language models without increasing perplexity. In comparison to existing debiasing strategies, data augmentation, and word embedding debiasing, our method performs better in several aspects, especially in reducing gender bias in occupation words. Finally, we introduce a combination of data augmentation and our approach, and show that it outperforms existing strategies in all bias evaluation metrics.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/31/2018

Gender Bias in Neural Natural Language Processing

We examine whether neural natural language processing (NLP) systems refl...
research
03/10/2021

Interpretable bias mitigation for textual data: Reducing gender bias in patient notes while maintaining classification performance

Medical systems in general, and patient treatment decisions and outcomes...
research
02/24/2023

In-Depth Look at Word Filling Societal Bias Measures

Many measures of societal bias in language models have been proposed in ...
research
06/12/2023

Gender-Inclusive Grammatical Error Correction through Augmentation

In this paper we show that GEC systems display gender bias related to th...
research
04/18/2018

Gender Bias in Coreference Resolution: Evaluation and Debiasing Methods

We introduce a new benchmark, WinoBias, for coreference resolution focus...
research
10/26/2020

PowerTransformer: Unsupervised Controllable Revision for Biased Language Correction

Unconscious biases continue to be prevalent in modern text and media, ca...
research
05/24/2023

Balancing the Picture: Debiasing Vision-Language Datasets with Synthetic Contrast Sets

Vision-language models are growing in popularity and public visibility t...

Please sign up or login with your details

Forgot password? Click here to reset