Lipstick on a Pig: Debiasing Methods Cover up Systematic Gender Biases in Word Embeddings But do not Remove Them

03/09/2019
by   Hila Gonen, et al.
0

Word embeddings are widely used in NLP for a vast range of tasks. It was shown that word embeddings derived from text corpora reflect gender biases in society. This phenomenon is pervasive and consistent across different word embedding models, causing serious concern. Several recent works tackle this problem, and propose methods for significantly reducing this gender bias in word embeddings, demonstrating convincing results. However, we argue that this removal is superficial. While the bias is indeed substantially reduced according to the provided bias definition, the actual effect is mostly hiding the bias, not removing it. The gender bias information is still reflected in the distances between "gender-neutralized" words in the debiased embeddings, and can be recovered from them. We present a series of experiments to support this claim, for two debiasing methods. We conclude that existing bias removal techniques are insufficient, and should not be trusted for providing gender-neutral modeling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2019

Conceptor Debiasing of Word Representations Evaluated on WEAT

Bias in word embeddings such as Word2Vec has been widely investigated, a...
research
10/30/2020

"Thy algorithm shalt not bear false witness": An Evaluation of Multiclass Debiasing Methods on Word Embeddings

With the vast development and employment of artificial intelligence appl...
research
07/21/2021

Using Adversarial Debiasing to Remove Bias from Word Embeddings

Word Embeddings have been shown to contain the societal biases present i...
research
06/29/2021

Sexism in the Judiciary

We analyze 6.7 million case law documents to determine the presence of g...
research
06/20/2016

Quantifying and Reducing Stereotypes in Word Embeddings

Machine learning algorithms are optimized to model statistical propertie...
research
09/02/2020

Gender Stereotype Reinforcement: Measuring the Gender Bias Conveyed by Ranking Algorithms

Search Engines (SE) have been shown to perpetuate well-known gender ster...
research
08/29/2018

Learning Gender-Neutral Word Embeddings

Word embedding models have become a fundamental component in a wide rang...

Please sign up or login with your details

Forgot password? Click here to reset