Understanding Undesirable Word Embedding Associations

08/18/2019
by   Kawin Ethayarajh, et al.
0

Word embeddings are often criticized for capturing undesirable word associations such as gender stereotypes. However, methods for measuring and removing such biases remain poorly understood. We show that for any embedding model that implicitly does matrix factorization, debiasing vectors post hoc using subspace projection (Bolukbasi et al., 2016) is, under certain conditions, equivalent to training on an unbiased corpus. We also prove that WEAT, the most common association test for word embeddings, systematically overestimates bias. Given that the subspace projection method is provably effective, we use it to derive a new measure of association called the relational inner product association (RIPA). Experiments with RIPA reveal that, on average, skipgram with negative sampling (SGNS) does not make most words any more gendered than they are in the training corpus. However, for gender-stereotyped words, SGNS actually amplifies the gender association in the corpus.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2019

Conceptor Debiasing of Word Representations Evaluated on WEAT

Bias in word embeddings such as Word2Vec has been widely investigated, a...
research
06/03/2022

Measuring Gender Bias in Word Embeddings of Gendered Languages Requires Disentangling Grammatical Gender Signals

Does the grammatical gender of a language interfere when measuring the s...
research
03/14/2018

LSH Microbatches for Stochastic Gradients: Value in Rearrangement

Metric embeddings are immensely useful representation of interacting ent...
research
12/20/2018

What are the biases in my word embedding?

This paper presents an algorithm for enumerating biases in word embeddin...
research
12/13/2018

An Unbiased Approach to Quantification of Gender Inclination using Interpretable Word Representations

Recent advances in word embedding provide significant benefit to various...
research
06/25/2021

A Source-Criticism Debiasing Method for GloVe Embeddings

It is well-documented that word embeddings trained on large public corpo...
research
09/03/2019

Detecting Compromised Implicit Association Test Results Using Supervised Learning

An implicit association test is a human psychological test used to measu...

Please sign up or login with your details

Forgot password? Click here to reset