Nurse is Closer to Woman than Surgeon? Mitigating Gender-Biased Proximities in Word Embeddings

06/02/2020
by Vaibhav Kumar, et al.

Word embeddings are the standard model for semantic and syntactic representations of words. Unfortunately, these models have been shown to exhibit undesirable word associations resulting from gender, racial, and religious biases. Existing post-processing methods for debiasing word embeddings are unable to mitigate gender bias hidden in the spatial arrangement of word vectors. In this paper, we propose RAN-Debias, a novel gender debiasing methodology which not only eliminates the bias present in a word vector but also alters the spatial distribution of its neighbouring vectors, achieving a bias-free setting while maintaining minimal semantic offset. We also propose a new bias evaluation metric, the Gender-based Illicit Proximity Estimate (GIPE), which measures the extent of undue proximity in word vectors resulting from the presence of gender-based predilections. Experiments based on a suite of evaluation metrics show that RAN-Debias significantly outperforms the state-of-the-art in reducing proximity bias (GIPE) by at least 42.02%. It also reduces direct bias, adds minimal semantic disturbance, and achieves the best performance in a downstream application task (coreference resolution).
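
To make the distinction between direct bias and proximity bias concrete, here is a minimal sketch. It is not the paper's GIPE metric or the RAN-Debias procedure, whose definitions are not given in this abstract. The toy 3-dimensional vectors in `EMB`, the choice of "he" and "she" as seed words, and all function names are assumptions made purely for illustration.

```python
import math

# Hypothetical toy embeddings, invented for this illustration only.
EMB = {
    "he":           [1.0, 0.0, 0.0],
    "she":          [-1.0, 0.0, 0.0],
    "nurse":        [-0.6, 0.8, 0.0],
    "receptionist": [-0.5, 0.7, 0.1],
    "surgeon":      [0.5, 0.8, 0.2],
    "engineer":     [0.6, 0.1, 0.8],
    "table":        [0.0, 0.1, 1.0],
}
SEEDS = ("he", "she")  # assumed gender-defining words


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def cosine(u, v):
    return dot(u, v) / (math.sqrt(dot(u, u)) * math.sqrt(dot(v, v)))


def gender_direction(emb):
    """Unit vector pointing from 'she' to 'he' (an assumed bias axis)."""
    diff = [a - b for a, b in zip(emb["he"], emb["she"])]
    length = math.sqrt(dot(diff, diff))
    return [x / length for x in diff]


def direct_bias(emb, word, g):
    """Cosine between a word vector and the gender direction.
    Positive leans toward 'he', negative toward 'she'."""
    return cosine(emb[word], g)


def nearest_neighbours(emb, word, k):
    """The k words closest to `word` by cosine, excluding seed words."""
    others = [w for w in emb if w != word and w not in SEEDS]
    others.sort(key=lambda w: cosine(emb[word], emb[w]), reverse=True)
    return others[:k]


def mean_neighbour_bias(emb, word, g, k=2):
    """Average absolute direct bias over the k nearest neighbours.
    A crude stand-in for 'bias hidden in the neighbourhood'."""
    nbrs = nearest_neighbours(emb, word, k)
    return sum(abs(direct_bias(emb, w, g)) for w in nbrs) / len(nbrs)


G = gender_direction(EMB)

if __name__ == "__main__":
    for w in ("nurse", "surgeon"):
        print(w, round(direct_bias(EMB, w, G), 3),
              nearest_neighbours(EMB, w, 2),
              round(mean_neighbour_bias(EMB, w, G), 3))
```

In this toy setup, projecting "nurse" off the gender axis would zero its direct bias, yet its nearest neighbours would still be gender-leaning words. That residual neighbourhood structure is the kind of effect the abstract says existing post-processing methods leave behind.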


research
11/25/2019

A Causal Inference Method for Reducing Gender Bias in Word Embedding Relations

Word embedding has become essential for natural language processing as i...
research
05/18/2020

Grammatical gender associations outweigh topical gender bias in crosslinguistic word embeddings

Recent research has demonstrated that vector space models of semantics c...
research
06/01/2021

Gender Bias Hidden Behind Chinese Word Embeddings: The Case of Chinese Adjectives

Gender bias in word embeddings gradually becomes a vivid research field ...
research
06/20/2020

MDR Cluster-Debias: A Nonlinear WordEmbedding Debiasing Pipeline

Existing methods for debiasing word embeddings often do so only superfic...
research
04/13/2021

On the interpretation and significance of bias metrics in texts: a PMI-based approach

In recent years, the use of word embeddings has become popular to measur...
research
05/26/2022

Do interests affect grant application success? The role of organizational proximity

Bias in grant allocation is a critical issue, as the expectation is that...
research
09/28/2021

Marked Attribute Bias in Natural Language Inference

Reporting and providing test sets for harmful bias in NLP applications i...
