Word Embeddings via Causal Inference: Gender Bias Reducing and Semantic Information Preserving

12/09/2021
by   Lei Ding, et al.
0

With widening deployments of natural language processing (NLP) in daily life, inherited social biases from NLP models have become more severe and problematic. Previous studies have shown that word embeddings trained on human-generated corpora have strong gender biases that can produce discriminative results in downstream tasks. Previous debiasing methods focus mainly on modeling bias and only implicitly consider semantic information while completely overlooking the complex underlying causal structure among bias and semantic components. To address these issues, we propose a novel methodology that leverages a causal inference framework to effectively remove gender bias. The proposed method allows us to construct and analyze the complex causal mechanisms facilitating gender information flow while retaining oracle semantic information within word embeddings. Our comprehensive experiments show that the proposed method achieves state-of-the-art results in gender-debiasing tasks. In addition, our methods yield better performance in word similarity evaluation and various extrinsic downstream NLP tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/25/2019

A Causal Inference Method for Reducing Gender Bias in Word Embedding Relations

Word embedding has become essential for natural language processing as i...
research
04/14/2021

[RE] Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation

Despite widespread use in natural language processing (NLP) tasks, word ...
research
11/24/2019

Causally Denoise Word Embeddings Using Half-Sibling Regression

Distributional representations of words, also known as word vectors, hav...
research
05/04/2023

CausalAPM: Generalizable Literal Disentanglement for NLU Debiasing

Dataset bias, i.e., the over-reliance on dataset-specific literal heuris...
research
04/07/2020

Neutralizing Gender Bias in Word Embedding with Latent Disentanglement and Counterfactual Generation

Recent researches demonstrate that word embeddings, trained on the human...
research
09/13/2019

A General Framework for Implicit and Explicit Debiasing of Distributional Word Vector Spaces

Distributional word vectors have recently been shown to encode many of t...
research
04/26/2020

Causal Mediation Analysis for Interpreting Neural NLP: The Case of Gender Bias

Common methods for interpreting neural models in natural language proces...

Please sign up or login with your details

Forgot password? Click here to reset