Exploring the Linear Subspace Hypothesis in Gender Bias Mitigation

09/20/2020
by Francisco Vargas, et al.

Bolukbasi et al. (2016) present one of the first gender bias mitigation techniques for word embeddings. Their method takes pre-trained word embeddings as input and attempts to isolate a linear subspace that captures most of the gender bias in the embeddings. As judged by an analogical evaluation task, their method virtually eliminates gender bias in the embeddings. However, an implicit and untested assumption of their method is that the bias subspace is actually linear. In this work, we generalize their method to a kernelized, non-linear version. We take inspiration from kernel principal component analysis and derive a non-linear bias isolation technique. We discuss and overcome some of the practical drawbacks of our method for non-linear gender bias mitigation in word embeddings, and we analyze empirically whether the bias subspace is actually linear. Our analysis shows that gender bias is in fact well captured by a linear subspace, justifying the assumption of Bolukbasi et al. (2016).


Related research

05/03/2020  Double-Hard Debias: Tailoring Word Embeddings for Gender Bias Mitigation
Word embeddings derived from human-generated corpora inherit strong gend...

08/07/2019  Debiasing Embeddings for Reduced Gender Bias in Text Classification
(Bolukbasi et al., 2016) demonstrated that pretrained word embeddings ca...

06/01/2021  Gender Bias Hidden Behind Chinese Word Embeddings: The Case of Chinese Adjectives
Gender bias in word embeddings gradually becomes a vivid research field ...

10/22/2019  Grammatical Gender, Neo-Whorfianism, and Word Embeddings: A Data-Driven Approach to Linguistic Relativity
The relation between language and thought has occupied linguists for at ...

05/16/2020  Towards classification parity across cohorts
Recently, there has been a lot of interest in ensuring algorithmic fairn...

01/28/2022  Linear Adversarial Concept Erasure
Modern neural models trained on textual data rely on pre-trained represe...

07/27/2023  A Geometric Notion of Causal Probing
Large language models rely on real-valued representations of text to mak...
