Don't Forget About Pronouns: Removing Gender Bias in Language Models Without Losing Factual Gender Information

06/21/2022
by   Tomasz Limisiewicz, et al.
0

The representations in large language models contain multiple types of gender information. We focus on two types of such signals in English texts: factual gender information, which is a grammatical or semantic property, and gender bias, which is the correlation between a word and specific gender. We can disentangle the model's embeddings and identify components encoding both types of information with probing. We aim to diminish the stereotypical bias in the representations while preserving the factual gender signal. Our filtering method shows that it is possible to decrease the bias of gender-neutral profession names without significant deterioration of language modeling capabilities. The findings can be applied to language generation to mitigate reliance on stereotypes while preserving gender agreement in coreferences.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2023

Public Perceptions of Gender Bias in Large Language Models: Cases of ChatGPT and Ernie

Large language models are quickly gaining momentum, yet are found to dem...
research
12/16/2021

Gendered Language in Resumes and its Implications for Algorithmic Bias in Hiring

Despite growing concerns around gender bias in NLP models used in algori...
research
05/23/2023

Run Like a Girl! Sports-Related Gender Bias in Language and Vision

Gender bias in Language and Vision datasets and models has the potential...
research
01/23/2019

Attenuating Bias in Word Vectors

Word vector representations are well developed tools for various NLP and...
research
11/01/2019

On the Unintended Social Bias of Training Language Generation Models with Data from Local Media

There are concerns that neural language models may preserve some of the ...
research
07/16/2021

Intersectional Bias in Causal Language Models

To examine whether intersectional bias can be observed in language gener...
research
11/17/2022

Professional Presentation and Projected Power: A Case Study of Implicit Gender Information in English CVs

Gender discrimination in hiring is a pertinent and persistent bias in so...

Please sign up or login with your details

Forgot password? Click here to reset