Survey on Sociodemographic Bias in Natural Language Processing

06/13/2023
by   Vipul Gupta, et al.
0

Deep neural networks often learn unintended biases during training, which might have harmful effects when deployed in real-world settings. This paper surveys 209 papers on bias in NLP models, most of which address sociodemographic bias. To better understand the distinction between bias and real-world harm, we turn to ideas from psychology and behavioral economics to propose a definition for sociodemographic bias. We identify three main categories of NLP bias research: types of bias, quantifying bias, and debiasing. We conclude that current approaches on quantifying bias face reliability issues, that many of the bias metrics do not relate to real-world biases, and that current debiasing techniques are superficial and hide bias rather than removing it. Finally, we provide recommendations for future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2023

On the Origins of Bias in NLP through the Lens of the Jim Code

In this paper, we trace the biases in current natural language processin...
research
09/11/2023

Challenges in Annotating Datasets to Quantify Bias in Under-represented Society

Recent advances in artificial intelligence, including the development of...
research
04/18/2018

Modeling and Simultaneously Removing Bias via Adversarial Neural Networks

In real world systems, the predictions of deployed Machine Learned model...
research
05/30/2023

Examining risks of racial biases in NLP tools for child protective services

Although much literature has established the presence of demographic bia...
research
03/20/2023

Self-Improving-Leaderboard(SIL): A Call for Real-World Centric Natural Language Processing Leaderboards

Leaderboard systems allow researchers to objectively evaluate Natural La...
research
10/07/2020

Bias and Debias in Recommender System: A Survey and Future Directions

While recent years have witnessed a rapid growth of research papers on r...
research
09/07/2022

Power of Explanations: Towards automatic debiasing in hate speech detection

Hate speech detection is a common downstream application of natural lang...

Please sign up or login with your details

Forgot password? Click here to reset