A Survey on Gender Bias in Natural Language Processing

12/28/2021
by   Karolina Stańczak, et al.

Language can be used as a means of reproducing and enforcing harmful stereotypes and biases, and it has been analysed as such in numerous studies. In this paper, we present a survey of 304 papers on gender bias in natural language processing. We analyse definitions of gender and its categories within the social sciences and connect them to formal definitions of gender bias in NLP research. We survey lexica and datasets applied in research on gender bias and then compare and contrast approaches to detecting and mitigating gender bias. We find that research on gender bias suffers from four core limitations. 1) Most research treats gender as a binary variable, neglecting its fluidity and continuity. 2) Most of the work has been conducted in monolingual setups for English or other high-resource languages. 3) Despite a myriad of papers on gender bias in NLP methods, we find that most newly developed algorithms are not tested for bias and that the possible ethical considerations of the work are disregarded. 4) Finally, the methodologies developed in this line of research are fundamentally flawed, covering very limited definitions of gender bias and lacking evaluation baselines and pipelines. We suggest recommendations towards overcoming these limitations as a guide for future research.

research
06/21/2019

Mitigating Gender Bias in Natural Language Processing: Literature Review

As Natural Language Processing (NLP) and Machine Learning (ML) tools ris...
research
05/05/2022

Theories of "Gender" in NLP Bias Research

The rise of concern around Natural Language Processing (NLP) technologie...
research
01/12/2023

Much Ado About Gender: Current Practices and Future Recommendations for Appropriate Gender-Aware Information Access

Information access research (and development) sometimes makes use of gen...
research
05/03/2020

Gender Gap in Natural Language Processing Research: Disparities in Authorship and Citations

Disparities in authorship and citations across gender can have substanti...
research
05/28/2020

Language (Technology) is Power: A Critical Survey of "Bias" in NLP

We survey 146 papers analyzing "bias" in NLP systems, finding that their...
research
06/04/2023

Taught by the Internet, Exploring Bias in OpenAIs GPT3

This research delves into the current literature on bias in Natural Lang...
research
06/18/2023

Gender Bias in Transformer Models: A comprehensive survey

Gender bias in artificial intelligence (AI) has emerged as a pressing co...
