Gender Gap in Natural Language Processing Research: Disparities in Authorship and Citations

05/03/2020
by Saif M. Mohammad, et al.

Disparities in authorship and citations across gender can have substantial adverse consequences not just on the disadvantaged genders, but also on the field of study as a whole. Measuring gender gaps is a crucial step towards addressing them. In this work, we examine female first author percentages and the citations to their papers in Natural Language Processing (1965 to 2019). We determine aggregate-level statistics using an existing manually curated author–gender list as well as first names strongly associated with a gender. We find that only about 29% of first authors are female and only about 25% of last authors are female. Notably, this percentage has not improved since the mid 2000s. We also show that, on average, female first authors are cited less than male first authors, even when controlling for experience and area of research. Finally, we discuss the ethical considerations involved in automatic demographic analysis.