Inferring gender from name: a large scale performance evaluation study

08/22/2023
by   Kriste Krstovski, et al.
0

A person's gender is a crucial piece of information when performing research across a wide range of scientific disciplines, such as medicine, sociology, political science, and economics, to name a few. However, in increasing instances, especially given the proliferation of big data, gender information is not readily available. In such cases researchers need to infer gender from readily available information, primarily from persons' names. While inferring gender from name may raise some ethical questions, the lack of viable alternatives means that researchers have to resort to such approaches when the goal justifies the means - in the majority of such studies the goal is to examine patterns and determinants of gender disparities. The necessity of name-to-gender inference has generated an ever-growing domain of algorithmic approaches and software products. These approaches have been used throughout the world in academia, industry, governmental and non-governmental organizations. Nevertheless, the existing approaches have yet to be systematically evaluated and compared, making it challenging to determine the optimal approach for future research. In this work, we conducted a large scale performance evaluation of existing approaches for name-to-gender inference. Analysis are performed using a variety of large annotated datasets of names. We further propose two new hybrid approaches that achieve better performance than any single existing approach.

READ FULL TEXT

page 5

page 13

page 14

research
09/29/2022

Temporal Analysis and Gender Bias in Computing

Recent studies of gender bias in computing use large datasets involving ...
research
02/01/2023

For the Underrepresented in Gender Bias Research: Chinese Name Gender Prediction with Heterogeneous Graph Attention Network

Achieving gender equality is an important pillar for humankind's sustain...
research
06/13/2019

Advance gender prediction tool of first names and its use in analysing gender disparity in Computer Science in the UK, Malaysia and China

Global gender disparity in science is an unsolved problem. Predicting ge...
research
06/07/2023

Gender, names and other mysteries: Towards the ambiguous for gender-inclusive translation

The vast majority of work on gender in MT focuses on 'unambiguous' input...
research
05/12/2023

Global method for gender profile estimation from distribution of first names

As social issues related to gender bias attract closer scrutiny, accurat...
research
05/13/2021

Pink for Princesses, Blue for Superheroes: The Need to Examine Gender Stereotypes in Kid's Products in Search and Recommendations

In this position paper, we argue for the need to investigate if and how ...
research
09/30/2020

Using sex and gender in survey adjustment

Accounting for sex and gender characteristics is a complex, structural c...

Please sign up or login with your details

Forgot password? Click here to reset