What's in a Name? – Gender Classification of Names with Character Based Machine Learning Models

02/07/2021
by   Yifan Hu, et al.
0

Gender information is no longer a mandatory input when registering for an account at many leading Internet companies. However, prediction of demographic information such as gender and age remains an important task, especially in intervention of unintentional gender/age bias in recommender systems. Therefore it is necessary to infer the gender of those users who did not to provide this information during registration. We consider the problem of predicting the gender of registered users based on their declared name. By analyzing the first names of 100M+ users, we found that genders can be very effectively classified using the composition of the name strings. We propose a number of character based machine learning models, and demonstrate that our models are able to infer the gender of users with much higher accuracy than baseline models. Moreover, we show that using the last names in addition to the first names improves classification performance further.

READ FULL TEXT

page 4

page 5

research
07/22/2017

Predicting the Gender of Indonesian Names

We investigated a way to predict the gender of a name using character-le...
research
06/18/2021

Predicting gender of Brazilian names using deep learning

Predicting gender by the name is not a simple task. In many applications...
research
01/02/2020

Large-scale Gender/Age Prediction of Tumblr Users

Tumblr, as a leading content provider and social media, attracts 371 mil...
research
02/01/2023

For the Underrepresented in Gender Bias Research: Chinese Name Gender Prediction with Heterogeneous Graph Attention Network

Achieving gender equality is an important pillar for humankind's sustain...
research
05/26/2023

Gender, Smoking History and Age Prediction from Laryngeal Images

Flexible laryngoscopy is commonly performed by otolaryngologists to dete...
research
10/27/2020

It's All in the Name: A Character Based Approach To Infer Religion

Demographic inference from text has received a surge of attention in the...
research
12/18/2020

Small Business Classification By Name: Addressing Gender and Geographic Origin Biases

Small business classification is a difficult and important task within m...

Please sign up or login with your details

Forgot password? Click here to reset