Predicting gender of Brazilian names using deep learning

06/18/2021
by   Rosana C. B. Rego, et al.
0

Predicting gender by the name is not a simple task. In many applications, especially in the natural language processing (NLP) field, this task may be necessary, mainly when considering foreign names. Some machine learning algorithms can satisfactorily perform the prediction. In this paper, we examined and implemented feedforward and recurrent deep neural network models, such as MLP, RNN, GRU, CNN, and BiLSTM, to classify gender through the first name. A dataset of Brazilian names is used to train and evaluate the models. We analyzed the accuracy, recall, precision, and confusion matrix to measure the models' performances. The results indicate that the gender prediction can be performed from the feature extraction strategy looking at the names as a set of strings. Some models accurately predict the gender in more than 90 cases. The recurrent models overcome the feedforward models in this binary classification problem.

READ FULL TEXT
research
02/07/2021

What's in a Name? – Gender Classification of Names with Character Based Machine Learning Models

Gender information is no longer a mandatory input when registering for a...
research
07/22/2017

Predicting the Gender of Indonesian Names

We investigated a way to predict the gender of a name using character-le...
research
06/17/2016

Gender Inference using Statistical Name Characteristics in Twitter

Much attention has been given to the task of gender inference of Twitter...
research
10/21/2020

Gender Prediction Based on Vietnamese Names with Machine Learning Techniques

As biological gender is one of the aspects of presenting individual huma...
research
12/04/2019

Handwriting-Based Gender Classification Using End-to-End Deep Neural Networks

Handwriting-based gender classification is a well-researched problem tha...
research
02/25/2019

Neural Reverse Engineering of Stripped Binaries

We address the problem of predicting procedure names in stripped executa...
research
10/27/2020

It's All in the Name: A Character Based Approach To Infer Religion

Demographic inference from text has received a surge of attention in the...

Please sign up or login with your details

Forgot password? Click here to reset