Gender Recognition in Informal and Formal Language Scenarios via Transfer Learning

06/23/2021
by   Daniel Escobar-Grisales, et al.
0

The interest in demographic information retrieval based on text data has increased in the research community because applications have shown success in different sectors such as security, marketing, heath-care, and others. Recognition and identification of demographic traits such as gender, age, location, or personality based on text data can help to improve different marketing strategies. For instance it makes it possible to segment and to personalize offers, thus products and services are exposed to the group of greatest interest. This type of technology has been discussed widely in documents from social media. However, the methods have been poorly studied in data with a more formal structure, where there is no access to emoticons, mentions, and other linguistic phenomena that are only present in social media. This paper proposes the use of recurrent and convolutional neural networks, and a transfer learning strategy for gender recognition in documents that are written in informal and formal languages. Models are tested in two different databases consisting of Tweets and call-center conversations. Accuracies of up to 75% are achieved for both databases. The results also indicate that it is possible to transfer the knowledge from a system trained on a specific type of expressions or idioms such as those typically used in social media into a more formal type of text data, where the amount of data is more scarce and its structure is completely different.

READ FULL TEXT
research
04/28/2023

The social media use of adult New Zealanders: Evidence from an online survey

To explore social media use in New Zealand, a sample of 1001 adults aged...
research
09/30/2020

Point-of-Interest Type Inference from Social Media Text

Physical places help shape how we perceive the experiences we have there...
research
05/28/2018

A visual approach for age and gender identification on Twitter

The goal of Author Profiling (AP) is to identify demographic aspects (e....
research
03/18/2021

Gender and Racial Fairness in Depression Research using Social Media

Multiple studies have demonstrated that behavior on internet-based socia...
research
08/31/2016

Demographic Dialectal Variation in Social Media: A Case Study of African-American English

Though dialectal language is increasingly abundant on social media, few ...
research
11/07/2018

Transfer Learning from LDA to BiLSTM-CNN for Offensive Language Detection in Twitter

We investigate different strategies for automatic offensive language cla...
research
11/28/2022

A Survey of Relevant Text Mining Technology

Recent advances in text mining and natural language processing technolog...

Please sign up or login with your details

Forgot password? Click here to reset