The Secret Lives of Names? Name Embeddings from Social Media

05/12/2019
by   Junting Ye, et al.
0

Your name tells a lot about you: your gender, ethnicity and so on. It has been shown that name embeddings are more effective in representing names than traditional substring features. However, our previous name embedding model is trained on private email data and are not publicly accessible. In this paper, we explore learning name embeddings from public Twitter data. We argue that Twitter embeddings have two key advantages: (i) they can and will be publicly released to support research community. (ii) even with a smaller training corpus, Twitter embeddings achieve similar performances on multiple tasks comparing to email embeddings. As a test case to show the power of name embeddings, we investigate the modeling of lifespans. We find it interesting that adding name embeddings can further improve the performances of models using demographic features, which are traditionally used for lifespan modeling. Through residual analysis, we observe that fine-grained groups (potentially reflecting socioeconomic status) are the latent contributing factors encoded in name embeddings. These were previously hidden to demographic models, and may help to enhance the predictive power of a wide class of research studies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/25/2017

Nationality Classification Using Name Embeddings

Nationality identification unlocks important demographic information, wi...
research
11/05/2021

SocialVec: Social Entity Embeddings

This paper introduces SocialVec, a general framework for eliciting socia...
research
03/18/2021

Gender and Racial Fairness in Depression Research using Social Media

Multiple studies have demonstrated that behavior on internet-based socia...
research
08/08/2021

Efficacy of BERT embeddings on predicting disaster from Twitter data

Social media like Twitter provide a common platform to share and communi...
research
03/30/2018

Characterizing Interconnections and Linguistic Patterns in Twitter

Social media is considered a democratic space in which people connect an...
research
07/24/2019

Linking Physicians to Medical Research Results via Knowledge Graph Embeddings and Twitter

Informing professionals about the latest research results in their field...
research
09/18/2018

Fighting Redundancy and Model Decay with Embeddings

Every day, hundreds of millions of new Tweets containing over 40 languag...

Please sign up or login with your details

Forgot password? Click here to reset