Large-scale Gender/Age Prediction of Tumblr Users

01/02/2020
by   Yao Zhan, et al.
3

Tumblr, as a leading content provider and social media, attracts 371 million monthly visits, 280 million blogs and 53.3 million daily posts. The popularity of Tumblr provides great opportunities for advertisers to promote their products through sponsored posts. However, it is a challenging task to target specific demographic groups for ads, since Tumblr does not require user information like gender and ages during their registration. Hence, to promote ad targeting, it is essential to predict user's demography using rich content such as posts, images and social connections. In this paper, we propose graph based and deep learning models for age and gender predictions, which take into account user activities and content features. For graph based models, we come up with two approaches, network embedding and label propagation, to generate connection features as well as directly infer user's demography. For deep learning models, we leverage convolutional neural network (CNN) and multilayer perceptron (MLP) to prediction users' age and gender. Experimental results on real Tumblr daily dataset, with hundreds of millions of active users and billions of following relations, demonstrate that our approaches significantly outperform the baseline model, by improving the accuracy relatively by 81 age, and the AUC and accuracy by 5% for gender.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2021

What's in a Name? – Gender Classification of Names with Character Based Machine Learning Models

Gender information is no longer a mandatory input when registering for a...
research
06/23/2016

Gender and Interest Targeting for Sponsored Post Advertising at Tumblr

As one of the leading platforms for creative content, Tumblr offers adve...
research
02/22/2018

Sleep-deprived Fatigue Pattern Analysis using Large-Scale Selfies from Social Med

The complexities of fatigue have drawn much attention from researchers a...
research
07/24/2020

Detecting Online Hate Speech: Approaches Using Weak Supervision and Network Embedding Models

The ubiquity of social media has transformed online interactions among i...
research
05/15/2023

Text2Gender: A Deep Learning Architecture for Analysis of Blogger's Age and Gender

Deep learning techniques have gained a lot of traction in the field of N...
research
01/05/2020

User Profiling Using Hinge-loss Markov Random Fields

A variety of approaches have been proposed to automatically infer the pr...
research
10/10/2018

Inferring User Gender from User Generated Visual Content on a Deep Semantic Space

In this paper we address the task of gender classification on picture sh...

Please sign up or login with your details

Forgot password? Click here to reset