A Study of WhatsApp Usage Patterns and Prediction Models without Message Content

02/09/2018
by Avi Rosenfeld, et al.

Internet social networks have become ubiquitous applications that allow people to easily share text, pictures, and audio and video files. Popular networks include WhatsApp, Facebook, Reddit and LinkedIn. We present an extensive study of the usage of WhatsApp, an Internet messaging application that is quickly replacing SMS messaging. To better understand how people use the network, we analyze over 6 million messages from over 100 users, with the objective of building demographic prediction models from activity data. Our statistical and numerical analysis of the data found that users of different genders and age groups had significantly different usage habits in almost all message and group attributes. We also fed the data into the Weka data mining package and studied models created with decision tree and Bayesian network algorithms. In addition, we noted differences in users' group behavior and created prediction models for it, including whether a given group would have relatively more file attachments, a larger number of participants, a higher frequency of activity, quicker response times, and shorter messages. We were able to quantify and predict a user's gender and age demographic, as well as different types of group usage. All models were built without analyzing message content. We present a detailed discussion of the specific attributes that appeared in all predictive models and suggest possible applications based on these results.
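As a rough illustration of what "activity data without message content" could look like, the sketch below aggregates per-user attributes (message count, mean message length, attachment ratio, number of groups) and a per-group mean response time from metadata records alone. The `MessageMeta` record, its field names, and both helper functions are hypothetical and are not taken from the paper. The study itself reports using Weka, so this Python sketch only indicates the kind of attributes such a model might consume.

```python
from dataclasses import dataclass
from statistics import mean


@dataclass
class MessageMeta:
    """Hypothetical metadata record; the message text itself is never stored."""
    sender: str
    group: str
    timestamp: float      # seconds since some reference point
    length: int           # character count only
    has_attachment: bool


def user_features(messages):
    """Aggregate per-sender activity attributes from metadata only."""
    by_sender = {}
    for m in messages:
        by_sender.setdefault(m.sender, []).append(m)

    features = {}
    for sender, msgs in by_sender.items():
        features[sender] = {
            "message_count": len(msgs),
            "mean_length": mean(m.length for m in msgs),
            "attachment_ratio": sum(m.has_attachment for m in msgs) / len(msgs),
            "group_count": len({m.group for m in msgs}),
        }
    return features


def mean_response_time(messages, group):
    """Mean gap (seconds) between consecutive messages by different senders in a group.

    Returns None when the group has no such pair of messages.
    """
    ordered = sorted(
        (m for m in messages if m.group == group),
        key=lambda m: m.timestamp,
    )
    gaps = [
        later.timestamp - earlier.timestamp
        for earlier, later in zip(ordered, ordered[1:])
        if earlier.sender != later.sender
    ]
    return mean(gaps) if gaps else None
```

Feature dictionaries of this shape could then be exported to a tabular format and passed to a decision tree or Bayesian network learner. Which attributes the authors actually used is described in the full text, not here.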


research
08/15/2021

How we browse: Measurement and analysis of digital behavior

Accurately analyzing and modeling online browsing behavior play a key ro...
research
12/01/2016

Analysis of the Human-Computer Interaction on the Example of Image-based CAPTCHA by Association Rule Mining

The paper analyzes the interaction between humans and computers in terms...
research
03/01/2020

User profiling using smartphone network traffic analysis

The recent decade has witnessed phenomenal growth in communication techn...
research
04/02/2018

Analyzing and characterizing political discussions in WhatsApp public groups

We present a thorough characterization of what we believe to be the firs...
research
04/09/2020

PANDORA Talks: Personality and Demographics on Reddit

Personality and demographics are important variables in social sciences,...
research
11/12/2018

Not Just Depressed: Bipolar Disorder Prediction on Reddit

Bipolar disorder, an illness characterized by manic and depressive episo...
research
07/16/2018

Methods, Forms and Safety of Learning in Corporate Social Networks

The paper discusses methods, forms and safety issues of social network u...
