Burstiness Scale: a highly parsimonious model for characterizing random series of events

02/20/2016
by   Rodrigo A S Alves, et al.
0

The problem to accurately and parsimoniously characterize random series of events (RSEs) present in the Web, such as e-mail conversations or Twitter hashtags, is not trivial. Reports found in the literature reveal two apparent conflicting visions of how RSEs should be modeled. From one side, the Poissonian processes, of which consecutive events follow each other at a relatively regular time and should not be correlated. On the other side, the self-exciting processes, which are able to generate bursts of correlated events and periods of inactivities. The existence of many and sometimes conflicting approaches to model RSEs is a consequence of the unpredictability of the aggregated dynamics of our individual and routine activities, which sometimes show simple patterns, but sometimes results in irregular rising and falling trends. In this paper we propose a highly parsimonious way to characterize general RSEs, namely the Burstiness Scale (BuSca) model. BuSca views each RSE as a mix of two independent process: a Poissonian and a self-exciting one. Here we describe a fast method to extract the two parameters of BuSca that, together, gives the burstyness scale, which represents how much of the RSE is due to bursty and viral effects. We validated our method in eight diverse and large datasets containing real random series of events seen in Twitter, Yelp, e-mail conversations, Digg, and online forums. Results showed that, even using only two parameters, BuSca is able to accurately describe RSEs seen in these diverse systems, what can leverage many applications.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/15/2012

Modeling Events with Cascades of Poisson Processes

We present a probabilistic model of events in continuous time in which e...
research
08/21/2016

Mining of health and disease events on Twitter: validating search protocols within the setting of Indonesia

This study seeks to validate a search protocol of ill health-related ter...
research
09/29/2019

Annotating Antisemitic Online Content. Towards an Applicable Definition of Antisemitism

Online antisemitism is hard to quantify. How can it be measured in rapid...
research
06/26/2023

Recurring patterns in online social media interactions during highly engaging events

People nowadays express their opinions in online spaces, using different...
research
09/15/2022

How Does Twitter Account Moderation Work? Dynamics of Account Creation and Suspension During Major Geopolitical Events

Social media moderation policies are often at the center of public debat...
research
01/14/2021

On Informative Tweet Identification For Tracking Mass Events

Twitter has been heavily used as an important channel for communicating ...
research
06/22/2017

Speaker Recognition with Cough, Laugh and "Wei"

This paper proposes a speaker recognition (SRE) task with trivial speech...

Please sign up or login with your details

Forgot password? Click here to reset