Upscaling human activity data: an ecological perspective

12/06/2019
by Anna Tovo, et al.

In recent years we have witnessed an explosion of data collected on different human dynamics, from email communication to social network activity. Extracting useful information from these huge data sets represents a major challenge. In recent decades, statistical regularities have been widely observed in human activities, and various models have been proposed. Here we move from modeling to inference and propose a statistical framework capable of predicting global features of human activities from local knowledge. We consider four data sets of human activities: email communication, Twitter posts, Wikipedia articles and Gutenberg books. From the statistics of local activities, such as sent emails per sender, posts per hashtag and word occurrences, collected in a small sample of the considered data set, we infer global features, such as the number of senders, hashtags and words at the global scale. Our estimates are robust and accurate, with a small relative error. Moreover, we predict how the abundance of a hashtag or of a word may change across scales. Thus, by observing a small portion of tweets and the popularity of a given hashtag among them, we can estimate whether it will remain popular in the unseen part of the network. Our approach is grounded in statistical ecology, as we find that the inference of unseen hallmarks of human activity can be mapped onto the unseen species problem in biodiversity. Our findings may have applications in different areas, from resource management in emails to collective attention monitoring on Twitter and to language learning processes in word databases.
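To make the mapping onto the unseen species problem concrete, here is a minimal sketch of a classical ecological estimator applied to hashtag data. It uses the Chao1 richness estimator, which is a standard nonparametric technique swapped in purely for illustration; it is not the statistical framework proposed in the paper, whose details are not given in this abstract. The function name `chao1_estimate` and the toy sample are hypothetical.

```python
from collections import Counter


def chao1_estimate(labels):
    """Estimate the total number of distinct labels (seen plus unseen).

    Uses the Chao1 estimator from ecology as an illustrative stand-in;
    this is NOT the method of the paper summarized above.
    """
    abundances = Counter(labels)                  # occurrences per label ("species")
    freq_of_freq = Counter(abundances.values())   # number of labels seen exactly k times
    s_obs = len(abundances)                       # distinct labels observed in the sample
    f1 = freq_of_freq.get(1, 0)                   # labels seen exactly once (singletons)
    f2 = freq_of_freq.get(2, 0)                   # labels seen exactly twice (doubletons)
    if f2 > 0:
        return s_obs + f1 * f1 / (2 * f2)
    # Bias-corrected form, used when no label is seen exactly twice.
    return s_obs + f1 * (f1 - 1) / 2


# Toy sample (hypothetical data): hashtags observed in a small portion of tweets.
sample = ["#a"] * 3 + ["#b"] * 2 + ["#c"] * 2 + ["#d", "#e", "#f", "#g"]
estimated_total = chao1_estimate(sample)
```

In this toy sample, 7 distinct hashtags are observed, 4 of them once and 2 of them twice, so the estimate is 7 + 4²/(2·2) = 11 hashtags at the global scale. The same function applies unchanged to email senders or words, since it only needs the list of observed labels.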


