Reliable and Efficient Long-Term Twitter Monitoring

05/05/2020
by   Jian Cao, et al.
0

Social media data is now widely used by many academic researchers. However, long-term social media data collection projects, which involve collecting Twitter data from Twitter's public-use APIs, often encounter various issues when they try to collect streaming social media monitoring data from local-area network servers (LANs). In this technical report, we discuss some of the issues that we have encountered in our Twitter data collection project. We present a cloud-based data collection, pre-processing, and archiving infrastructure which we argue mitigates or resolves the problems we have encountered, at minimal cloud-computing costs. We show how this approach works in different cloud computing architectures.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2022

VoynaSlov: A Data Set of Russian Social Media Activity during the 2022 Ukraine-Russia War

In this report, we describe a new data set called VoynaSlov which contai...
research
01/23/2020

The Pushshift Reddit Dataset

Social media data has become crucial to the advancement of scientific un...
research
05/01/2019

Applications of Social Media in Hydroinformatics: A Survey

Floods of research and practical applications employ social media data f...
research
06/06/2020

Social Media Analysis for Crisis Informatics in the Cloud

Social media analysis of disaster events is a critical task in crisis in...
research
03/31/2020

Social Media Mining Toolkit (SMMT)

There has been a dramatic increase in the popularity of utilizing social...
research
02/21/2022

Items from Psychometric Tests as Training Data for Personality Profiling Models of Twitter Users

Machine-learned models for author profiling in social media often rely o...
research
09/26/2014

Recommending Investors for Crowdfunding Projects

To bring their innovative ideas to market, those embarking in new ventur...

Please sign up or login with your details

Forgot password? Click here to reset