Ingesting High-Velocity Streaming Graphs from Social Media Sources

05/20/2019
by Subhasis Dasgupta, et al.

Many data science applications, such as social network analysis, use graphs as their primary form of data. However, acquiring graph-structured data from social media presents several challenges. The first is the high velocity and bursty nature of social media data. The second is that the complexity of the data makes the ingestion process expensive. The third arises when we store the streaming graph data in a graph database: the database is very often unable to sustain the ingestion of high-velocity, high-burst data. We have developed an adaptive buffering mechanism and a graph compression technique that effectively mitigate this problem. A novel aspect of our method is that the adaptive buffering algorithm uses the data rate, the data content, and the CPU resources of the database machine to determine an optimal data ingestion mechanism. We further show that an ingestion-time graph-compression strategy improves the efficiency of data ingestion into the database. We have verified the efficacy of our ingestion optimization strategy through extensive experiments.
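To make the idea concrete, here is a minimal Python sketch of a buffer that batches incoming edges, merges duplicate edges before writing, and resizes its batch according to the database machine's CPU load. It is not the algorithm from the paper: the names `AdaptiveBuffer`, `compress_edges`, `flush`, and `cpu_load`, the thresholds, and the doubling/halving rule are all assumptions chosen for illustration, and the sketch reacts only to CPU load, not to the data rate.

```python
from collections import Counter
from typing import Callable, Iterable, List, Tuple

Edge = Tuple[str, str, str]  # hypothetical edge shape: (source, relation, target)


def compress_edges(edges: Iterable[Edge]) -> List[Tuple[Edge, int]]:
    """Collapse duplicate edges into (edge, count) pairs in first-seen order."""
    return list(Counter(edges).items())


class AdaptiveBuffer:
    """Illustrative buffer whose batch size follows the database CPU load."""

    def __init__(
        self,
        flush: Callable[[List[Tuple[Edge, int]]], None],  # writes one batch
        cpu_load: Callable[[], float],  # returns load in [0, 1]; assumed probe
        min_batch: int = 100,
        max_batch: int = 10_000,
        high_cpu: float = 0.8,  # assumed threshold, not from the paper
        low_cpu: float = 0.5,   # assumed threshold, not from the paper
    ) -> None:
        self.flush = flush
        self.cpu_load = cpu_load
        self.min_batch = min_batch
        self.max_batch = max_batch
        self.high_cpu = high_cpu
        self.low_cpu = low_cpu
        self.batch_size = min_batch
        self.pending: List[Edge] = []

    def add(self, edge: Edge) -> None:
        """Buffer one edge and write the batch once it is full."""
        self.pending.append(edge)
        if len(self.pending) >= self.batch_size:
            self._flush()

    def _flush(self) -> None:
        batch = compress_edges(self.pending)
        self.pending = []
        self.flush(batch)
        load = self.cpu_load()
        if load > self.high_cpu:
            # Database is busy: buffer more so that writes happen less often.
            self.batch_size = min(self.max_batch, self.batch_size * 2)
        elif load < self.low_cpu:
            # Database has headroom: shrink batches to reduce latency.
            self.batch_size = max(self.min_batch, self.batch_size // 2)
```

In use, `flush` would wrap a bulk write to the graph database and `cpu_load` would poll the database host. Growing the batch under high load is one plausible policy among several; it trades ingestion latency for fewer, larger writes.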


