kMatrix: A Space Efficient Streaming Graph Summarization Technique

05/12/2021
by   Oshan Mudannayake, et al.
0

The amount of collected information on data repositories has vastly increased with the advent of the internet. It has become increasingly complex to deal with these massive data streams due to their sheer volume and the throughput of incoming data. Many of these data streams are mapped into graphs, which helps discover some of their properties. However, due to the difficulty in processing massive streaming graphs, they are summarized such that their properties can be approximately evaluated using the summaries. gSketch, TCM, and gMatrix are some of the major streaming graph summarization techniques. Our primary contribution is devising kMatrix, which is much more memory efficient than existing streaming graph summarization techniques. We achieved this by partitioning the allocated memory using a sample of the original graph stream. Through the experiments, we show that kMatrix can achieve a significantly less error for the queries using the same space as that of TCM and gMatrix.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/28/2021

Evaluating Complex Queries on Streaming Graphs

In this paper, we study the problem of evaluating persistent queries ove...
research
11/25/2019

A Foundation of Lazy Streaming Graphs

A streaming graph system continuously processes a stream of operations o...
research
11/17/2019

Rebalancing Learning on Evolving Data Streams

Nowadays, every device connected to the Internet generates an ever-growi...
research
10/29/2016

Diversity Promoting Online Sampling for Streaming Video Summarization

Many applications benefit from sampling algorithms where a small number ...
research
05/13/2022

Detecting Rumours with Latency Guarantees using Massive Streaming Data

Today's social networks continuously generate massive streams of data, w...
research
09/01/2023

Laminar: A New Serverless Stream-based Framework with Semantic Code Search and Code Completion

This paper introduces Laminar, a novel serverless framework based on dis...
research
12/01/2020

Audience and Streamer Participation at Scale on Twitch

Large-scale streaming platforms such as Twitch are becoming increasingly...

Please sign up or login with your details

Forgot password? Click here to reset