Space-Efficient Random Walks on Streaming Graphs

09/13/2022
by   Serafeim Papadias, et al.
0

Graphs in many applications, such as social networks and IoT, are inherently streaming, involving continuous additions and deletions of vertices and edges at high rates. Constructing random walks in a graph, i.e., sequences of vertices selected with a specific probability distribution, is a prominent task in many of these graph applications as well as machine learning (ML) on graph-structured data. In a streaming scenario, random walks need to constantly keep up with the graph updates to avoid stale walks and thus, performance degradation in the downstream tasks. We present Wharf, a system that efficiently stores and updates random walks on streaming graphs. It avoids a potential size explosion by maintaining a compressed, high-throughput, and low-latency data structure. It achieves (i) the succinct representation by coupling compressed purely functional binary trees and pairing functions for storing the walks, and (ii) efficient walk updates by effectively pruning the walk search space. We evaluate Wharf, with real and synthetic graphs, in terms of throughput and latency when updating random walks. The results show the high superiority of Wharf over inverted index- and tree-based baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2021

Simulating Random Walks in Random Streams

The random order graph streaming model has received significant attentio...
research
07/17/2023

From random-walks to graph-sprints: a low-latency node embedding framework on continuous-time dynamic graphs

Many real-world datasets have an underlying dynamic graph structure, whe...
research
04/17/2019

Low-Latency Graph Streaming Using Compressed Purely-Functional Trees

Due to the dynamic nature of real-world graphs, there has been a growing...
research
05/01/2018

Efficient Graph Computation for Node2Vec

Node2Vec is a state-of-the-art general-purpose feature learning method f...
research
05/23/2023

Inverse square Levy walk emerging universally in goal-oriented tasks

The Levy walk in which the frequency of occurrence of step lengths follo...
research
02/20/2020

A Graph-Based Platform for Customer Behavior Analysis using Applications' Clickstream Data

Clickstream analysis is getting more attention since the increase of usa...
research
04/02/2020

RisGraph: A Real-Time Streaming System for Evolving Graphs

Graphs in the real world are constantly changing and of large scale. In ...

Please sign up or login with your details

Forgot password? Click here to reset