Parallel and Streaming Algorithms for K-Core Decomposition

08/07/2018
by   Hossein Esfandiari, et al.
0

The k-core decomposition is a fundamental primitive in many machine learning and data mining applications. We present the first distributed and the first streaming algorithms to compute and maintain an approximate k-core decomposition with provable guarantees. Our algorithms achieve rigorous bounds on space complexity while bounding the number of passes or number of rounds of computation. We do so by presenting a new powerful sketching technique for k-core decomposition, and then by showing it can be computed efficiently in both streaming and MapReduce models. Finally, we confirm the effectiveness of our sketching technique empirically on a number of publicly available graphs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/09/2021

Optimal Space and Time for Streaming Pattern Matching

In this work, we study longest common substring, pattern matching, and w...
research
06/01/2020

Streaming Coresets for Symmetric Tensor Factorization

Factorizing tensors has recently become an important optimization module...
research
10/02/2019

Streaming Balanced Clustering

Clustering of data points in metric space is among the most fundamental ...
research
12/26/2021

K-Core Decomposition on Super Large Graphs with Limited Resources

K-core decomposition is a commonly used metric to analyze graph structur...
research
02/21/2019

Continuous Outlier Mining of Streaming Data in Flink

In this work, we focus on distance-based outliers in a metric space, whe...
research
12/12/2021

Fast computation of distance-generalized cores using sampling

Core decomposition is a classic technique for discovering densely connec...
research
01/15/2019

An Algorithm for Bounding the Probability of r-core Formation in k-uniform Random Hypergraphs

We present an algorithm for bounding the probability of r-core formation...

Please sign up or login with your details

Forgot password? Click here to reset