K-Core Decomposition on Super Large Graphs with Limited Resources

12/26/2021
by Shicheng Gao, et al.

K-core decomposition is widely used to analyze graph structure and to study the relative importance of nodes in complex graphs. Graphs have grown rapidly in scale in recent years, especially in industrial settings. For example, our industrial partner runs popular social applications with billions of users and is able to gather a rich set of user data. As a result, applying K-core decomposition to large graphs has attracted increasing attention from both academia and industry. A simple but effective way to handle large graphs is to process them in a distributed setting, and several distributed K-core decomposition algorithms have been proposed. Despite their effectiveness, we observe both experimentally and theoretically that these algorithms consume too many resources and become unstable on super-large-scale graphs, especially when the given resources are limited. In this paper, we target those super-large-scale graphs and propose a divide-and-conquer strategy on top of the distributed K-core decomposition algorithm. We evaluate our approach on three large graphs. The experimental results show that resource consumption can be significantly reduced, and that computation on large-scale graphs becomes more stable than with the existing methods. For example, with our divide-and-conquer technique, the distributed K-core decomposition algorithm can scale to a large graph with 136 billion edges without losing correctness.
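For readers unfamiliar with the term, the sketch below illustrates what K-core decomposition computes. It is a minimal single-machine peeling routine, not the distributed algorithm or the divide-and-conquer strategy proposed in the paper, neither of which the abstract specifies. The function name `core_numbers` and the edge-list input format are assumptions made for illustration. A node's core number is the largest k such that the node belongs to a subgraph in which every node has at least k neighbors.

```python
from collections import defaultdict


def core_numbers(edges):
    """Return {node: core number} for an undirected graph given as (u, v) pairs.

    Illustrative sketch only: repeatedly removes a node of minimum remaining
    degree. The linear scan for the minimum makes this quadratic in the number
    of nodes, so it is suitable for small examples, not large graphs.
    """
    adj = defaultdict(set)
    for u, v in edges:
        if u != v:  # self-loops are ignored
            adj[u].add(v)
            adj[v].add(u)

    degree = {node: len(nbrs) for node, nbrs in adj.items()}
    remaining = set(adj)
    core = {}
    k = 0
    while remaining:
        # Peel the node with the smallest remaining degree.
        node = min(remaining, key=degree.__getitem__)
        # The core level never decreases as peeling proceeds.
        k = max(k, degree[node])
        core[node] = k
        remaining.remove(node)
        for nbr in adj[node]:
            if nbr in remaining:
                degree[nbr] -= 1
    return core


# A triangle (a, b, c) with one pendant node d attached to c.
example = core_numbers([("a", "b"), ("b", "c"), ("a", "c"), ("c", "d")])
```

In this example the triangle nodes have core number 2 and the pendant node has core number 1, which is the kind of per-node importance signal the abstract refers to.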


research
02/12/2022

Distributed D-core Decomposition over Large Directed Graphs

Given a directed graph G and integers k and l, a D-core is the maximal s...
research
04/28/2005

k-core decomposition: a tool for the visualization of large scale networks

We use the k-core decomposition to visualize large scale complex network...
research
08/07/2018

Parallel and Streaming Algorithms for K-Core Decomposition

The k-core decomposition is a fundamental primitive in many machine lear...
research
12/12/2021

Fast computation of distance-generalized cores using sampling

Core decomposition is a classic technique for discovering densely connec...
research
04/05/2020

An Eigenspace Divide-and-Conquer Approach for Large-Scale Optimization

Divide-and-conquer-based (DC-based) evolutionary algorithms (EAs) have a...
research
12/29/2019

Divide-and-Conquer Large Scale Capacitated Arc Routing Problems with Route Cutting Off Decomposition

The capacitated arc routing problem is a very important problem with man...
research
11/16/2022

Distributed Node Covering Optimization for Large Scale Networks and Its Application on Social Advertising

Combinatorial optimizations are usually complex and inefficient, which l...
