Zuckerli: A New Compressed Representation for Graphs

09/02/2020
by   Luca Versari, et al.
0

Zuckerli is a scalable compression system meant for large real-world graphs. Graphs are notoriously challenging structures to store efficiently due to their linked nature, which makes it hard to separate them into smaller, compact components. Therefore, effective compression is crucial when dealing with large graphs, which can have billions of nodes and edges. Furthermore, a good compression system should give the user fast and reasonably flexible access to parts of the compressed data without requiring full decompression, which may be unfeasible on their system. Zuckerli improves multiple aspects of WebGraph, the current state-of-the-art in compressing real-world graphs, by using advanced compression techniques and novel heuristic graph algorithms. It can produce both a compressed representation for storage and one which allows fast direct access to the adjacency lists of the compressed graph without decompressing the entire graph. We validate the effectiveness of Zuckerli on real-world graphs with up to a billion nodes and 90 billion edges, conducting an extensive experimental evaluation of both compression density and decompression performance. We show that Zuckerli-compressed graphs are 10 and more than 20 comparable to that of WebGraph.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2018

Graph Compression Using Pattern Matching Techniques

Graphs can be used to represent a wide variety of data belonging to diff...
research
06/10/2023

On Improving the Cohesiveness of Graphs by Merging Nodes: Formulation, Analysis, and Algorithms

Graphs are a powerful mathematical model, and they are used to represent...
research
10/02/2018

Graph Compression Using The Regularity Method

We are living in a world which is getting more and more interconnected a...
research
06/15/2018

Efficient Graph Compression Using Huffman Coding Based Techniques

Graphs have been extensively used to represent data from various domains...
research
10/29/2020

Log(Graph): A Near-Optimal High-Performance Graph Representation

Today's graphs used in domains such as machine learning or social networ...
research
06/01/2023

ITR: A grammar-based graph compressor supporting fast neighborhood queries

Neighborhood queries are the most common queries on graphs; thus, it is ...
research
04/29/2018

Variable-Byte Encoding is Now Space-Efficient Too

The ubiquitous Variable-Byte encoding is considered one of the fastest c...

Please sign up or login with your details

Forgot password? Click here to reset