Efficient Graph Compression Using Huffman Coding Based Techniques

Graphs have been extensively used to represent data from various domains. In the era of Big Data, information is being generated at a fast pace, and analyzing the same is a challenge. Various methods have been proposed to speed up the analysis of the data and also mining it for information. All of this often involves using a massive array of compute nodes, and transmitting the data over the network. Of course, with the huge quantity of data, this poses a major issue to the task of gathering intelligence from data. Therefore, in order to address such issues with Big Data, using data compression techniques is a viable option. Since graphs represent most real world data, methods to compress graphs have been in the forefront of such endeavors. In this paper we propose techniques to compress graphs by finding specific patterns and replacing those with identifiers that are of variable length, an idea inspired by Huffman Coding. Specifically, given a graph G = (V, E), where V is the set of vertices and E is the set of edges, and |V| = n, we propose methods to reduce the space requirements of the graph by compressing the adjacency representation of the same. The proposed methods show up to 80 the space required to store the graphs as compared to using the adjacency matrix. The methods can also be applied to other representations as well. The proposed techniques help solve the issues related to computing on the graphs on resources limited compute nodes, as well as reduce the latency for transfer of data over the network in case of distributed computing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/05/2018

Graph Compression Using Pattern Matching Techniques

Graphs can be used to represent a wide variety of data belonging to diff...
research
09/02/2020

Zuckerli: A New Compressed Representation for Graphs

Zuckerli is a scalable compression system meant for large real-world gra...
research
05/28/2021

SLGCN: Structure Learning Graph Convolutional Networks for Graphs under Heterophily

The performances of GNNs for representation learning on the graph-struct...
research
12/20/2022

MDL-based Compressing Sequential Rules

Nowadays, with the rapid development of the Internet, the era of big dat...
research
02/02/2019

Big Data and Geospatial Analysis

Perhaps one of the mostly hotly debated topics in recent years has been ...
research
03/19/2023

Data assimilation for sparsification of reaction diffusion systems in a complex network

The study focuses on complex networks that are underlying graphs with an...
research
12/18/2020

Fast and Efficient Parallel Breadth-First Search with Power-law Graph Transformation

In the big data era, graph computing is widely used to exploit the hidde...

Please sign up or login with your details

Forgot password? Click here to reset