On Random Tree Structures, Their Entropy, and Compression

09/18/2023
by Amirmohammad Farzaneh, et al.

Measuring the complexity of tree structures is useful in areas that rely on tree data structures for storage, communication, and processing. This complexity measure can then be used to compress tree data structures to their information-theoretic limit. In addition, the mathematical modeling of trees and graphs suffers from a lack of models for the random generation of trees. In this paper, a number of existing tree generation models, such as simply generated trees, are discussed, and their information content is analysed using information theory and Shannon's entropy. Subsequently, a new model for generating trees, based on how trees appear in practice, is introduced, and an upper bound on its entropy is calculated. This model selects a random tree from the possible spanning trees of graphs, which is how trees often arise in practice. Moving on to tree compression, we present approaches to universal tree compression for the discussed models. These approaches first transform a tree into a sequence of symbols and then apply a dictionary-based compression method. Conditions for the universality of these methods are then studied and analysed.
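To make the two-stage compression idea concrete, here is a minimal sketch of "tree to symbol sequence, then dictionary-based coding". The abstract does not say which sequence encoding or which dictionary method the paper uses, so both choices below are assumptions made for illustration: a balanced-parenthesis encoding produced by depth-first traversal, and LZ78 parsing as the dictionary coder. The function names `tree_to_sequence`, `lz78_parse`, and `lz78_decode` are hypothetical.

```python
from typing import Dict, List, Tuple


def tree_to_sequence(children: Dict[int, List[int]], root: int) -> str:
    """Encode a rooted ordered tree as a balanced-parenthesis string (assumed encoding)."""
    out: List[str] = []
    stack: List[Tuple[int, bool]] = [(root, False)]
    while stack:
        node, closing = stack.pop()
        if closing:
            out.append(")")  # leaving the node's subtree
            continue
        out.append("(")  # entering the node
        stack.append((node, True))
        # Push children in reverse so they are visited left to right.
        for child in reversed(children.get(node, [])):
            stack.append((child, False))
    return "".join(out)


def lz78_parse(seq: str) -> List[Tuple[int, str]]:
    """LZ78 parsing (assumed dictionary method): phrases are (prefix index, next symbol)."""
    dictionary: Dict[str, int] = {"": 0}
    phrases: List[Tuple[int, str]] = []
    current = ""
    for symbol in seq:
        if current + symbol in dictionary:
            current += symbol  # keep extending a known phrase
        else:
            phrases.append((dictionary[current], symbol))
            dictionary[current + symbol] = len(dictionary)
            current = ""
    if current:
        # Leftover input matches an existing phrase; emit it with an empty symbol.
        phrases.append((dictionary[current], ""))
    return phrases


def lz78_decode(phrases: List[Tuple[int, str]]) -> str:
    """Invert lz78_parse by rebuilding the phrase dictionary in order."""
    entries: List[str] = [""]
    out: List[str] = []
    for index, symbol in phrases:
        phrase = entries[index] + symbol
        entries.append(phrase)
        out.append(phrase)
    return "".join(out)


# Hypothetical example tree: node 0 has children 1 and 2; node 1 has child 3.
example_tree = {0: [1, 2], 1: [3]}
sequence = tree_to_sequence(example_tree, 0)
phrases = lz78_parse(sequence)
```

For the example tree, the sequence is `((())())`, and decoding the parsed phrases reproduces it exactly. On a tree this small the phrase list is not shorter than the input; dictionary methods of this kind only pay off on long sequences, which is the setting in which universality is normally discussed.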

research
04/27/2021

Hypersuccinct Trees – New universal tree source codes for optimal compressed tree data structures

We present a new universal source code for unlabeled binary and ordinal ...
research
03/05/2020

Order-Preserving Key Compression for In-Memory Search Trees

We present the High-speed Order-Preserving Encoder (HOPE) for in-memory ...
research
10/17/2019

EvoZip: Efficient Compression of Large Collections of Evolutionary Trees

Phylogenetic trees represent evolutionary relationships among sets of or...
research
10/21/2019

Reasoning About Recursive Tree Traversals

Traversals are commonly seen in tree data structures, and performance-en...
research
01/27/2022

Generation Matrix: An Embeddable Matrix Representation for Hierarchical Trees

Starting from the local structures to study hierarchical trees is a comm...
research
04/03/2017

Optimizing Communication by Compression for Multi-GPU Scalable Breadth-First Searches

The Breadth First Search (BFS) algorithm is the foundation and building ...
research
10/26/2018

Lossless (and Lossy) Compression of Random Forests

Ensemble methods are among the state-of-the-art predictive modeling appr...