Grammar-based Compression of Unranked Trees

02/15/2018
by   Adrià Gascón, et al.
0

We introduce forest straight-line programs (FSLPs) as a compressed representation of unranked ordered node-labelled trees. FSLPs are based on the operations of forest algebra and generalize tree straight-line programs. We compare the succinctness of FSLPs with two other compression schemes for unranked trees: top dags and tree straight-line programs of first-child/next sibling encodings. Efficient translations between these formalisms are provided. Finally, we show that equality of unranked trees in the setting where certain symbols are associative or commutative can be tested in polynomial time. This generalizes previous results for testing isomorphism of compressed unordered ranked trees.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/10/2019

Entropy Bounds for Grammar-Based Tree Compressors

The definition of k^th-order empirical entropy of strings is extended to...
research
02/10/2019

Balancing Straight-Line Programs

It is shown that a context-free grammar of size m that produces a single...
research
10/17/2019

EvoZip: Efficient Compression of Large Collections of Evolutionary Trees

Phylogenetic trees represent evolutionary relationships among sets of or...
research
04/02/2019

A rearrangement distance for fully-labelled trees

The problem of comparing trees representing the evolutionary histories o...
research
04/11/2020

Grammar-compressed Self-index with Lyndon Words

We introduce a new class of straight-line programs (SLPs), named the Lyn...
research
12/15/2017

Optimal top dag compression

It is shown that for a given ordered node-labelled tree of size n and wi...
research
02/22/2019

On Transforming Narrowing Trees into Regular Tree Grammars Generating Ranges of Substitutions

The grammar representation of a narrowing tree for a syntactically deter...

Please sign up or login with your details

Forgot password? Click here to reset