Optimal LZ-End Parsing is Hard

02/06/2023
by Hideo Bannai, et al.

LZ-End is a variant of the well-known Lempel-Ziv parsing family in which each phrase of the parsing has a previous occurrence, with the additional constraint that the previous occurrence must end at the end of a previous phrase. LZ-End was initially proposed as a greedy parsing, where each phrase is determined greedily from left to right as the longest factor that satisfies the above constraint [Kreft and Navarro, 2010]. In this work, we consider an optimal LZ-End parsing, that is, one with the minimum number of phrases among all parsings that satisfy the constraint. We show that a decision version of computing the optimal LZ-End parsing is NP-complete by a reduction from the vertex cover problem. Moreover, we give a MAX-SAT formulation for the optimal LZ-End parsing, adapting an approach for computing various NP-hard repetitiveness measures recently presented by [Bannai et al., 2022]. We also consider the approximation ratio of the size of the greedy LZ-End parsing to the size of the optimal LZ-End parsing, and give a lower bound on this ratio that asymptotically approaches 2.
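
To make the constraint concrete, here is a minimal Python sketch of a greedy LZ-End parser and an exhaustive search for a minimum-size parsing. It is an illustration under stated assumptions, not the method of the paper: it assumes the common formulation in which every phrase is a (possibly empty) copy that ends at a previous phrase boundary, followed by one explicit character, and the function names `is_valid_phrase`, `greedy_lz_end`, and `optimal_lz_end` are hypothetical. The exhaustive search takes exponential time and is only usable on very short strings; it is neither the vertex cover reduction nor the MAX-SAT formulation.

```python
def is_valid_phrase(text, start, end, ends):
    """Check whether text[start:end] is an admissible phrase.

    Assumption: all characters except the last must form an empty string
    or a suffix of text[:e] for some earlier phrase end e in `ends`.
    """
    body = text[start:end - 1]
    if not body:
        return True  # a single explicit character is always allowed
    return any(text[:e].endswith(body) for e in ends)


def greedy_lz_end(text):
    """Parse left to right, always taking the longest admissible phrase."""
    phrases, ends, start = [], [], 0
    while start < len(text):
        best = start + 1
        # Admissibility is not monotone in the length, so try every end.
        for end in range(start + 1, len(text) + 1):
            if is_valid_phrase(text, start, end, ends):
                best = end
        phrases.append(text[start:best])
        ends.append(best)
        start = best
    return phrases


def optimal_lz_end(text):
    """Exhaustive search for a parsing with the fewest phrases (tiny inputs only)."""
    best = None

    def search(start, ends, phrases):
        nonlocal best
        if best is not None and len(phrases) >= len(best):
            return  # cannot improve on the best parsing found so far
        if start == len(text):
            best = list(phrases)
            return
        for end in range(len(text), start, -1):
            if is_valid_phrase(text, start, end, ends):
                search(end, ends + [end], phrases + [text[start:end]])

    search(0, [], [])
    return best if best is not None else []


if __name__ == "__main__":
    for s in ["aaaa", "abab", "abaababaab"]:
        print(s, greedy_lz_end(s), optimal_lz_end(s))
```

Comparing the lengths of the two returned lists on small inputs gives a hands-on view of the ratio between greedy and optimal parsing sizes that the abstract discusses.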
