Fast Lempel-Ziv Decompression in Linear Space

02/28/2018
by   Philip Bille, et al.
0

We consider the problem of decompressing the Lempel-Ziv 77 representation of a string S∈[σ]^n using a working space as close as possible to the size z of the input. The folklore solution for the problem runs in optimal O(n) time but requires random access to the whole decompressed text. A better solution is to convert LZ77 into a grammar of size O(z(n/z)) and then stream S in optimal linear time. In this paper, we show that O(n) time and O(z) working space can be achieved for constant-size alphabets. On larger alphabets, we describe (i) a trade-off achieving O(n^δσ) time and O(z^1-δσ) space for any 0≤δ≤ 1, and (ii) a solution achieving optimal O(n) time and O(z n) space. Our solutions can, more generally, extract any specified subsequence of S with little overheads on top of the optimal running time and working space. As an immediate corollary, we show that our techniques yield improved results for pattern matching problems on LZ77-compressed text.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/13/2017

Closing in on Time and Space Optimal Construction of Compressed Indexes

Fast and space-efficient construction of compressed indexes such as comp...
research
03/10/2020

Optimal-size problem kernels for d-Hitting Set in linear time and space

We improve two linear-time data reduction algorithms for the d-Hitting S...
research
11/30/2018

Faster Attractor-Based Indexes

String attractors are a novel combinatorial object encompassing most kno...
research
08/12/2019

Space-Efficient Construction of Compressed Suffix Trees

We show how to build several data structures of central importance to st...
research
03/28/2022

Improving Matrix-vector Multiplication via Lossless Grammar-Compressed Matrices

As nowadays Machine Learning (ML) techniques are generating huge data co...
research
11/20/2017

On estimating the alphabet size of a discrete random source

We are concerned with estimating alphabet size N from a stream of symbol...
research
12/02/2018

Sequence Searching Allowing for Non-Overlapping Adjacent Unbalanced Translocations

Unbalanced translocations are among the most frequent chromosomal altera...

Please sign up or login with your details

Forgot password? Click here to reset