O(n n)-time text compression by LZ-style longest first substitution

06/13/2018
by   Akihiro Nishi, et al.
0

Mauer et al. [A Lempel-Ziv-style Compression Method for Repetitive Texts, PSC 2017] proposed a hybrid text compression method called LZ-LFS which has both features of Lempel-Ziv 77 factorization and longest first substitution. They showed that LZ-LFS can achieve better compression ratio for repetitive texts, compared to some state-of-the-art compression algorithms. The drawback of Mauer et al.'s method is that their LZ-LFS compression algorithm takes O(n^2) time on an input string of length n. In this paper, we show a faster LZ-LFS compression algorithm that works in O(n n) time. We also propose a simpler version of LZ-LFS that can be computed in O(n) time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/08/2017

A Grammar Compression Algorithm based on Induced Suffix Sorting

We introduce GCIS, a grammar compression algorithm based on the induced ...
research
01/13/2022

Optimal alphabet for single text compression

A text can be viewed via different representations, i.e. as a sequence o...
research
12/02/2015

Annotating Character Relationships in Literary Texts

We present a dataset of manually annotated relationships between charact...
research
02/14/2014

Authorship Analysis based on Data Compression

This paper proposes to perform authorship analysis using the Fast Compre...
research
07/07/2019

Bidirectional Text Compression in External Memory

Bidirectional compression algorithms work by substituting repeated subst...
research
10/02/2018

Graph Compression Using The Regularity Method

We are living in a world which is getting more and more interconnected a...
research
12/29/2020

A fast algorithm of the shear-compression damage model for the simulation of block caving

For underground mine, the current usual technique for ore extraction is ...

Please sign up or login with your details

Forgot password? Click here to reset