L-systems for Measuring Repetitiveness*

06/03/2022
by Gonzalo Navarro, et al.

An L-system (for lossless compression) is a CPD0L-system extended with two parameters d and n; together these unambiguously determine a string w = τ(φ^d(s))[1:n], where φ is the morphism of the system, s is its axiom, and τ is its coding. The length of the shortest description of an L-system generating w is denoted ℓ, and it is arguably a relevant measure of repetitiveness that builds on the self-similarities arising in the sequence. In this paper we deepen the study of the measure ℓ and its relation with δ, a better-established lower bound that builds on substring complexity. Our results show that ℓ and δ are largely orthogonal, in the sense that either one can be much larger than the other, depending on the case. This suggests that the two sources of repetitiveness are mostly unrelated. We also show that the recently introduced NU-systems, which combine the capabilities of L-systems with bidirectional macro-schemes, can be asymptotically strictly smaller than both mechanisms, which makes the size ν of the smallest NU-system the unique smallest reachable repetitiveness measure to date.
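
As an illustration of the definition w = τ(φ^d(s))[1:n], the following is a minimal sketch, not the authors' implementation. It applies the morphism d times to the axiom, applies the coding, and keeps the first n symbols. The function name l_system_string, the dictionary encoding of φ and τ, and the Thue–Morse example are assumptions chosen for illustration; the expansion is done naively, so the intermediate string can grow exponentially in d.

```python
def l_system_string(phi, tau, axiom, d, n):
    """Return tau(phi^d(axiom))[1:n], i.e. the length-n prefix (1-based, inclusive).

    phi   -- dict mapping each symbol to its image under the morphism (hypothetical encoding)
    tau   -- dict mapping each symbol to a single output symbol (the coding)
    axiom -- the starting string s
    d     -- number of times the morphism is applied
    n     -- length of the prefix to keep
    """
    current = axiom
    for _ in range(d):
        # Apply the morphism once: replace every symbol by its image.
        current = "".join(phi[c] for c in current)
    # Apply the coding symbol by symbol.
    coded = "".join(tau[c] for c in current)
    if n > len(coded):
        raise ValueError("n exceeds the length of tau(phi^d(axiom))")
    return coded[:n]


# Example (assumed for illustration): the Thue-Morse morphism a -> ab, b -> ba,
# with axiom "a" and coding a -> 0, b -> 1.
PHI = {"a": "ab", "b": "ba"}
TAU = {"a": "0", "b": "1"}
print(l_system_string(PHI, TAU, "a", d=3, n=6))  # prints 011010
```

Here φ^3(a) = abbabaab, the coding gives 01101001, and the parameter n = 6 trims it to 011010. The whole string is thus described by φ, τ, the axiom, and the two integers d and n, which is the kind of description whose size ℓ measures.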

research
05/28/2021

On Stricter Reachable Repetitiveness Measures*

The size b of the smallest bidirectional macro scheme, which is arguably...
research
10/04/2019

Towards a Definitive Measure of Repetitiveness

Unlike in statistical compression, where Shannon's entropy is a definiti...
research
04/19/2021

A Separation of γ and b via Thue–Morse Words

We prove that for n≥ 2, the size b(t_n) of the smallest bidirectional sc...
research
06/01/2022

String Attractors and Infinite Words

The notion of string attractor has been introduced in [Kempa and Prezza,...
research
07/06/2022

Computing NP-hard Repetitiveness Measures via MAX-SAT

Repetitiveness measures reveal profound characteristics of datasets, and...
research
10/30/2017

At the Roots of Dictionary Compression: String Attractors

A well-known fact in the field of lossless text compression is that high...
research
07/05/2023

Compressibility measures for two-dimensional data

In this paper we extend to two-dimensional data two recently introduced ...