String Attractors for Automatic Sequences

12/12/2020
by Luke Schaeffer, et al.

We show that it is decidable, given an automatic sequence s and a constant c, whether all prefixes of s have a string attractor of size ≤ c. Using a decision procedure based on this result, we show that every prefix of length ≥ 2 of the period-doubling sequence has a string attractor of size 2. We prove analogous results for other sequences, including the Thue-Morse sequence and the Tribonacci sequence. We also provide general upper and lower bounds on string attractor size for different kinds of sequences. For example, if s has a finite appearance constant, then there is a string attractor for s[0..n-1] of size O(log n). If, further, s is linearly recurrent, then there is a string attractor for s[0..n-1] of size O(1). For automatic sequences, the size of the smallest string attractor for s[0..n-1] is either Θ(1) or Θ(log n), and it is decidable which case occurs. Finally, we close with some remarks about greedy string attractors.
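
To make the period-doubling claim concrete, here is a minimal brute-force sketch in Python. It is not the decision procedure described in the abstract, only an exhaustive check on short prefixes. It assumes the usual definition of a string attractor, which the abstract does not restate: a set of positions such that every nonempty substring has at least one occurrence covering one of those positions. It also assumes the period-doubling sequence is the fixed point of the morphism 1 → 10, 0 → 11. The function names are hypothetical and chosen for illustration.

```python
from itertools import combinations


def period_doubling_prefix(n):
    """Length-n prefix of the fixed point of 1 -> 10, 0 -> 11 (assumed definition)."""
    morphism = {"1": "10", "0": "11"}
    w = "1"
    while len(w) < n:
        w = "".join(morphism[c] for c in w)
    return w[:n]


def is_attractor(w, positions):
    """True if every distinct nonempty substring of w has an occurrence
    w[i:j] that covers some position p in positions (i <= p < j)."""
    pos = set(positions)
    all_substrings = set()
    covered = set()
    for i in range(len(w)):
        for j in range(i + 1, len(w) + 1):
            u = w[i:j]
            all_substrings.add(u)
            if any(i <= p < j for p in pos):
                covered.add(u)
    return covered == all_substrings


def smallest_attractor_size(w):
    """Exhaustive search over position sets; exponential, for short strings only."""
    for k in range(len(w) + 1):
        for candidate in combinations(range(len(w)), k):
            if is_attractor(w, candidate):
                return k
    return len(w)


if __name__ == "__main__":
    for n in range(2, 13):
        prefix = period_doubling_prefix(n)
        print(n, prefix, smallest_attractor_size(prefix))
```

Any string containing two distinct letters needs an attractor of size at least 2, so for these prefixes the search confirms that the bound of 2 stated in the abstract is also the minimum. The exhaustive search is only practical for short prefixes; it illustrates the definition rather than the automata-based approach the abstract refers to.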

research
04/19/2021

A Separation of γ and b via Thue–Morse Words

We prove that for n≥ 2, the size b(t_n) of the smallest bidirectional sc...
research
08/17/2021

Arbitrary-length analogs to de Bruijn sequences

Let α be a length-L cyclic sequence of characters from a size-K alphabet...
research
06/30/2023

Should you marginalize over possible tokenizations?

Autoregressive language models (LMs) map token sequences to probabilitie...
research
06/28/2022

Subsequences With Gap Constraints: Complexity Bounds for Matching and Analysis Problems

We consider subsequences with gap constraints, i.e., length-k subsequenc...
research
06/27/2022

Balancing Run-Length Straight-Line Programs*

It was recently proved that any SLP generating a given string w can be t...
research
03/09/2020

Smoothed Analysis of Trie Height by Star-like PFAs

Tries are general purpose data structures for information retrieval. The...
research
03/23/2023

On Constant-Weight Binary B_2-Sequences

Motivated by applications in polymer-based data storage we introduced th...
