When a Dollar Makes a BWT

08/24/2019
by   Sara Giuliani, et al.
0

The Burrows-Wheeler-Transform (BWT) is a reversible string transformation which plays a central role in text compression and is fundamental in many modern bioinformatics applications. The BWT is a permutation of the characters, which is in general better compressible and allows to answer several different query types more efficiently than the original string. It is easy to see that not every string is a BWT image, and exact characterizations of BWT images are known. We investigate a related combinatorial question. In many applications, a sentinel character is added to mark the end of the string, and thus the BWT of a string ending with contains exactly one character. We ask, given a string w, in which positions, if any, can the-character be inserted to turn w into the BWT image of a word ending with the sentinel character. We show that this depends only on the standard permutation of w and give a combinatorial characterization of such positions via this permutation. We then develop an O(n log n)-time algorithm for identifying all such positions, improving on the naive quadratic time algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/05/2018

String Attractors: Verification and Optimization

String attractors [STOC 2018] are combinatorial objects recently introdu...
research
12/22/2022

On the growth rate of polyregular functions

We consider polyregular functions, which are certain string-to-string fu...
research
11/08/2020

Scout Algorithm For Fast Substring Matching

Exact substring matching is a common task in many software applications....
research
07/10/2019

String Attractors and Combinatorics on Words

The notion of string attractor has recently been introduced in [Prezza, ...
research
06/01/2022

String Attractors and Infinite Words

The notion of string attractor has been introduced in [Kempa and Prezza,...
research
10/25/2019

Massively Parallel Algorithms for String Matching with Wildcards

We study distributed algorithms for string matching problem in presence ...
research
06/19/2019

Learning with Partially Ordered Representations

This paper examines the characterization and learning of grammars define...

Please sign up or login with your details

Forgot password? Click here to reset