Linear Time Online Algorithms for Constructing Linear-size Suffix Trie

01/11/2023
by   Diptarama Hendrian, et al.
0

The suffix trees are fundamental data structures for various kinds of string processing. The suffix tree of a text string T of length n has O(n) nodes and edges, and the string label of each edge is encoded by a pair of positions in T. Thus, even after the tree is built, the input string T needs to be kept stored and random access to T is still needed. The linear-size suffix tries (LSTs), proposed by Crochemore et al. [Linear-size suffix tries, TCS 638:171-178, 2016], are a "stand-alone" alternative to the suffix trees. Namely, the LST of an input text string T of length n occupies O(n) total space, and supports pattern matching and other tasks with the same efficiency as the suffix tree without the need to store the input text string T. Crochemore et al. proposed an offline algorithm which transforms the suffix tree of T into the LST of T in O(n logσ) time and O(n) space, where σ is the alphabet size. In this paper, we present two types of online algorithms which "directly" construct the LST, from right to left, and from left to right, without constructing the suffix tree as an intermediate structure. Both algorithms construct the LST incrementally when a new symbol is read, and do not access the previously read symbols. Both of the right-to-left construction algorithm and the left-to-right construction algorithm work in O(n logσ) time and O(n) space. The main feature of our algorithms is that the input text string does not need to be stored.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2019

Online Algorithms for Constructing Linear-size Suffix Trie

The suffix trees are fundamental data structures for various kinds of st...
research
07/04/2023

Linear-time Computation of DAWGs, Symmetric Indexing Structures, and MAWs for Integer Alphabets

The directed acyclic word graph (DAWG) of a string y of length n is the ...
research
10/05/2022

Computing maximal generalized palindromes

Palindromes are popular and important objects in textual data processing...
research
09/26/2022

Inferring strings from position heaps in linear time

Position heaps are index structures of text strings used for the exact s...
research
04/09/2019

Suffix Trees, DAWGs and CDAWGs for Forward and Backward Tries

The suffix tree, DAWG, and CDAWG are fundamental indexing structures of ...
research
11/25/2020

Left Lyndon tree construction

We extend the left-to-right Lyndon factorisation of a word to the left L...
research
08/11/2023

Breaking a Barrier in Constructing Compact Indexes for Parameterized Pattern Matching

A parameterized string (p-string) is a string over an alphabet (Σ_s∪Σ_p)...

Please sign up or login with your details

Forgot password? Click here to reset