Compact Data Structures for Shortest Unique Substring Queries

05/30/2019
by   Takuya Mieno, et al.
0

Given a string T of length n, a substring u = T[i.. j] of T is called a shortest unique substring (SUS) for an interval [s, t] if (a) u occurs exactly once in T, (b) u contains the interval [s, t] (i.e. i ≤ s ≤ t ≤ j), and (c) every substring v of T with |v| < |u| containing [s, t] occurs at least twice in T. Given a query interval [s, t] ⊂ [1, n], the interval SUS problem is to output all the SUSs for the interval [s, t]. In this article, we propose a 4n + o(n) bits data structure answering an interval SUS query in output-sensitive O(occ) time, where occ is the number of returned SUSs. Additionally, we focus on the point SUS problem, which is the interval SUS problem for s = t. Here, we propose a (log2 3 + 1)n + o(n) bits data structure answering a point SUS query in the same output-sensitive time.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/14/2019

Shortest Unique Palindromic Substring Queries on Run-Length Encoded Strings

For a string S, a palindromic substring S[i..j] is said to be a shortest...
research
04/15/2022

Shortest Unique Palindromic Substring Queries in Semi-dynamic Settings

A palindromic substring T[i.. j] of a string T is said to be a shortest ...
research
02/25/2019

Succinct Data Structures for Families of Interval Graphs

We consider the problem of designing succinct data structures for interv...
research
09/28/2022

Shortest Beer Path Queries in Interval Graphs

Our interest is in paths between pairs of vertices that go through at le...
research
06/13/2022

KATKA: A KRAKEN-like tool with k given at query time

We describe a new tool, KATKA, that stores a phylogenetic tree T such th...
research
08/28/2020

Cache-Efficient Sweeping-Based Interval Joins for Extended Allen Relation Predicates (Extended Version)

We develop a family of efficient plane-sweeping interval join algorithms...
research
05/25/2021

Minimal unique palindromic substrings after single-character substitution

A palindrome is a string that reads the same forward and backward. A pal...

Please sign up or login with your details

Forgot password? Click here to reset