Pattern Discovery in Colored Strings

04/09/2020
by Zsuzsanna Lipták, et al.

We consider the problem of identifying patterns of interest in colored strings. A colored string is a string in which each position is colored with one of a finite set of colors. Our task is to find substrings whose every occurrence is followed by the same color at the same distance. The problem is motivated by applications in embedded systems verification, in particular assertion mining, where the goal is to automatically infer properties of an embedded system by analyzing its simulation traces. We show that the number of interesting patterns is upper-bounded by O(n^2), where n is the length of the string. We introduce a baseline algorithm with O(n^2) running time which identifies, for all colors in the string, all interesting patterns satisfying certain minimality conditions. When one is interested in patterns related to only one color, we provide an algorithm that identifies patterns in O(n^2 log n) time but is faster than the first algorithm in practice, both on simulated and on real-world patterns.
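To make the problem statement concrete, here is a minimal brute-force sketch in Python. It is not the baseline algorithm or the single-color algorithm from the paper; it only illustrates what "always followed by the same color at the same distance" can mean. The function name `find_patterns`, the convention that distance is counted from the last character of an occurrence, and the rule that an occurrence whose target position falls outside the string disqualifies the pattern are all assumptions made for this illustration. The sketch also ignores the minimality conditions mentioned in the abstract.

```python
from collections import defaultdict


def find_patterns(text, colors, distance):
    """Return {pattern: color} for every substring of `text` whose occurrences
    are all followed by the same color exactly `distance` positions after the
    occurrence's last character.

    Assumptions (illustrative only, not taken from the paper):
      * colors[i] is the color of position i of text;
      * an occurrence whose target position lies beyond the end of the
        string disqualifies the pattern.

    Brute force over all substrings: roughly cubic time, for illustration.
    """
    if len(text) != len(colors):
        raise ValueError("text and colors must have the same length")
    if distance < 1:
        raise ValueError("distance must be at least 1")

    n = len(text)
    observed = defaultdict(set)  # pattern -> colors seen at the target position
    for start in range(n):
        for end in range(start + 1, n + 1):
            target = end - 1 + distance
            # None marks an occurrence whose target falls off the string.
            observed[text[start:end]].add(colors[target] if target < n else None)

    return {
        pattern: next(iter(seen))
        for pattern, seen in observed.items()
        if len(seen) == 1 and None not in seen
    }


if __name__ == "__main__":
    # Each character of the second string is the color of the same position
    # in the first string.
    print(find_patterns("abab", "xyxy", 1))
```

On the toy input above, the substring `a` qualifies because both of its occurrences are followed by color `y` at distance 1, whereas `b` does not because its second occurrence has no position at distance 1 inside the string.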


