Heuristic Algorithm for Generalized Function Matching

08/05/2019
by   Radu Stefan Mincu, et al.
0

The problem of generalized function matching can be defined as follows: given a pattern p=p_1 ... p_m and a text t=t_1 ... t_n, find a mapping f:Σ_p→Σ_t^* and all text locations i such that f(p_1) f(p_2) ... f(p_m) = t_i ... t_j, a substring of t. By modifying the restrictions of the matching function f, one can obtain different matching problems, many of which have important applications. When f:Σ_p→Σ_t we are faced with problems found in the well-established field of combinatorial pattern matching. If the single character constraint is lifted and f:Σ_p→Σ_t^*, we obtain generalized function matching as introduced by Amir and Nor (JDA 2007). If we further constrain f to be injective, then we arrive at generalized parametrized matching as defined by Clifford et al. (SPIRE 2009). There are a number of important applications for pattern matching in computational biology, text editors and data compression, to name a few. Therefore, many efficient algorithms have been developed for a wide variety of specific problems including finding tandem repeats in DNA sequences, optimizing embedded systems by reusing code etc. In this work we present a heuristic algorithm illustrating a practical approach to tackling a variant of generalized function matching where f:Σ_p→Σ_t^+ and demonstrate its performance on human-produced text as well as random strings.

READ FULL TEXT
research
06/19/2023

Efficient Parameterized Pattern Matching in Sublinear Space

The parameterized matching problem is a variant of string matching, whic...
research
05/22/2019

Cartesian Tree Matching and Indexing

We introduce a new metric of match, called Cartesian tree matching, whic...
research
07/17/2022

On the Practical Power of Automata in Pattern Matching

The classical pattern matching paradigm is that of seeking occurrences o...
research
02/27/2022

Parallel algorithm for pattern matching problems under substring consistent equivalence relations

Given a text and a pattern over an alphabet, the pattern matching proble...
research
08/31/2022

Pattern matching under DTW distance

In this work, we consider the problem of pattern matching under the dyna...
research
09/17/2019

Generalized Dictionary Matching under Substring Consistent Equivalence Relations

Given a set of patterns called a dictionary and a text, the dictionary m...
research
04/20/2022

Fast Circular Pattern Matching

The Exact Circular Pattern Matching (ECPM) problem consists of reporting...

Please sign up or login with your details

Forgot password? Click here to reset