Approximating Text-to-Pattern Distance via Dimensionality Reduction

02/09/2020
by Przemysław Uznański, et al.

Text-to-pattern distance is a fundamental problem in string matching: given a pattern of length m and a text of length n, both over an integer alphabet, we are asked to compute the distance between the pattern and the text at every location. The distance function can be, e.g., the Hamming distance or the ℓ_p distance for some parameter p > 0. Almost all state-of-the-art exact and approximate algorithms developed in the past ∼40 years have used the FFT as a black box. In this work we present O(n/ε^2)-time algorithms for (1±ε)-approximation of ℓ_2 distances, and an O(n/ε^3)-time algorithm for approximation of Hamming and ℓ_1 distances, all without the use of FFT. This work is independent of the very recent development by Chan et al. [STOC 2020], where an O(n/ε^2)-time algorithm for Hamming distances not using FFT was presented. Although their algorithm is much more "combinatorial", our techniques apply to norms other than Hamming.
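
To make the problem statement concrete, here is a minimal Python sketch of the naive exact baseline, which compares the pattern against every alignment of the text in O(nm) time. It is not the algorithm from the paper and involves no approximation or dimensionality reduction. The function names are hypothetical, and taking the p-th root in the ℓ_p distance is an assumption (some formulations work with the p-th power of the distance instead).

```python
from typing import List, Sequence


def hamming_distances(text: Sequence[int], pattern: Sequence[int]) -> List[int]:
    """Number of mismatches between the pattern and each length-m window of the text."""
    n, m = len(text), len(pattern)
    return [
        sum(1 for j in range(m) if text[i + j] != pattern[j])
        for i in range(n - m + 1)
    ]


def lp_distances(text: Sequence[int], pattern: Sequence[int], p: float) -> List[float]:
    """l_p distance (with the p-th root, an assumption) at each alignment, for p > 0."""
    n, m = len(text), len(pattern)
    return [
        sum(abs(text[i + j] - pattern[j]) ** p for j in range(m)) ** (1.0 / p)
        for i in range(n - m + 1)
    ]


if __name__ == "__main__":
    text = [1, 2, 3, 4, 5]
    pattern = [2, 3, 5]
    print(hamming_distances(text, pattern))  # one value per alignment
    print(lp_distances(text, pattern, 1))
    print(lp_distances(text, pattern, 2))
```

Each function returns n - m + 1 values, one per alignment. A (1±ε)-approximation algorithm, as described in the abstract, would return values within a factor of 1±ε of these.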

research
10/03/2018

Approximating Approximate Pattern Matching

Given a text T of length n and a pattern P of length m, the approximate ...
research
01/01/2020

Approximating Text-to-Pattern Hamming Distances

We revisit a fundamental problem in string matching: given a pattern of ...
research
05/13/2021

The Dynamic k-Mismatch Problem

The text-to-pattern Hamming distances problem asks to compute the Hammin...
research
01/28/2018

Faster Approximate(d) Text-to-Pattern L1 Distance

The problem of finding distance between pattern of length m and text of ...
research
11/10/2017

Hamming distance completeness and sparse matrix multiplication

We investigate relations between (+,) vector products for binary integer...
research
11/06/2017

FAMOUS: Fast Approximate string Matching using OptimUm search Schemes

Finding approximate occurrences of a pattern in a text using a full-text...
research
06/21/2018

Hardness and algorithmic results for the approximate cover problem

In CPM 2017, Amir et al. introduce a problem, named approximate string c...