Indexed Dynamic Programming to boost Edit Distance and LCSS Computation

06/12/2018
by   Jérémy Barbay, et al.
0

There are efficient dynamic programming solutions to the computation of the Edit Distance from S∈[1..σ]^n to T∈[1..σ]^m, for many natural subsets of edit operations, typically in time within O(nm) in the worst-case over strings of respective lengths n and m (which is likely to be optimal), and in time within O(n+m) in some special cases (e.g. disjoint alphabets). We describe how indexing the strings (in linear time), and using such an index to refine the recurrence formulas underlying the dynamic programs, yield faster algorithms in a variety of models, on a continuum of classes of instances of intermediate difficulty between the worst and the best case, thus refining the analysis beyond the worst case analysis. As a side result, we describe similar properties for the computation of the Longest Common Sub Sequence LCSS(S,T) between S and T, since it is a particular case of Edit Distance, and we discuss the application of similar algorithmic and analysis techniques for other dynamic programming solutions. More formally, we propose a parameterized analysis of the computational complexity of the Edit Distance for various set of operators and of the Longest Common Sub Sequence in function of the area of the dynamic program matrix relevant to the computation.

READ FULL TEXT
research
05/17/2020

Towards Efficient Interactive Computation of Dynamic Time Warping Distance

The dynamic time warping (DTW) is a widely-used method that allows us to...
research
06/04/2018

Adaptive Computation of the Discrete Fréchet Distance

The discrete Fréchet distance is a measure of similarity between point s...
research
04/20/2022

Theoretical analysis of edit distance algorithms: an applied perspective

Given its status as a classic problem and its importance to both theoret...
research
07/06/2020

Near-Linear Time Edit Distance for Indel Channels

We consider the following model for sampling pairs of strings: s_1 is a ...
research
07/31/2020

Improved Time Warp Edit Distance – A Parallel Dynamic Program in Linear Memory

Edit Distance is a classic family of dynamic programming problems, among...
research
02/09/2018

Neural Dynamic Programming for Musical Self Similarity

We present a neural sequence model designed specifically for symbolic mu...
research
10/08/2018

Parallelisation of a Common Changepoint Detection Method

In recent years, various means of efficiently detecting changepoints in ...

Please sign up or login with your details

Forgot password? Click here to reset