Theoretical analysis of edit distance algorithms: an applied perspective

04/20/2022
by   Paul Medvedev, et al.
0

Given its status as a classic problem and its importance to both theoreticians and practitioners, edit distance provides an excellent lens through which to understand how the theoretical analysis of algorithms impacts practical implementations. From an applied perspective, the goals of theoretical analysis are to predict the empirical performance of an algorithm and to serve as a yardstick to design novel algorithms that perform well in practice. In this paper, we systematically survey the types of theoretical analysis techniques that have been applied to edit distance and evaluate the extent to which each one has achieved these two goals. These techniques include traditional worst-case analysis, worst-case analysis parametrized by edit distance or entropy or compressibility, average-case analysis, semi-random models, and advice-based models. We find that the track record is mixed. On one hand, two algorithms widely used in practice have been born out of theoretical analysis and their empirical performance is captured well by theoretical predictions. On the other hand, all the algorithms developed using theoretical analysis as a yardstick since then have not had any practical relevance. We conclude by discussing the remaining open problems and how they can be tackled.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/03/2022

The limitations of the theoretical analysis of applied algorithms

The theoretical analysis of performance has been an important tool in th...
research
06/12/2018

Indexed Dynamic Programming to boost Edit Distance and LCSS Computation

There are efficient dynamic programming solutions to the computation of ...
research
05/31/2023

Theoretical Analysis on the Efficiency of Interleaved Comparisons

This study presents a theoretical analysis on the efficiency of interlea...
research
10/20/2018

MinJoin: Efficient Edit Similarity Joins via Local Hash Minimums

In this paper we study edit similarity joins, in which we are given a se...
research
03/23/2015

A Machine Learning Approach to Predicting the Smoothed Complexity of Sorting Algorithms

Smoothed analysis is a framework for analyzing the complexity of an algo...
research
05/13/2021

Efficient Stepping Algorithms and Implementations for Parallel Shortest Paths

In this paper, we study the single-source shortest-path (SSSP) problem w...
research
05/19/2018

An optimal approximation of discrete random variables with respect to the Kolmogorov distance

We present an algorithm that takes a discrete random variable X and a nu...

Please sign up or login with your details

Forgot password? Click here to reset