Õ(n+poly(k))-time Algorithm for Bounded Tree Edit Distance

09/15/2022
by   Debarati Das, et al.

Computing the edit distance of two strings is one of the most basic problems in computer science and combinatorial optimization. Tree edit distance is a natural generalization of edit distance in which the task is to compute a measure of dissimilarity between two (unweighted) rooted trees with node labels. Perhaps the most notable recent application of tree edit distance is in large NoSQL databases, such as MongoDB, where each row of the database is a JSON document represented as a labeled rooted tree, and finding the dissimilarity between two rows is a basic operation. Until recently, the fastest algorithm for tree edit distance ran in cubic time (Demaine, Mozes, Rossman, Weimann; TALG'10); however, Mao (FOCS'21) broke the cubic barrier for the tree edit distance problem using fast matrix multiplication. For strings, given a parameter k as an upper bound on the distance, an O(n+k^2)-time algorithm for edit distance has been known since the 1980s due to the works of Myers (Algorithmica'86) and Landau and Vishkin (JCSS'88). The existence of an Õ(n+poly(k))-time algorithm for tree edit distance has been posed as an open question, e.g., by Akmal and Jin (ICALP'21), who gave a state-of-the-art Õ(nk^2)-time algorithm. In this paper, we answer this question positively.
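
To make the bounded (parameterized) setting concrete for strings, here is a minimal sketch of the textbook banded dynamic program. It is not the algorithm of this paper, and it is not the O(n+k^2) method of Myers or of Landau and Vishkin; it is the simpler O(nk)-time variant that only fills table cells within k of the main diagonal. The function name `bounded_edit_distance` and its convention of returning `None` when the distance exceeds k are assumptions made for illustration.

```python
from typing import Optional


def bounded_edit_distance(a: str, b: str, k: int) -> Optional[int]:
    """Return the edit distance of a and b if it is at most k, else None.

    Illustrative banded DP (hypothetical helper, O(len(a) * k) time):
    only cells (i, j) with |i - j| <= k are computed, because any
    alignment of cost at most k never leaves that band.
    """
    n, m = len(a), len(b)
    if abs(n - m) > k:
        return None  # the length difference alone already costs more than k
    inf = k + 1  # every value above k is treated as "too far"
    # Row 0: turning the empty prefix of a into b[:j] takes j insertions.
    prev = {j: j for j in range(0, min(m, k) + 1)}
    for i in range(1, n + 1):
        cur = {}
        for j in range(max(0, i - k), min(m, i + k) + 1):
            if j == 0:
                best = i  # delete the first i characters of a
            else:
                substitute = prev.get(j - 1, inf) + (a[i - 1] != b[j - 1])
                delete = prev.get(j, inf) + 1
                insert = cur.get(j - 1, inf) + 1
                best = min(substitute, delete, insert)
            cur[j] = min(best, inf)  # cap so values stay in [0, k + 1]
        prev = cur
    d = prev.get(m, inf)
    return d if d <= k else None
```

Under these assumptions, `bounded_edit_distance("kitten", "sitting", 3)` returns 3, while the same call with k = 2 returns `None`. The tree edit distance setting discussed above asks for the analogous guarantee on labeled rooted trees, which this string sketch does not attempt to cover.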

research
04/17/2023

Subcubic algorithm for (Unweighted) Unrooted Tree Edit Distance

The tree edit distance problem is a natural generalization of the classi...
research
06/03/2021

Breaking the Cubic Barrier for (Unweighted) Tree Edit Distance

The (unweighted) tree edit distance problem for n node trees asks to com...
research
02/09/2018

Neural Dynamic Programming for Musical Self Similarity

We present a neural sequence model designed specifically for symbolic mu...
research
04/26/2018

Edit Distance between Unrooted Trees in Cubic Time

Edit distance between trees is a natural generalization of the classical...
research
07/26/2022

Tree edit distance for hierarchical data compatible with HMIL paradigm

We define edit distance for hierarchically structured data compatible wi...
research
05/06/2021

Faster Algorithms for Bounded Tree Edit Distance

Tree edit distance is a well-studied measure of dissimilarity between ro...
research
07/26/2021

An A*-algorithm for the Unordered Tree Edit Distance with Custom Costs

The unordered tree edit distance is a natural metric to compute distance...
