Streaming with Oracle: New Streaming Algorithms for Edit Distance and LCS

02/26/2020
by   Alireza Farhadi, et al.
0

The edit distance (ED) and longest common subsequence (LCS) are two fundamental problems which quantify how similar two strings are to one another. In this paper, we consider these problems in the streaming model where one string is available via oracle queries and the other string comes as a stream of characters. Our main contribution is a constant factor approximation algorithm in this setting for ED with memory O(n^δ) for any δ > 0. In addition to this, we present an upper bound of Õ(√(n)) on the memory needed to approximate ED or LCS within a factor 1+o(1) in our setting. All our algorithms run in a single pass. For approximating ED within a constant factor, we discover yet another application of triangle inequality, this time in the context of streaming algorithms. Triangle inequality has been previously used to obtain subquadratic time approximation algorithms for ED. Our technique is novel and elegantly utilizes triangle inequality to save memory at the expense of an exponential increase in the runtime.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/26/2020

Asymmetric Streaming Algorithms for Edit Distance and LCS

The edit distance (ED) and longest common subsequence (LCS) are two fund...
research
11/20/2021

Approximation Algorithms for LCS and LIS with Truly Improved Running Times

Longest common subsequence (𝖫𝖢𝖲) is a classic and central problem in com...
research
11/08/2020

The Harmonic Edit Distance

This short note introduces a new distance between strings, where the cos...
research
03/01/2021

Lower Bounds and Improved Algorithms for Asymmetric Streaming Edit Distance and Longest Common Subsequence

In this paper, we study edit distance (ED) and longest common subsequenc...
research
12/10/2021

Improved Approximation Algorithms for Dyck Edit Distance and RNA Folding

The Dyck language, which consists of well-balanced sequences of parenthe...
research
02/19/2020

Space Efficient Deterministic Approximation of String Measures

We study approximation algorithms for the following three string measure...

Please sign up or login with your details

Forgot password? Click here to reset