Lower Bounds and Improved Algorithms for Asymmetric Streaming Edit Distance and Longest Common Subsequence

03/01/2021
by   Xin Li, et al.

In this paper, we study edit distance (ED) and longest common subsequence (LCS) in the asymmetric streaming model, introduced by Saks and Seshadhri [SS13]. As an intermediate model between the random access model and the streaming model, this model allows one to have streaming access to one string and random access to the other string. Our first main contribution is a systematic study of space lower bounds for ED and LCS in the asymmetric streaming model. Previously, there were no explicitly stated results in this context, although some lower bounds for LCS can be inferred from the lower bounds for longest increasing subsequence (LIS) in [SW07][GG10][EJ08]. Yet these bounds apply only to large alphabets. In this paper, we develop several new techniques to handle ED in general and LCS for small alphabets, thus establishing strong lower bounds for both problems. In particular, our lower bound for ED provides an exponential separation between edit distance and Hamming distance in the asymmetric streaming model. Our lower bounds also extend to LIS and longest non-decreasing sequence (LNS) in the standard streaming model. Together with previous results, our bounds provide an almost complete picture for these two problems. As our second main contribution, we give improved algorithms for ED and LCS in the asymmetric streaming model. For ED, we improve the space complexity of the constant-factor approximation algorithms in [FHRS20][CJLZ20] from Õ(n^δ/δ) to O(d^δ/δ polylog(n)), where n is the length of each string and d is the edit distance between the two strings. For LCS, we give the first 1/2+ϵ approximation algorithm with space n^δ for any constant δ>0, over a binary alphabet.
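To make the access pattern of the asymmetric streaming model concrete, the following is a minimal illustrative sketch and not an algorithm from the paper. The function names `hamming_distance_asymmetric` and `edit_distance_row_dp` are hypothetical. The first function reads one string as a one-pass stream and indexes the other at random, keeping only a position and a counter. The second is the textbook row-by-row dynamic program, which also consumes one string symbol by symbol but stores a full row of length n+1; it is the naive linear-space baseline that sublinear-space algorithms aim to beat.

```python
from typing import Iterable, Sequence


def hamming_distance_asymmetric(stream_x: Iterable[str], y: Sequence[str]) -> int:
    """Hamming distance with one pass over x and random access to y.

    Only a position and a mismatch counter are stored, i.e. O(log n) bits.
    """
    mismatches = 0
    position = 0
    for symbol in stream_x:  # each symbol of x is seen once, in order
        if position >= len(y):
            raise ValueError("strings must have equal length")
        if symbol != y[position]:  # random access into y
            mismatches += 1
        position += 1
    if position != len(y):
        raise ValueError("strings must have equal length")
    return mismatches


def edit_distance_row_dp(stream_x: Iterable[str], y: Sequence[str]) -> int:
    """Textbook exact edit distance, reading x as a stream.

    Keeps one DP row of length len(y) + 1, so the space is linear in n.
    """
    previous = list(range(len(y) + 1))  # distance from the empty prefix of x
    for i, cx in enumerate(stream_x, start=1):
        current = [i]  # deleting all i symbols of x read so far
        for j in range(1, len(y) + 1):
            substitution = previous[j - 1] + (cx != y[j - 1])
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1]


if __name__ == "__main__":
    # Generators stand in for the stream: x cannot be re-read or indexed.
    print(hamming_distance_asymmetric((c for c in "karolin"), "kathrin"))
    print(edit_distance_row_dp((c for c in "kitten"), "sitting"))
```

The contrast in working memory between the two functions (a counter versus a full row) is only meant to give intuition for why a separation between Hamming distance and edit distance in this model is plausible; the bounds themselves are established in the paper.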


