A New Algebraic Approach for String Reconstruction from Substring Compositions

01/24/2022
by   Utkarsh Gupta, et al.
0

We consider the problem of binary string reconstruction from the multiset of its substring compositions, i.e., referred to as the substring composition multiset, first introduced and studied by Acharya et al. We introduce a new algorithm for the problem of string reconstruction from its substring composition multiset which relies on the algebraic properties of the equivalent bivariate polynomial formulation of the problem. We then characterize specific algebraic conditions for the binary string to be reconstructed that guarantee the algorithm does not require any backtracking through the reconstruction, and, consequently, the time complexity is bounded polynomially. More specifically, in the case of no backtracking, our algorithm has a time complexity of O(n^2) compared to the algorithm by Acharya et al., which has a time complexity of O(n^2log(n)), where n is the length of the binary string. Furthermore, it is shown that larger sets of binary strings are uniquely reconstructable by the new algorithm and without the need for backtracking leading to codebooks of reconstruction codes that are larger, by a linear factor in size, compared to the previously known construction by Pattabiraman et al., while having O(n^2) reconstruction complexity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/14/2020

Mass Error-Correction Codes for Polymer-Based Data Storage

We consider the problem of correcting mass readout errors in information...
research
08/27/2017

Plain stopping time and conditional complexities revisited

In this paper we analyze the notion of "stopping time complexity", infor...
research
04/08/2019

String Synchronizing Sets: Sublinear-Time BWT Construction and Optimal LCE Data Structure

Burrows-Wheeler transform (BWT) is an invertible text transformation tha...
research
04/05/2020

On the Tandem Duplication Distance

A tandem duplication denotes the process of inserting a copy of a segmen...
research
07/26/2022

Maximum Weight Convex Polytope

We study the maximum weight convex polytope problem, in which the goal i...
research
06/21/2021

Computing the original eBWT faster, simpler, and with less memory

Mantaci et al. [TCS 2007] defined the eBWT to extend the definition of t...
research
01/21/2022

Insertion and Deletion Correction in Polymer-based Data Storage

Synthetic polymer-based storage seems to be a particularly promising can...

Please sign up or login with your details

Forgot password? Click here to reset