Adaptive Exact Learning in a Mixed-Up World: Dealing with Periodicity, Errors and Jumbled-Index Queries in String Reconstruction

07/17/2020
by   Ramtin Afshar, et al.
0

We study the query complexity of exactly reconstructing a string from adaptive queries, such as substring, subsequence, and jumbled-index queries. Such problems have applications, e.g., in computational biology. We provide a number of new and improved bounds for exact string reconstruction for settings where either the string or the queries are "mixed-up". For example, we show that a periodic (i.e., "mixed-up") string, S=p^kp', of smallest period p, where |p'|<|p|, can be reconstructed using O(σ|p|+ n) substring queries, where σ is the alphabet size, if n=|S| is unknown. We also show that we can reconstruct S after having been corrupted by a small number of errors d, measured by Hamming distance. In this case, we give an algorithm that uses O(dσ|p| + d|p|n/d+1) queries. In addition, we show that a periodic string can be reconstructed using 2σ⌈ n⌉ + 2|p|⌈σ⌉ subsequence queries, and that general strings can be reconstructed using 2σ⌈ n⌉ + n⌈σ⌉ subsequence queries, without knowledge of n in advance. This latter result improves the previous best, decades-old result, by Skiena and Sundaram. Finally, we believe we are the first to study the exact-learning query complexity for string reconstruction using jumbled-index queries, which are a "mixed-up" typeA of query that have received much attention of late.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/13/2020

Substring Query Complexity of String Reconstruction

Suppose an oracle knows a string S that is unknown to us and we want to ...
research
08/31/2022

Reconstruction of a Single String from a Part of its Composition Multiset

Motivated by applications in polymer-based data storage, we study the pr...
research
08/02/2018

Reconstructing Strings from Substrings: Optimal Randomized and Average-Case Algorithms

The problem called "String reconstruction from substrings" is a mathemat...
research
01/30/2019

Faster queries for longest substring palindrome after block edit

Palindromes are important objects in strings which have been extensively...
research
06/09/2020

Faster Queries on BWT-runs Compressed Indexes

Although a significant number of compressed indexes for highly repetitiv...
research
01/13/2018

Longest Common Prefixes with k-Errors and Applications

Although real-world text datasets, such as DNA sequences, are far from b...
research
12/23/2019

Reconstruction of Strings from their Substrings Spectrum

This paper studies reconstruction of strings based upon their substrings...

Please sign up or login with your details

Forgot password? Click here to reset