On Codes for the Noisy Substring Channel

02/02/2021
by   Yonatan Yehezkeally, et al.
0

We consider the problem of coding for the substring channel, in which information strings are observed only through their (multisets of) substrings. Because of applications to DNA-based data storage, due to DNA sequencing techniques, interest in this channel has renewed in recent years. In contrast to existing literature, we consider a noisy channel model, where information is subject to noise before its substrings are sampled, motivated by in-vivo storage. We study two separate noise models, substitutions or deletions. In both cases, we examine families of codes which may be utilized for error-correction and present combinatorial bounds. Through a generalization of the concept of repeat-free strings, we show that the added required redundancy due to this imperfect observation assumption is sublinear, either when the fraction of errors in the observed substring length is sufficiently small, or when that length is sufficiently long. This suggests that no asymptotic cost in rate is incurred by this channel model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2018

On Coding over Sliced Information

The interest in channel models in which the data is sent as an unordered...
research
01/20/2020

Uncertainty of Reconstructing Multiple Messages from Uniform-Tandem-Duplication Noise

A growing number of works have, in recent years, been concerned with in-...
research
12/02/2021

Achieving the Capacity of a DNA Storage Channel with Linear Coding Schemes

Due to the redundant nature of DNA synthesis and sequencing technologies...
research
11/13/2019

Single-Error Detection and Correction for Duplication and Substitution Channels

Motivated by mutation processes occurring in in-vivo DNA-storage applica...
research
01/26/2022

Adversarial Torn-paper Codes

This paper studies the adversarial torn-paper channel. This problem is m...
research
04/14/2022

Optimal Reference for DNA Synthesis

In the recent years, DNA has emerged as a potentially viable storage tec...
research
06/21/2022

Signature Codes for a Noisy Adder Multiple Access Channel

In this work, we consider q-ary signature codes of length k and size n f...

Please sign up or login with your details

Forgot password? Click here to reset