Reconstruction Codes for DNA Sequences with Uniform Tandem-Duplication Errors

01/18/2018
by   Yonatan Yehezkeally, et al.
0

DNA as a data storage medium has several advantages, including far greater data density compared to electronic media. We propose that schemes for data storage in the DNA of living organisms may benefit from studying the reconstruction problem, which is applicable whenever multiple reads of noisy data are available. This strategy is uniquely suited to the medium, which inherently replicates stored data in multiple distinct ways, caused by mutations. We consider noise introduced solely by uniform tandem-duplication, and utilize the relation to constant-weight integer codes in the Manhattan metric. By bounding the intersection of the cross-polytope with hyperplanes, we prove the existence of reconstruction codes with greater capacity than known error-correcting codes, which we can determine analytically for any set of parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/18/2020

Error-correcting Codes for Noisy Duplication Channels

Because of its high data density and longevity, DNA is emerging as a pro...
research
04/20/2023

DNA-Correcting Codes: End-to-end Correction in DNA Storage Systems

This paper introduces a new solution to DNA storage that integrates all ...
research
09/16/2018

Sequence-Subset Distance and Coding for Error Control in DNA Data Storage

The process of DNA data storage can be mathematically modelled as a comm...
research
10/20/2022

Robust Multi-Read Reconstruction from Contaminated Clusters Using Deep Neural Network for DNA Storage

DNA has immense potential as an emerging data storage medium. The princi...
research
01/20/2020

Uncertainty of Reconstructing Multiple Messages from Uniform-Tandem-Duplication Noise

A growing number of works have, in recent years, been concerned with in-...
research
08/22/2021

Sequence Reconstruction for Limited-Magnitude Errors

Motivated by applications to DNA storage, we study reconstruction and li...
research
10/12/2020

Trace Reconstruction Problems in Computational Biology

The problem of reconstructing a string from its error-prone copies, the ...

Please sign up or login with your details

Forgot password? Click here to reset