Error-Correcting Codes for Nanopore Sequencing

05/17/2023
by   Anisha Banerjee, et al.
0

Nanopore sequencers, being superior to other sequencing technologies for DNA storage in multiple aspects, have attracted considerable attention in recent times. Their high error rates however demand thorough research on practical and efficient coding schemes to enable accurate recovery of stored data. To this end, we consider a simplified model of a nanopore sequencer inspired by Mao et al., that incorporates intersymbol interference and measurement noise. Essentially, our channel model passes a sliding window of length ℓ over an input sequence, that outputs the L_1-weight of the enclosed ℓ bits and shifts by δ positions with each time step. The resulting (ℓ+1)-ary vector, termed the read vector, may also be corrupted by t substitution errors. By employing graph-theoretic techniques, we deduce that for δ=1, at least loglog n bits of redundancy are required to correct a single (t=1) substitution. Finally for ℓ≥ 3, we exploit some inherent characteristics of read vectors to arrive at an error-correcting code that is optimal up to an additive constant for this setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/07/2018

On Coding over Sliced Information

The interest in channel models in which the data is sent as an unordered...
research
09/16/2018

Sequence-Subset Distance and Coding for Error Control in DNA Data Storage

The process of DNA data storage can be mathematically modelled as a comm...
research
09/16/2018

Sequence-Subset Distance and Coding for Error Control in DNA-based Data Storage

The process of DNA-based data storage (DNA storage for short) can be mat...
research
08/18/2020

Error-correcting Codes for Noisy Duplication Channels

Because of its high data density and longevity, DNA is emerging as a pro...
research
01/27/2023

Codes for Correcting Asymmetric Adjacent Transpositions and Deletions

Owing to the vast applications in DNA-based data storage, Gabrys, Yaakob...
research
03/11/2019

Clustering-Correcting Codes

A new family of codes, called clustering-correcting codes, is presented ...
research
01/21/2022

Insertion and Deletion Correction in Polymer-based Data Storage

Synthetic polymer-based storage seems to be a particularly promising can...

Please sign up or login with your details

Forgot password? Click here to reset