SPIDER-WEB enables stable, repairable, and encryptible algorithms under arbitrary local biochemical constraints in DNA-based storage

04/06/2022
by   Haoling Zhang, et al.
0

DNA has been considered as a promising medium for storing digital information. Despite the biochemical progress in DNA synthesis and sequencing, novel coding algorithms need to be constructed under the specific constraints in DNA-based storage. Many functional operations and storage carriers were introduced in recent years, bringing in various biochemical constraints including but not confined to long single-nucleotide repeats and abnormal GC content. Existing coding algorithms are not applicable or unstable due to more local biochemical constraints and their combinations. In this paper, we design a graph-based architecture, named SPIDER-WEB, to generate corresponding graph-based algorithms under arbitrary local biochemical constraints. These generated coding algorithms could be used to encode arbitrary digital data as DNA sequences directly or served as a benchmark for the follow-up construction of coding algorithms. To further consider recovery and security issues existing in the storage field, it also provides pluggable algorithmic patches based on the generated coding algorithms: path-based correcting and mapping shuffling. They provide approaches for probabilistic error correction and symmetric encryption respectively.

READ FULL TEXT
research
12/31/2019

DNA Linear Block Codes: Generation, Error-detection and Error-correction of DNA Codeword

In modern age, the increasing complexity of computation and communicatio...
research
08/11/2023

Embracing Errors is More Efficient than Avoiding Them through Constrained Coding for DNA Data Storage

DNA is an attractive medium for digital data storage. When data is store...
research
02/15/2023

Indel Error Correction Codes for DNA Digital Data Storage and Retrieval

A procedure for storage and retrieval of Digital information in DNA stri...
research
01/09/2020

Capacity-Approaching Constrained Codes with Error Correction for DNA-Based Data Storage

We propose coding techniques that limit the length of homopolymers runs,...
research
04/10/2023

Kernel Code for DNA Digital Data Storage

The biggest challenge when using DNA as a storage medium is maintaining ...
research
03/18/2022

A constrained Shannon-Fano entropy coder for image storage in synthetic DNA

The exponentially increasing demand for data storage has been facing mor...
research
02/01/2019

Some Enumeration Problems in the Duplication-Loss Model of Genome Rearrangement

Tandem-duplication-random-loss (TDRL) is an important genome rearrangeme...

Please sign up or login with your details

Forgot password? Click here to reset