Survey of Information Encoding Techniques for DNA

06/24/2019
by   Thomas Heinis, et al.
0

Key to DNA storage is encoding the information to a sequence of nucleotides before it can be synthesised for storage. Definition of such an encoding or mapping must adhere to multiple design restrictions. First, not all possible sequences of nucleotides can be synthesised. Homopolymers, e.g., sequences of the same nucleotide, of a length of more than two, for example, cannot be synthesised without potential errors. Similarly, the G-C content of the resulting sequences should be higher than 50%. Second, given that synthesis is expensive, the encoding must map as many bits as possible to one nucleotide. Third, the synthesis (as well as the sequencing) is error prone, leading to substitutions, deletions and insertions. An encoding must therefore be designed to be resilient to errors through error correction codes or replication. Fourth, for the purpose of computation and selective retrieval, encodings should result in substantially different sequences across all data, even for very similar data. In the following we discuss the history and evolution of encodings.

READ FULL TEXT
research
01/09/2020

Capacity-Approaching Constrained Codes with Error Correction for DNA-Based Data Storage

We propose coding techniques that limit the length of homopolymers runs,...
research
05/07/2020

Coding for Optimized Writing Rate in DNA Storage

A method for encoding information in DNA sequences is described. The met...
research
03/05/2021

Iterative DNA Coding Scheme With GC Balance and Run-Length Constraints Using a Greedy Algorithm

In this paper, we propose a novel iterative encoding algorithm for DNA s...
research
07/01/2023

Codes with Biochemical Constraints and Single Error Correction for DNA-Based Data Storage

In DNA-based data storage, DNA codes with biochemical constraints and er...
research
11/07/2021

Applications of Convolutional Codes to DNA Codes and Error-Correction

Convolutional codes are error-correcting linear codes that utilize shift...
research
04/07/2023

Iterative Soft Decoding Algorithm for DNA Storage Using Quality Score and Redecoding

Ever since deoxyribonucleic acid (DNA) was considered as a next-generati...
research
03/14/2020

Image Encryption Decryption Using Chaotic Logistic Mapping and DNA Encoding

In this paper we have proposed a method that uses chaotic logistic mappi...

Please sign up or login with your details

Forgot password? Click here to reset