Properties and constructions of constrained codes for DNA-based data storage

We describe properties and constructions of constraint-based codes for DNA-based data storage which account for the maximum repetition length and AT/GC balance. We present algorithms for computing the number of sequences with maximum repetition length and AT/GC balance constraint. We describe routines for translating binary runlength limited and/or balanced strings into DNA strands, and compute the efficiency of such routines. We show that the implementation of AT/GC-balanced codes is straightforward accomplished with binary balanced codes. We present codes that account for both the maximum repetition length and AT/GC balance. We compute the redundancy difference between the binary and a fully fledged quaternary approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/04/2019

Efficient and Explicit Balanced Primer Codes

To equip DNA-based data storage with random-access capabilities, Yazdi e...
research
07/01/2023

Codes with Biochemical Constraints and Single Error Correction for DNA-Based Data Storage

In DNA-based data storage, DNA codes with biochemical constraints and er...
research
04/15/2018

Mutually Uncorrelated Codes for DNA Storage

Mutually Uncorrelated (MU) codes are a class of codes in which no proper...
research
10/15/2019

Optimal Codes Correcting a Single Indel / Edit for DNA-Based Data Storage

An indel refers to a single insertion or deletion, while an edit refers ...
research
10/10/2022

Generalized Unique Reconstruction from Substrings

This paper introduces a new family of reconstruction codes which is moti...
research
02/12/2019

On Conflict Free DNA Codes

DNA storage has emerged as an important area of research. The reliabilit...
research
07/02/2022

Balanced reconstruction codes for single edits

Motivated by the sequence reconstruction problem initiated by Levenshtei...

Please sign up or login with your details

Forgot password? Click here to reset