On the Design of Codes for DNA Computing: Secondary Structure Avoidance Codes

02/27/2023
by   Tuan Thanh Nguyen, et al.
0

In this work, we investigate a challenging problem, which has been considered to be an important criterion in designing codewords for DNA computing purposes, namely secondary structure avoidance in single-stranded DNA molecules. In short, secondary structure refers to the tendency of a single-stranded DNA sequence to fold back upon itself, thus becoming inactive in the computation process. While some design criteria that reduces the possibility of secondary structure formation has been proposed by Milenkovic and Kashyap (2006), the main contribution of this work is to provide an explicit construction of DNA codes that completely avoid secondary structure of arbitrary stem length. Formally, given codeword length n and arbitrary integer m>=2, we provide efficient methods to construct DNA codes of length n that avoid secondary structure of any stem length more than or equal to m. Particularly, when m = 3, our constructions yield a family of DNA codes of rate 1.3031 bits/nt, while the highest rate found in the prior art was 1.1609 bits/nt. In addition, for m>=3log n + 4, we provide an efficient encoder that incurs only one redundant symbol.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/29/2022

Construction of Multiple Constrained DNA Codes

DNA sequences are prone to creating secondary structures by folding back...
research
07/01/2023

Codes with Biochemical Constraints and Single Error Correction for DNA-Based Data Storage

In DNA-based data storage, DNA codes with biochemical constraints and er...
research
11/07/2021

Applications of Convolutional Codes to DNA Codes and Error-Correction

Convolutional codes are error-correcting linear codes that utilize shift...
research
01/04/2019

Efficient and Explicit Balanced Primer Codes

To equip DNA-based data storage with random-access capabilities, Yazdi e...
research
11/25/2022

On DNA Codes Over the Non-Chain Ring ℤ_4+uℤ_4+u^2ℤ_4 with u^3=1

In this paper, we present a novel design strategy of DNA codes with leng...
research
07/09/2021

Sketching and Sequence Alignment: A Rate-Distortion Perspective

Pairwise alignment of DNA sequencing data is a ubiquitous task in bioinf...
research
08/03/2021

Eliminating unwanted patterns with minimal interference

Artificial synthesis of DNA molecules is an essential part of the study ...

Please sign up or login with your details

Forgot password? Click here to reset