Efficient and Explicit Balanced Primer Codes

01/04/2019
by   Yeow Meng Chee, et al.
0

To equip DNA-based data storage with random-access capabilities, Yazdi et al. (2018) prepended DNA strands with specially chosen address sequences called primers and provided certain design criteria for these primers. We provide explicit constructions of error-correcting codes that are suitable as primer addresses and equip these constructions with efficient encoding algorithms. Specifically, our constructions take cyclic or linear codes as inputs and produce sets of primers with similar error-correcting capabilities. Using certain classes of BCH codes, we obtain infinite families of primer sets of length n, minimum distance d with (d + 1) _4 n + O(1) redundant symbols. Our techniques involve reversible cyclic codes (1964), an encoding method of Tavares et al. (1971) and Knuth's balancing technique (1986). In our investigation, we also construct efficient and explicit binary balanced error-correcting codes and codes for DNA computing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/14/2018

Properties and constructions of constrained codes for DNA-based data storage

We describe properties and constructions of constraint-based codes for D...
research
04/15/2018

Mutually Uncorrelated Codes for DNA Storage

Mutually Uncorrelated (MU) codes are a class of codes in which no proper...
research
01/08/2018

Efficient Encoding/Decoding of Irreducible Words for Codes Correcting Tandem Duplications

Tandem duplication is the process of inserting a copy of a segment of DN...
research
09/16/2018

Sequence-Subset Distance and Coding for Error Control in DNA-based Data Storage

The process of DNA-based data storage (DNA storage for short) can be mat...
research
02/27/2023

On the Design of Codes for DNA Computing: Secondary Structure Avoidance Codes

In this work, we investigate a challenging problem, which has been consi...
research
05/08/2023

Data-Driven Bee Identification for DNA Strands

We study a data-driven approach to the bee identification problem for DN...
research
02/23/2020

Efficient Compression of Long Arbitrary Sequences with No Reference at the Encoder

In a distributed information application an encoder compresses an arbitr...

Please sign up or login with your details

Forgot password? Click here to reset