Coding for Polymer-Based Data Storage

03/02/2020
by Srilakshmi Pattabiraman, et al.

Motivated by polymer-based data-storage platforms that use chains of binary synthetic polymers as the recording media and read the content via tandem mass spectrometers, we propose a new family of codes that allows for both unique string reconstruction and correction of multiple mass errors. We consider two approaches. The first pertains to asymmetric errors and introduces redundancy that scales linearly with the number of errors and logarithmically with the length of the string. The construction allows the string to be uniquely reconstructed based only on its erroneous substring composition multiset. The key idea behind our unique reconstruction approach is to interleave (shifted) Catalan-Bertrand paths with arbitrary binary strings and "reflect" them so as to force prefixes and suffixes of the same length to have different weights. The asymptotic code rate of the scheme is one, and decoding is accomplished via a simplified version of the backtracking algorithm used for the Turnpike problem. The second approach addresses symmetric errors: we use a polynomial characterization of the mass information and adapt polynomial evaluation code constructions to this setting. In the process, we develop new efficient decoding algorithms for a constant number of composition errors and show that the redundancy of the scheme scales quadratically with the number of errors and logarithmically with the code length.
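To make the terms in the abstract concrete, the following is a minimal Python sketch (not the authors' construction or decoder). It assumes that the "composition" of a substring is its pair of counts (number of 0s, number of 1s) and that "weight" means the number of 1s. The function names `composition_multiset` and `prefix_suffix_weights_differ` are hypothetical, chosen for illustration only. The sketch computes the substring composition multiset of a binary string and checks the prefix/suffix weight condition that the construction is said to enforce.

```python
from collections import Counter


def composition_multiset(s: str) -> Counter:
    """Multiset of (zeros, ones) counts over all contiguous substrings of s.

    Assumption: this is one reading of the "substring composition
    multiset" named in the abstract.
    """
    out = Counter()
    for i in range(len(s)):
        zeros = ones = 0
        # Extend the substring starting at i one symbol at a time.
        for j in range(i, len(s)):
            if s[j] == "1":
                ones += 1
            else:
                zeros += 1
            out[(zeros, ones)] += 1
    return out


def prefix_suffix_weights_differ(s: str) -> bool:
    """True if, for every length 1 <= k < len(s), the prefix and the
    suffix of length k contain a different number of 1s.

    Assumption: this is an illustrative check of the property described
    in the abstract, not the encoding procedure itself.
    """
    return all(
        s[:k].count("1") != s[-k:].count("1") for k in range(1, len(s))
    )


if __name__ == "__main__":
    s = "1101"
    # A string and its reversal always share the same composition
    # multiset, so the multiset alone cannot tell them apart.
    print(composition_multiset(s) == composition_multiset(s[::-1]))
    print(prefix_suffix_weights_differ("1100"))
    print(prefix_suffix_weights_differ("0110"))
```

Because reversing a string only reverses each of its substrings, the composition multiset is unchanged by reversal. This is one reason a plain binary string cannot in general be recovered uniquely from the multiset, and it suggests why a condition that separates prefix weights from suffix weights is useful for reconstruction.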

research
04/19/2019

Reconstruction and Error-Correction Codes for Polymer-Based Data Storage

Motivated by polymer-based data-storage platforms that use chains of bin...
research
08/31/2022

Reconstruction of a Single String from a Part of its Composition Multiset

Motivated by applications in polymer-based data storage, we study the pr...
research
01/14/2020

Mass Error-Correction Codes for Polymer-Based Data Storage

We consider the problem of correcting mass readout errors in information...
research
01/21/2022

Insertion and Deletion Correction in Polymer-based Data Storage

Synthetic polymer-based storage seems to be a particularly promising can...
research
04/12/2018

Unique Reconstruction of Coded Strings from Multiset Substring Spectra

The problem of reconstructing strings from their substring spectra has a...
research
10/21/2020

Reconstructing Mixtures of Coded Strings from Prefix and Suffix Compositions

The problem of string reconstruction from substring information has foun...
research
12/31/2019

Robust Positioning Patterns with Low Redundancy

A robust positioning pattern is a large array that allows a mobile devic...
