A biologically constrained encoding solution for long-term storage of images onto synthetic DNA

by   Melpomeni Dimopoulou, et al.

Living in the age of the digital media explosion, the amount of data that is being stored increases dramatically. However, even if existing storage systems suggest efficiency in capacity, they are lacking in durability. Hard disks, flash, tape or even optical storage have limited lifespan in the range of 5 to 20 years. Interestingly, recent studies have proven that it was possible to use synthetic DNA for the storage of digital data, introducing a strong candidate to achieve data longevity. The DNA's biological properties allows the storage of a great amount of information into an extraordinary small volume while also promising efficient storage for centuries or even longer with no loss of information. However, encoding digital data onto DNA is not obvious, because when decoding, we have to face the problem of sequencing noise robustness. Furthermore, synthesizing DNA is an expensive process and thus, controlling the compression ratio by optimizing the rate-distortion trade-off is an important challenge we have to deal with. This work proposes a coding solution for the storage of digital images onto synthetic DNA. We developed a new encoding algorithm which generates a DNA code robust to biological errors coming from the synthesis and the sequencing processes. Furthermore, thanks to an optimized allocation process the solution is able to control the compression ratio and thus the length of the synthesized DNA strand. Results show an improvement in terms of coding potential compared to previous state-of-the-art works.


A constrained Shannon-Fano entropy coder for image storage in synthetic DNA

The exponentially increasing demand for data storage has been facing mor...

Implicit Neural Multiple Description for DNA-based data storage

DNA exhibits remarkable potential as a data storage solution due to its ...

On Coding for an Abstracted Nanopore Channel for DNA Storage

In the emerging field of DNA storage, data is encoded as DNA sequences a...

Efficient approximation of DNA hybridisation using deep learning

Deoxyribonucleic acid (DNA) has shown great promise in enabling computat...

A Machine Learning-based Approach to Detect Threats in Bio-Cyber DNA Storage Systems

Data storage is one of the main computing issues of this century. Not on...

Storage Space Allocation Strategy for Digital Data with Message Importance

This paper mainly focuses on the problem of lossy compression storage fr...

A self-contained and self-explanatory DNA storage system

Current research on DNA storage usually focuses on the improvement of st...

Please sign up or login with your details

Forgot password? Click here to reset