Private Shotgun DNA Sequencing: A Structured Approach

03/29/2019
by Ali Gholami, et al.

Current genome sequencing techniques give the service provider (e.g., a sequencing company) full access to the genome information, which violates the privacy of individuals with respect to their lifetime secret. In this paper, we introduce the problem of private DNA sequencing and propose a sequencing scheme that keeps the genome information private from the service provider. Rather than a single sequencer (i.e., sequencing machine), the scheme uses a set of multiple sequencers. Each sequencer is assigned a set of DNA fragments, belonging to a pool of individuals, to read. The assignment is such that no single sequencer can reconstruct any individual's sequence, yet reconstruction is possible once the results of the reads are collected at a data collector. To increase the ambiguity at the sequencers, we add to the pool the fragments of some DNA molecules that are known to the data collector but unknown to the sequencers. We then show that the genome can be recovered with provable privacy guarantees if the parameters are adjusted correctly.
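The assignment idea can be illustrated with a toy simulation. This is a minimal sketch under stated assumptions, not the scheme from the paper: the parameters, the `(i + j) % NUM_SEQUENCERS` assignment rule, the names `alice`, `bob`, and `decoy`, and the owner/position tags are all hypothetical. The tags are simulation bookkeeping only (real sequencers would see untagged fragments), and incomplete coverage is used here as a crude stand-in for privacy, not as the paper's formal guarantee.

```python
import random

FRAG_LEN, STEP, NUM_SEQUENCERS = 10, 5, 4  # hypothetical toy parameters
GENOME_LEN = 100


def fragment(owner, genome):
    """Cut a genome into overlapping fragments tagged (owner, start, bases)."""
    starts = range(0, len(genome) - FRAG_LEN + 1, STEP)
    return [(owner, s, genome[s:s + FRAG_LEN]) for s in starts]


def assign(genomes):
    """Spread each owner's fragments over all sequencers (assumed rule)."""
    bins = [[] for _ in range(NUM_SEQUENCERS)]
    for i, (owner, genome) in enumerate(sorted(genomes.items())):
        for j, frag in enumerate(fragment(owner, genome)):
            bins[(i + j) % NUM_SEQUENCERS].append(frag)
    return bins


def coverage(frags, owner, length):
    """Fraction of the owner's genome positions covered by the given fragments."""
    covered = set()
    for who, start, bases in frags:
        if who == owner:
            covered.update(range(start, start + len(bases)))
    return len(covered) / length


rng = random.Random(0)
# "decoy" stands in for the known DNA added to the pool to increase ambiguity.
genomes = {
    name: "".join(rng.choice("ACGT") for _ in range(GENOME_LEN))
    for name in ("alice", "bob", "decoy")
}
bins = assign(genomes)
individuals = ("alice", "bob")

# What each sequencer alone could cover of each individual's genome.
per_sequencer = [
    [coverage(b, who, GENOME_LEN) for who in individuals] for b in bins
]
# What the data collector covers after pooling all reads.
all_reads = [frag for b in bins for frag in b]
collector = [coverage(all_reads, who, GENOME_LEN) for who in individuals]

print("per-sequencer coverage:", per_sequencer)
print("collector coverage:", collector)
```

In this toy setting every sequencer holds fragments from all three genomes but covers only part of each individual's sequence, while the pooled reads at the collector cover every position.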
