Genome Reconstruction Attacks Against Genomic Data-Sharing Beacons

01/24/2020
by   Kerem Ayoz, et al.
0

Sharing genome data in a privacy-preserving way stands as a major bottleneck in front of the scientific progress promised by the big data era in genomics. A community-driven protocol named genomic data-sharing beacon protocol has been widely adopted for sharing genomic data. The system aims to provide a secure, easy to implement, and standardized interface for data sharing by only allowing yes/no queries on the presence of specific alleles in the dataset. However, beacon protocol was recently shown to be vulnerable against membership inference attacks. In this paper, we show that privacy threats against genomic data sharing beacons are not limited to membership inference. We identify and analyze a novel vulnerability of genomic data-sharing beacons: genome reconstruction. We show that it is possible to successfully reconstruct a substantial part of the genome of a victim when the attacker knows the victim has been added to the beacon in a recent update. We also show that even if multiple individuals are added to the beacon during the same update, it is possible to identify the victim's genome with high confidence using traits that are easily accessible by the attacker (e.g., eye and hair color). Moreover, we show how the reconstructed genome using a beacon that is not associated with a sensitive phenotype can be used for membership inference attacks to beacons with sensitive phenotypes (i.e., HIV+). The outcome of this work will guide beacon operators on when and how to update the content of the beacon. Thus, this work will be an important attempt at helping beacon operators and participants make informed decisions.

READ FULL TEXT
research
08/17/2022

I-GWAS: Privacy-Preserving Interdependent Genome-Wide Association Studies

Genome-wide Association Studies (GWASes) identify genomic variations tha...
research
09/13/2022

Differentially Private Genomic Data Release For GWAS Reproducibility

With the rapid development of technology in genome-related fields, resea...
research
01/11/2023

Enabling Trade-offs in Privacy and Utility in Genomic Data Beacons and Summary Statistics

The collection and sharing of genomic data are becoming increasingly com...
research
10/21/2021

ML with HE: Privacy Preserving Machine Learning Inferences for Genome Studies

Preserving the privacy and security of big data in the context of cloud ...
research
03/20/2018

Big Data Challenges in Genome Informatics

In recent years, we have witnessed a dramatic data explosion in genomics...
research
02/15/2021

Reconstruction-Based Membership Inference Attacks are Easier on Difficult Problems

Membership inference attacks (MIA) try to detect if data samples were us...
research
07/20/2016

Evaluating the Strength of Genomic Privacy Metrics

The genome is a unique identifier for human individuals. The genome also...

Please sign up or login with your details

Forgot password? Click here to reset