Differentially Private Genomic Data Release For GWAS Reproducibility

09/13/2022
by   Yuzhou Jiang, et al.
0

With the rapid development of technology in genome-related fields, researchers have proposed various approaches and algorithms in recent years. However, they rarely publish the genomic datasets they used in their works for others to reproduce and validate their methods, as sharing those data directly can lead to significant privacy risks (e.g., against inference attacks). To solve the problem and expedite cooperative scientific research, we propose a novel differentially private sharing mechanism for genomic datasets that protects the entire genomic dataset under differential privacy. To improve data utility of the GWAS statistics, we further develop a post-processing scheme that performs optimal transport (OT) on the empirical distributions of SNP values. The distributions are also achieved in a privacy-preserving manner. We evaluate our approach on several real genomic datasets and show in the experiments that it provides better protection against both genomic and machine learning-based membership inference attacks and offers higher GWAS utility than the baseline approaches.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2021

Generalization Techniques Empirically Outperform Differential Privacy against Membership Inference

Differentially private training algorithms provide protection against on...
research
06/23/2023

Differentially Private Streaming Data Release under Temporal Correlations via Post-processing

The release of differentially private streaming data has been extensivel...
research
04/10/2022

Differentially Private Fingerprinting for Location Trajectories

Location-based services have brought significant convenience to people i...
research
08/26/2022

Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy

Differential privacy mechanisms are increasingly used to enable public r...
research
08/04/2022

Differentially Private Counterfactuals via Functional Mechanism

Counterfactual, serving as one emerging type of model explanation, has a...
research
01/24/2020

Genome Reconstruction Attacks Against Genomic Data-Sharing Beacons

Sharing genome data in a privacy-preserving way stands as a major bottle...
research
03/20/2023

Differentially Private Algorithms for Synthetic Power System Datasets

While power systems research relies on the availability of real-world ne...

Please sign up or login with your details

Forgot password? Click here to reset