Defending Against Membership Inference Attacks on Beacon Services

Large genomic datasets are now created through numerous activities, including recreational genealogical investigations, biomedical research, and clinical care. At the same time, genomic data have become valuable for reuse beyond their initial point of collection, but privacy concerns often hinder access. Over the past several years, Beacon services have emerged to broaden accessibility to such data. These services enable users to query for the presence of a particular minor allele in a private dataset, information that can help care providers determine whether genomic variation is spurious or has some known clinical indication. However, various studies have shown that even this limited access model can leak whether individuals are members of the underlying dataset. Several approaches for mitigating this vulnerability have been proposed, but they are limited in that they 1) typically rely on heuristics and 2) offer probabilistic privacy guarantees, but neglect utility. In this paper, we present a novel algorithmic framework to ensure privacy in a Beacon service setting with a minimal number of query response flips (e.g., changing a positive response to a negative). Specifically, we formulate this problem as a combinatorial optimization problem in both the batch setting (where queries arrive all at once) and the online setting (where queries arrive sequentially). The former setting has been the primary focus of prior literature, whereas real Beacons allow sequential queries, motivating the latter investigation. We present principled algorithms in this framework with privacy guarantees and, in some cases, worst-case utility guarantees. Moreover, through an extensive experimental evaluation, we show that the proposed approaches significantly outperform the state of the art in terms of privacy and utility.
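To make the query model concrete, the following is a minimal Python sketch of a Beacon that answers presence queries and flips a chosen set of responses. It is an illustration only, not the paper's algorithm: the names `ToyBeacon`, `flipped`, and `count_flips`, as well as the variant identifiers, are hypothetical, and the sketch takes the set of flipped queries as given rather than computing it by optimization.

```python
from typing import FrozenSet, Iterable, Set

# Hypothetical representation: each individual is the set of variant IDs
# at which they carry the minor allele.
Genome = FrozenSet[str]


class ToyBeacon:
    """Toy Beacon: answers whether a minor allele is present in the dataset."""

    def __init__(self, genomes: Iterable[Genome], flipped: Iterable[str] = ()):
        # Union of all minor alleles carried by anyone in the private dataset.
        self._present: Set[str] = set()
        for genome in genomes:
            self._present |= genome
        # Queries whose truthful response will be flipped (assumed given).
        self._flipped: Set[str] = set(flipped)

    def truthful(self, variant: str) -> bool:
        """Unmodified response: is the minor allele present in the dataset?"""
        return variant in self._present

    def query(self, variant: str) -> bool:
        """Published response, with the flip applied if this query is selected."""
        answer = self.truthful(variant)
        return (not answer) if variant in self._flipped else answer


def count_flips(beacon: ToyBeacon, variants: Iterable[str]) -> int:
    """Utility loss in this sketch: number of queries answered untruthfully."""
    return sum(beacon.query(v) != beacon.truthful(v) for v in variants)


dataset = [frozenset({"rs1", "rs2"}), frozenset({"rs2", "rs3"})]
beacon = ToyBeacon(dataset, flipped={"rs3"})
flips = count_flips(beacon, ["rs1", "rs2", "rs3", "rs9"])
```

Here the query for `rs3` is the only flipped response (a positive changed to a negative), so the utility cost in this toy measure is one flip. Choosing which responses to flip so that membership is protected is the optimization problem the paper addresses, and it is not attempted here.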


