Chasing Accuracy and Privacy, and Catching Both: A Literature Survey on Differentially Private Histogram Publication

10/30/2019
by Boel Nelson, et al.

Histograms and synthetic data are of key importance in data analysis. However, researchers have shown that even aggregated data such as histograms, which contain no obviously sensitive attributes, can leak private information. Enabling data analysis without risking unintended privacy violations therefore requires a strong notion of privacy. Differential privacy is such a notion: a statistical definition of privacy that makes privacy leakage quantifiable. The caveat is that while differential privacy offers strong privacy guarantees, privacy comes at a cost in accuracy. Although this trade-off is a central issue in the adoption of differential privacy, the literature lacks a systematic understanding of the trade-off and of how to address it appropriately. Through a systematic literature review (SLR), we investigate the state of the art in accuracy-improving differentially private algorithms for histogram and synthetic data publishing. Our contribution is two-fold: 1) we provide an understanding of the problem by crystallizing the categories of accuracy-improving techniques and the core problems they solve, and by investigating how composable the techniques are, and 2) we pave the way for future work. To provide this understanding, we position and visualize the ideas in relation to each other and to external work, and we deconstruct each algorithm to examine its building blocks separately, with the aim of pinpointing which dimension of noise reduction each technique targets. Hence, this systematization of knowledge (SoK) shows in which dimensions, and how, accuracy improvement can be pursued without sacrificing privacy.
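To make the privacy-accuracy trade-off concrete, the following is a minimal sketch of a baseline that the abstract does not itself describe: the standard Laplace mechanism applied to histogram counts. It assumes add/remove-one-record neighbouring datasets, so each record affects exactly one bin and the L1 sensitivity is 1. The function names (`laplace_noise`, `private_histogram`, `mean_abs_error`) and the example counts are hypothetical and chosen only for illustration; this is not one of the surveyed algorithms.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace(0, scale) distribution.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def private_histogram(counts, epsilon, rng, sensitivity=1.0):
    # Assumption: one record changes one bin by at most 1
    # (add/remove neighbouring), hence sensitivity = 1 by default.
    if epsilon <= 0:
        raise ValueError("epsilon must be positive")
    scale = sensitivity / epsilon
    return [c + laplace_noise(scale, rng) for c in counts]


def mean_abs_error(counts, epsilon, trials=2000, seed=0):
    # Empirical per-bin absolute error, averaged over repeated releases.
    rng = random.Random(seed)
    total = 0.0
    for _ in range(trials):
        noisy = private_histogram(counts, epsilon, rng)
        total += sum(abs(n - c) for n, c in zip(noisy, counts)) / len(counts)
    return total / trials


if __name__ == "__main__":
    counts = [120, 45, 0, 310, 8]  # illustrative data, not from the paper
    for eps in (0.1, 1.0, 10.0):
        print(f"epsilon={eps}: mean abs error ~ {mean_abs_error(counts, eps):.2f}")
```

In this sketch the expected per-bin error is roughly 1/epsilon, so a smaller epsilon (stronger privacy) yields a noisier histogram. Accuracy-improving techniques of the kind the survey categorizes aim to do better than such a baseline at the same epsilon.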


research
09/30/2021

Private sampling: a noiseless approach for generating differentially private synthetic data

In a world where artificial intelligence and data science become omnipre...
research
04/16/2022

Assessing Differentially Private Variational Autoencoders under Membership Inference

We present an approach to quantify and compare the privacy-accuracy trad...
research
03/08/2021

Efficient Accuracy Prediction for Differentially Private Algorithms

Differential privacy is a strong mathematical notion of privacy. Still, ...
research
10/25/2022

Differentially Private Language Models for Secure Data Sharing

To protect the privacy of individuals whose data is being shared, it is ...
research
03/05/2019

A New Approach to Adaptive Data Analysis and Learning via Maximal Leakage

There is an increasing concern that most current published research find...
research
07/18/2022

Protecting Global Properties of Datasets with Distribution Privacy Mechanisms

Alongside the rapid development of data collection and analysis techniqu...
research
12/10/2022

Adore: Differentially Oblivious Relational Database Operators

There has been a recent effort in applying differential privacy on memor...
