Differentially Private SQL with Bounded User Contribution

09/04/2019
by   Royce J Wilson, et al.
0

Differential privacy (DP) provides formal guarantees that the output of a database query does not reveal too much information about any individual present in the database. While many differentially private algorithms have been proposed in the scientific literature, there are only a few end-to-end implementations of differentially private query engines. Crucially, existing systems assume that each individual is associated with at most one database record, which is unrealistic in practice. We propose a generic and scalable method to perform differentially private aggregations on databases, even when individuals can each be associated with arbitrarily many rows. We express this method as an operator in relational algebra, and implement it in an SQL engine. To validate this system, we test the utility of typical queries on industry benchmarks, and verify its correctness with a stochastic test framework we developed. We highlight the promises and pitfalls learned when deploying such a system in practice, and we publish its core components as open-source software.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/03/2018

Shrinkwrap: Differentially-Private Query Processing in Private Data Federations

A private data federation is a set of autonomous databases that share a ...
research
06/15/2022

Towards Verifiable Differentially-Private Polling

Analyses that fulfill differential privacy provide plausible deniability...
research
08/22/2022

DP-Rewrite: Towards Reproducibility and Transparency in Differentially Private Text Rewriting

Text rewriting with differential privacy (DP) provides concrete theoreti...
research
01/05/2023

DP-SIPS: A simpler, more scalable mechanism for differentially private partition selection

Partition selection, or set union, is an important primitive in differen...
research
02/03/2020

Differentially Private k-Means Clustering with Guaranteed Convergence

Iterative clustering algorithms help us to learn the insights behind the...
research
06/22/2020

Overlook: Differentially Private Exploratory Visualization for Big Data

Data exploration systems that provide differential privacy must manage a...
research
05/25/2018

An end-to-end Differentially Private Latent Dirichlet Allocation Using a Spectral Algorithm

Latent Dirichlet Allocation (LDA) is a powerful probabilistic model used...

Please sign up or login with your details

Forgot password? Click here to reset