Leveraging Public Data for Practical Private Query Release

02/17/2021
by Terrance Liu, et al.

In many statistical problems, incorporating priors can significantly improve performance. However, the use of prior knowledge in differentially private query release has remained underexplored, despite such priors commonly being available in the form of public datasets, such as previous US Census releases. With the goal of releasing statistics about a private dataset, we present PMW^Pub, which – unlike existing baselines – leverages public data drawn from a related distribution as prior information. We provide a theoretical analysis and an empirical evaluation on the American Community Survey (ACS) and ADULT datasets, which shows that our method outperforms state-of-the-art methods. Furthermore, PMW^Pub scales well to high-dimensional data domains, where running many existing methods would be computationally infeasible.

research
03/02/2023

Choosing Public Datasets for Private Machine Learning via Gradient Subspace Distance

Differentially private stochastic gradient descent privatizes model trai...
research
06/13/2022

Private Synthetic Data with Hierarchical Structure

We study the problem of differentially private synthetic data generation...
research
12/29/2021

Differentially-Private Clustering of Easy Instances

Clustering is a fundamental problem in data analysis. In differentially ...
research
07/10/2020

New Oracle-Efficient Algorithms for Private Synthetic Data Release

We present three new algorithms for constructing differentially private ...
research
11/11/2022

Differentially Private Methods for Compositional Data

Protecting individuals' private information while still allowing modeler...
research
03/14/2022

HDPView: Differentially Private Materialized View for Exploring High Dimensional Relational Data

How can we explore the unknown properties of high-dimensional sensitive ...
research
06/14/2021

Iterative Methods for Private Synthetic Data: Unifying Framework and New Methods

We study private synthetic data generation for query release, where the ...
