Prior-Aware Distribution Estimation for Differential Privacy

06/09/2021
by Yuchao Tao, et al.

Estimating the joint distribution of a dataset under differential privacy is a fundamental problem for many privacy-focused applications, such as query answering, machine learning, and synthetic data generation. In this work, we examine the joint distribution estimation problem given two inputs: 1) differentially private answers to a workload computed over private data and 2) a prior empirical distribution from a public dataset. Our goal is to find a new distribution such that the workload answers estimated from it are as accurate as the differentially private answers, while its relative entropy (KL divergence) with respect to the prior distribution is minimized. We propose an approach based on iterative optimization for solving this problem. An application of our solution won second place in the NIST 2020 Differential Privacy Temporal Map Challenge, Sprint 2.
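
To make the problem statement concrete, the following is a minimal sketch of one way such an estimate could be computed. It is not the paper's algorithm: it assumes a small discrete domain, linear queries given as rows of weights, a strictly positive prior, and a penalized objective KL(p || prior) + (penalty / 2) * ||W p - y||^2 solved by exponentiated-gradient updates. The names `fit_distribution`, `penalty`, `step`, and `iters`, and the toy numbers, are all illustrative assumptions.

```python
import math


def kl_divergence(p, q):
    """KL(p || q) for discrete distributions; terms with p_i == 0 contribute 0."""
    return sum(pi * math.log(pi / qi) for pi, qi in zip(p, q) if pi > 0)


def workload_answers(workload, p):
    """Apply each linear query (a row of weights) to the distribution p."""
    return [sum(w * pi for w, pi in zip(row, p)) for row in workload]


def fit_distribution(prior, workload, noisy_answers,
                     penalty=20.0, step=0.02, iters=5000):
    """Approximately minimize KL(p || prior) + (penalty / 2) * ||W p - y||^2
    over the probability simplex (assumed objective, not the paper's).

    prior must be strictly positive; step should be small relative to
    1 / (penalty * largest entry of W^T W + 1) for stable updates.
    """
    p = list(prior)
    for _ in range(iters):
        answers = workload_answers(workload, p)
        residual = [a - y for a, y in zip(answers, noisy_answers)]
        grad = []
        for i in range(len(p)):
            kl_grad = math.log(p[i] / prior[i]) + 1.0
            fit_grad = penalty * sum(r * row[i]
                                     for r, row in zip(residual, workload))
            grad.append(kl_grad + fit_grad)
        # Multiplicative update, then renormalize to stay on the simplex.
        p = [pi * math.exp(-step * g) for pi, g in zip(p, grad)]
        total = sum(p)
        p = [pi / total for pi in p]
    return p


# Toy usage: a uniform public prior over 4 cells and one noisy count query
# (hypothetical numbers) saying the first two cells hold about 70% of the mass.
prior = [0.25, 0.25, 0.25, 0.25]
workload = [[1.0, 1.0, 0.0, 0.0]]
noisy = [0.7]
estimate = fit_distribution(prior, workload, noisy)
print(estimate, workload_answers(workload, estimate), kl_divergence(estimate, prior))
```

In this sketch the penalty weight controls the trade-off: a larger `penalty` pulls the workload answers closer to the noisy ones at the cost of drifting further from the prior, and a smaller one keeps the estimate near the public data.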


research
06/23/2021

HDMM: Optimizing error of high-dimensional statistical queries under differential privacy

In this work we describe the High-Dimensional Matrix Mechanism (HDMM), a...
research
09/09/2019

A New Analysis of Differential Privacy's Generalization Guarantees

We give a new proof of the "transfer theorem" underlying adaptive data a...
research
06/01/2022

Differentially Private Shapley Values for Data Evaluation

The Shapley value has been proposed as a solution to many applications i...
research
07/18/2019

A Differentially Private Algorithm for Range Queries on Trajectories

We propose a novel algorithm to ensure ϵ-differential privacy for answer...
research
07/21/2023

Differentially Private Heavy Hitter Detection using Federated Analytics

In this work, we study practical heuristics to improve the performance o...
research
02/28/2018

INSPECTRE: Privately Estimating the Unseen

We develop differentially private methods for estimating various distrib...
research
03/31/2023

On Rényi Differential Privacy in Statistics-Based Synthetic Data Generation

Privacy protection with synthetic data generation often uses differentia...
