Differentially Private Database Release via Kernel Mean Embeddings

10/04/2017
by   Matej Balog, et al.
0

We lay theoretical foundations for new database release mechanisms that allow third-parties to construct consistent estimators of population statistics, while ensuring that the privacy of each individual contributing to the database is protected. The proposed framework rests on two main ideas. First, releasing (an estimate of) the kernel mean embedding of the data generating random variable instead of the database itself still allows third-parties to construct consistent estimators of a wide class of population statistics. Second, the algorithm can satisfy the definition of differential privacy by basing the released kernel mean embedding on entirely synthetic data points, while controlling accuracy through the metric available in a Reproducing Kernel Hilbert Space. We describe two instantiations of the proposed framework, suitable under different scenarios, and prove theoretical results guaranteeing differential privacy of the resulting algorithms and the consistency of estimators constructed from their outputs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/22/2019

Privacy-preserving parametric inference: a case for robust statistics

Differential privacy is a cryptographically-motivated approach to privac...
research
01/06/2014

Differentially Private Data Releasing for Smooth Queries with Synthetic Database Output

We consider accurately answering smooth queries while preserving differe...
research
03/31/2023

On Rényi Differential Privacy in Statistics-Based Synthetic Data Generation

Privacy protection with synthetic data generation often uses differentia...
research
09/06/2021

Differentially-Private Fingerprinting of Relational Databases

When sharing sensitive databases with other parties, a database owner ai...
research
03/17/2021

Accuracy Gains from Privacy Amplification Through Sampling for Differential Privacy

Recent research in differential privacy demonstrated that (sub)sampling ...
research
08/26/2022

Epistemic Parity: Reproducibility as an Evaluation Metric for Differential Privacy

Differential privacy mechanisms are increasingly used to enable public r...
research
01/24/2023

Database Reconstruction Is Not So Easy and Is Different from Reidentification

In recent years, it has been claimed that releasing accurate statistical...

Please sign up or login with your details

Forgot password? Click here to reset