Oblivious Sampling Algorithms for Private Data Analysis

09/28/2020
by   Sajin Sasy, et al.
0

We study secure and privacy-preserving data analysis based on queries executed on samples from a dataset. Trusted execution environments (TEEs) can be used to protect the content of the data during query computation, while supporting differential-private (DP) queries in TEEs provides record privacy when query output is revealed. Support for sample-based queries is attractive due to privacy amplification since not all dataset is used to answer a query but only a small subset. However, extracting data samples with TEEs while proving strong DP guarantees is not trivial as secrecy of sample indices has to be preserved. To this end, we design efficient secure variants of common sampling algorithms. Experimentally we show that accuracy of models trained with shuffling and sampling is the same for differentially private models for MNIST and CIFAR-10, while sampling provides stronger privacy guarantees than shuffling.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/28/2022

Cache Me If You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration

Differential privacy (DP) allows data analysts to query databases that c...
research
05/10/2023

Privacy-Preserving Recommender Systems with Synthetic Query Generation using Differentially Private Large Language Models

We propose a novel approach for developing privacy-preserving large-scal...
research
10/03/2018

Shrinkwrap: Differentially-Private Query Processing in Private Data Federations

A private data federation is a set of autonomous databases that share a ...
research
12/28/2018

Answering Range Queries Under Local Differential Privacy

Counting the fraction of a population having an input within a specified...
research
03/31/2019

KloakDB: A Platform for Analyzing Sensitive Data with K-anonymous Query Processing

A private data federation enables data owners to pool their information ...
research
11/10/2020

A Systematic Comparison of Encrypted Machine Learning Solutions for Image Classification

This work provides a comprehensive review of existing frameworks based o...
research
09/15/2023

DP-PQD: Privately Detecting Per-Query Gaps In Synthetic Data Generated By Black-Box Mechanisms

Synthetic data generation methods, and in particular, private synthetic ...

Please sign up or login with your details

Forgot password? Click here to reset