Boost: Effective Caching in Differentially-Private Databases

06/28/2023
by Kelly Kostopoulou, et al.

Differentially private (DP) databases can enable privacy-preserving analytics over datasets or data streams that contain sensitive personal records. In such systems, user privacy is a very limited resource that every new query consumes, so it must be aggressively conserved. We propose Boost, the most effective caching component for linear query workloads over DP databases. Boost builds upon private multiplicative weights (PMW), a DP mechanism that is powerful in theory but very ineffective in practice, and transforms it into a highly effective caching object, PMW-Bypass. PMW-Bypass uses prior-query results obtained through an external DP mechanism to train a PMW to answer arbitrary future linear queries accurately and "for free" from a privacy perspective. We show that Boost with PMW-Bypass conserves significantly more budget than vanilla PMW and simpler cache designs: a 1.51x to 14.25x improvement in experiments on public Covid19 and CitiBike datasets. Moreover, Boost supports range-query workloads, such as timeseries or streaming workloads, where DP parallel composition and warm-starting of PMW state offer further opportunities to conserve privacy budget. Our work thus establishes both a coherent system design and the theoretical underpinnings for effective caching in DP databases.
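
To make the idea of training a PMW from externally obtained DP answers more concrete, here is a minimal Python sketch. It is not the Boost implementation: the function names (`mw_update`, `laplace_answer`), the learning rate `lr`, the tolerance `tol`, and the toy histogram are all assumptions chosen for illustration. The sketch pays for one Laplace-noised answer to a linear query, then applies standard multiplicative-weights updates to a histogram using only that noisy answer. Because the updates read nothing but the already-released DP answer, they are post-processing and consume no additional budget. The sketch does not cover how a cache would decide privately when the histogram is accurate enough to answer a new query on its own.

```python
import numpy as np

def laplace_answer(true_hist, query, n_records, epsilon, rng):
    """External DP mechanism (assumed): Laplace noise on a normalized linear query.

    With query weights in [0, 1] and a histogram normalized by n_records,
    changing one record moves the answer by at most 1 / n_records.
    """
    scale = 1.0 / (n_records * epsilon)
    return float(query @ true_hist) + rng.laplace(scale=scale)

def mw_update(hist, query, noisy_answer, lr=0.1, tol=0.01):
    """One multiplicative-weights step that nudges hist toward noisy_answer."""
    estimate = float(query @ hist)
    if abs(noisy_answer - estimate) <= tol:
        return hist  # already close enough on this query; leave the state alone
    # Boost bins the query covers when we underestimate, shrink them otherwise.
    sign = 1.0 if noisy_answer > estimate else -1.0
    updated = hist * np.exp(sign * lr * query)
    return updated / updated.sum()  # renormalize to a probability histogram

rng = np.random.default_rng(0)
# Hypothetical sensitive data, as a normalized histogram over 8 bins.
true_hist = np.array([0.5, 0.2, 0.1, 0.05, 0.05, 0.04, 0.03, 0.03])
query = np.array([1, 1, 0, 0, 0, 0, 0, 0], dtype=float)  # "fraction in bins 0-1"

hist = np.full(8, 1 / 8)  # the PMW state starts as a uniform histogram
error_before = abs(float(query @ hist) - float(query @ true_hist))

# One paid-for DP answer; everything after this line is free post-processing.
noisy = laplace_answer(true_hist, query, n_records=10_000, epsilon=1.0, rng=rng)
for _ in range(50):
    hist = mw_update(hist, query, noisy)

error_after = abs(float(query @ hist) - float(query @ true_hist))
print(f"error before: {error_before:.3f}, after: {error_after:.3f}")
```

In this toy run the histogram's estimate for the queried range moves from the uniform guess toward the noisy answer, so a later repeat of the same query could in principle be served from the histogram rather than by spending budget again.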

research
11/28/2022

Cache Me If You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration

Differential privacy (DP) allows data analysts to query databases that c...
research
09/04/2019

Privacy Accounting and Quality Control in the Sage Differentially Private ML Platform

Companies increasingly expose machine learning (ML) models trained over ...
research
12/21/2022

Differentially Private Decentralized Optimization with Relay Communication

To address the privacy leakage problem in decentralized composite convex...
research
05/10/2023

Privacy-Preserving Recommender Systems with Synthetic Query Generation using Differentially Private Large Language Models

We propose a novel approach for developing privacy-preserving large-scal...
research
11/30/2022

An Optimized Privacy-Utility Trade-off Framework for Differentially Private Data Sharing in Blockchain-based Internet of Things

Differential private (DP) query and response mechanisms have been widely...
research
05/12/2021

A Nearly Instance-optimal Differentially Private Mechanism for Conjunctive Queries

Releasing the result size of conjunctive queries and graph pattern queri...
research
02/09/2023

Pushing the Boundaries of Private, Large-Scale Query Answering

We address the problem of efficiently and effectively answering large nu...
