DeepAI AI Chat
Log In Sign Up

Overlook: Differentially Private Exploratory Visualization for Big Data

by   Pratiksha Thaker, et al.
Stanford University

Data exploration systems that provide differential privacy must manage a privacy budget that measures the amount of privacy lost across multiple queries. One effective strategy to manage the privacy budget is to compute a one-time private synopsis of the data, to which users can make an unlimited number of queries. However, existing systems using synopses are built for offline use cases, where a set of queries is known ahead of time and the system carefully optimizes a synopsis for it. The synopses that these systems build are costly to compute and may also be costly to store. We introduce Overlook, a system that enables private data exploration at interactive latencies for both data analysts and data curators. The key idea in Overlook is a virtual synopsis that can be evaluated incrementally, without extra space storage or expensive precomputation. Overlook simply executes queries using an existing engine, such as a SQL DBMS, and adds noise to their results. Because Overlook's synopses do not require costly precomputation or storage, data curators can also use Overlook to explore the impact of privacy parameters interactively. Overlook offers a rich visual query interface based on the open source Hillview system. Overlook achieves accuracy comparable to existing synopsis-based systems, while offering better performance and removing the need for extra storage.


page 10

page 11


Shrinkwrap: Differentially-Private Query Processing in Private Data Federations

A private data federation is a set of autonomous databases that share a ...

Multi-Analyst Differential Privacy for Online Query Answering

Most differentially private mechanisms are designed for the use of a sin...

Mr. Plotter: Unifying Data Reduction Techniques in Storage and Visualization Systems

As the rate of data collection continues to grow rapidly, developing vis...

Differentially Private SQL with Bounded User Contribution

Differential privacy (DP) provides formal guarantees that the output of ...

An Uncertainty Principle is a Price of Privacy-Preserving Microdata

Privacy-protected microdata are often the desired output of a differenti...

M^2M: A general method to perform various data analysis tasks from a differentially private sketch

Differential privacy is the standard privacy definition for performing a...

Non-Stochastic Private Function Evaluation

We consider private function evaluation to provide query responses based...