HDPView: Differentially Private Materialized View for Exploring High Dimensional Relational Data

03/14/2022
by   Fumiyuki Kato, et al.
0

How can we explore the unknown properties of high-dimensional sensitive relational data while preserving privacy? We study how to construct an explorable privacy-preserving materialized view under differential privacy. No existing state-of-the-art methods simultaneously satisfy the following essential properties in data exploration: workload independence, analytical reliability (i.e., providing error bound for each search query), applicability to high-dimensional data, and space efficiency. To solve the above issues, we propose HDPView, which creates a differentially private materialized view by well-designed recursive bisected partitioning on an original data cube, i.e., count tensor. Our method searches for block partitioning to minimize the error for the counting query, in addition to randomizing the convergence, by choosing the effective cutting points in a differentially private way, resulting in a less noisy and compact view. Furthermore, we ensure formal privacy guarantee and analytical reliability by providing the error bound for arbitrary counting queries on the materialized views. HDPView has the following desirable properties: (a) Workload independence, (b) Analytical reliability, (c) Noise resistance on high-dimensional data, (d) Space efficiency. To demonstrate the above properties and the suitability for data exploration, we conduct extensive experiments with eight types of range counting queries on eight real datasets. HDPView outperforms the state-of-the-art methods in these evaluations.

READ FULL TEXT

page 11

page 12

research
08/10/2018

Optimizing error of high-dimensional statistical queries under differential privacy

Differentially private algorithms for answering sets of predicate counti...
research
06/22/2020

P3GM: Private High-Dimensional Data Release via Privacy Preserving Phased Generative Model

How can we release a massive volume of sensitive data while mitigating p...
research
08/24/2022

DP2-Pub: Differentially Private High-Dimensional Data Publication with Invariant Post Randomization

A large amount of high-dimensional and heterogeneous data appear in prac...
research
02/24/2022

Differentially-Private Publication of Origin-Destination Matrices with Intermediate Stops

Conventional origin-destination (OD) matrices record the count of trips ...
research
11/07/2022

Private Set Generation with Discriminative Information

Differentially private data generation techniques have become a promisin...
research
07/18/2019

A Differentially Private Algorithm for Range Queries on Trajectories

We propose a novel algorithm to ensure ϵ-differential privacy for answer...
research
02/17/2021

Leveraging Public Data for Practical Private Query Release

In many statistical problems, incorporating priors can significantly imp...

Please sign up or login with your details

Forgot password? Click here to reset