HTF: Homogeneous Tree Framework for Differentially-Private Release of Location Data

07/29/2021
by   Sina Shaham, et al.
0

Mobile apps that use location data are pervasive, spanning domains such as transportation, urban planning and healthcare. Important use cases for location data rely on statistical queries, e.g., identifying hotspots where users work and travel. Such queries can be answered efficiently by building histograms. However, precise histograms can expose sensitive details about individual users. Differential privacy (DP) is a mature and widely-adopted protection model, but most approaches for DP-compliant histograms work in a data-independent fashion, leading to poor accuracy. The few proposed data-dependent techniques attempt to adjust histogram partitions based on dataset characteristics, but they do not perform well due to the addition of noise required to achieve DP. We identify density homogeneity as a main factor driving the accuracy of DP-compliant histograms, and we build a data structure that splits the space such that data density is homogeneous within each resulting partition. We show through extensive experiments on large-scale real-world data that the proposed approach achieves superior accuracy compared to existing approaches.

READ FULL TEXT

page 8

page 9

research
08/03/2021

A Neural Database for Differentially Private Spatial Range Queries

Mobile apps and location-based services generate large amounts of locati...
research
11/28/2022

Cache Me If You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration

Differential privacy (DP) allows data analysts to query databases that c...
research
08/20/2022

A Neural Approach to Spatio-Temporal Data Release with User-Level Differential Privacy

Several companies (e.g., Meta, Google) have initiated "data-for-good" pr...
research
08/24/2023

The Impact of De-Identification on Single-Year-of-Age Counts in the U.S. Census

In 2020, the U.S. Census Bureau transitioned from data swapping to diffe...
research
02/24/2022

Differentially-Private Publication of Origin-Destination Matrices with Intermediate Stops

Conventional origin-destination (OD) matrices record the count of trips ...
research
05/24/2022

DPSNN: A Differentially Private Spiking Neural Network

Privacy-preserving is a key problem for the machine learning algorithm. ...

Please sign up or login with your details

Forgot password? Click here to reset