DeepAI AI Chat
Log In Sign Up

HTF: Homogeneous Tree Framework for Differentially-Private Release of Location Data

by   Sina Shaham, et al.
University of Southern California
University of Massachusetts-Boston

Mobile apps that use location data are pervasive, spanning domains such as transportation, urban planning and healthcare. Important use cases for location data rely on statistical queries, e.g., identifying hotspots where users work and travel. Such queries can be answered efficiently by building histograms. However, precise histograms can expose sensitive details about individual users. Differential privacy (DP) is a mature and widely-adopted protection model, but most approaches for DP-compliant histograms work in a data-independent fashion, leading to poor accuracy. The few proposed data-dependent techniques attempt to adjust histogram partitions based on dataset characteristics, but they do not perform well due to the addition of noise required to achieve DP. We identify density homogeneity as a main factor driving the accuracy of DP-compliant histograms, and we build a data structure that splits the space such that data density is homogeneous within each resulting partition. We show through extensive experiments on large-scale real-world data that the proposed approach achieves superior accuracy compared to existing approaches.


page 8

page 9


A Neural Database for Differentially Private Spatial Range Queries

Mobile apps and location-based services generate large amounts of locati...

Cache Me If You Can: Accuracy-Aware Inference Engine for Differentially Private Data Exploration

Differential privacy (DP) allows data analysts to query databases that c...

A Neural Approach to Spatio-Temporal Data Release with User-Level Differential Privacy

Several companies (e.g., Meta, Google) have initiated "data-for-good" pr...

The Impact of De-Identification on Single-Year-of-Age Counts in the U.S. Census

In 2020, the U.S. Census Bureau transitioned from data swapping to diffe...

Differentially-Private Publication of Origin-Destination Matrices with Intermediate Stops

Conventional origin-destination (OD) matrices record the count of trips ...

DPSNN: A Differentially Private Spiking Neural Network

Privacy-preserving is a key problem for the machine learning algorithm. ...