Constrained Differential Privacy for Count Data

10/02/2017
by   Graham Cormode, et al.
0

Concern about how to aggregate sensitive user data without compromising individual privacy is a major barrier to greater availability of data. The model of differential privacy has emerged as an accepted model to release sensitive information while giving a statistical guarantee for privacy. Many different algorithms are possible to address different target functions. We focus on the core problem of count queries, and seek to design mechanisms to release data associated with a group of n individuals. Prior work has focused on designing mechanisms by raw optimization of a loss function, without regard to the consequences on the results. This can leads to mechanisms with undesirable properties, such as never reporting some outputs (gaps), and overreporting others (spikes). We tame these pathological behaviors by introducing a set of desirable properties that mechanisms can obey. Any combination of these can be satisfied by solving a linear program (LP) which minimizes a cost function, with constraints enforcing the properties. We focus on a particular cost function, and provide explicit constructions that are optimal for certain combinations of properties, and show a closed form for their cost. In the end, there are only a handful of distinct optimal mechanisms to choose between: one is the well-known (truncated) geometric mechanism; the second a novel mechanism that we introduce here, and the remainder are found as the solution to particular LPs. These all avoid the bad behaviors we identify. We demonstrate in a set of experiments on real and synthetic data which is preferable in practice, for different combinations of data distributions, constraints, and privacy parameters.

READ FULL TEXT

page 1

page 2

page 8

page 9

page 10

research
02/11/2022

Information Design for Differential Privacy

Firms and statistical agencies that publish aggregate data face practica...
research
06/25/2022

Cactus Mechanisms: Optimal Differential Privacy Mechanisms in the Large-Composition Regime

Most differential privacy mechanisms are applied (i.e., composed) numero...
research
11/09/2018

Towards Instance-Optimal Private Query Release

We study efficient mechanisms for the query release problem in different...
research
05/10/2022

Robust Optimization for Local Differential Privacy

We consider the setting of publishing data without leaking sensitive inf...
research
07/10/2017

Composition Properties of Inferential Privacy for Time-Series Data

With the proliferation of mobile devices and the internet of things, dev...
research
03/15/2022

Privacy-Aware Compression for Federated Data Analysis

Federated data analytics is a framework for distributed data analysis wh...
research
06/28/2020

Differential Privacy of Hierarchical Census Data: An Optimization Approach

This paper is motivated by applications of a Census Bureau interested in...

Please sign up or login with your details

Forgot password? Click here to reset