Differential Privacy of Hierarchical Census Data: An Optimization Approach

06/28/2020
by Ferdinando Fioretto, et al.

This paper is motivated by the needs of a Census Bureau that wants to release aggregate socio-economic data about a large population without revealing sensitive information about any individual. The released information may include the number of individuals living alone, the number of cars they own, or their salary brackets. Recent events have highlighted some of the privacy challenges faced by such organizations. To address them, this paper presents a novel differential-privacy mechanism for releasing hierarchical counts of individuals. The counts are reported at multiple granularities (e.g., the national, state, and county levels) and must be consistent across all levels. The core of the mechanism is an optimization model that redistributes the noise introduced to achieve differential privacy so that the consistency constraints between the hierarchical levels are met. The key technical contribution is to show that this optimization problem can be solved in polynomial time by exploiting the structure of its cost functions. Experimental results on very large, real datasets show that the proposed mechanism improves computational efficiency and accuracy by up to two orders of magnitude with respect to other state-of-the-art techniques.
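
The general idea of adding noise to hierarchical counts and then redistributing it to restore consistency can be illustrated with a minimal sketch. This is not the paper's optimization model or its polynomial-time algorithm: it swaps in a plain least-squares adjustment on a two-level hierarchy (one parent, several children), and it ignores integrality and non-negativity, which a real release would likely need. The function names, the even split of the privacy budget across the two levels, and the assumed sensitivity of 1 per level are all assumptions made for illustration.

```python
import random


def laplace_noise(scale, rng):
    # The difference of two i.i.d. exponentials with mean `scale`
    # follows a Laplace distribution with location 0 and that scale.
    return rng.expovariate(1.0 / scale) - rng.expovariate(1.0 / scale)


def noisy_counts(parent, children, epsilon, rng):
    # Assumption: the budget epsilon is split evenly over the two levels,
    # and each level's counts have sensitivity 1, giving scale 2 / epsilon.
    scale = 2.0 / epsilon
    noisy_parent = parent + laplace_noise(scale, rng)
    noisy_children = [c + laplace_noise(scale, rng) for c in children]
    return noisy_parent, noisy_children


def make_consistent(noisy_parent, noisy_children):
    # Least-squares adjustment: find the closest values (in squared error)
    # such that the parent equals the sum of its children. The closed form
    # spreads the gap equally over the parent and all children.
    n = len(noisy_children)
    gap = noisy_parent - sum(noisy_children)
    shift = gap / (n + 1)
    return noisy_parent - shift, [c + shift for c in noisy_children]


rng = random.Random(0)  # fixed seed only to make the sketch repeatable
state, counties = 1000, [400, 350, 250]  # hypothetical true counts
p, cs = noisy_counts(state, counties, epsilon=1.0, rng=rng)
fixed_state, fixed_counties = make_consistent(p, cs)
```

After the adjustment, `fixed_state` equals the sum of `fixed_counties` up to floating-point error, whereas the raw noisy values `p` and `cs` generally disagree. Because the adjustment uses only the noisy values, it is post-processing and does not consume additional privacy budget.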

research
02/11/2022

Information Design for Differential Privacy

Firms and statistical agencies that publish aggregate data face practica...
research
07/14/2021

Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning

In recent years, machine learning techniques utilizing large-scale datas...
research
06/25/2022

Cactus Mechanisms: Optimal Differential Privacy Mechanisms in the Large-Composition Regime

Most differential privacy mechanisms are applied (i.e., composed) numero...
research
10/04/2017

(k,ε)-Anonymity: k-Anonymity with ε-Differential Privacy

The explosion in volume and variety of data offers enormous potential fo...
research
01/21/2019

Differential Privacy for Power Grid Obfuscation

The availability of high-fidelity energy networks brings significant val...
research
10/02/2017

Constrained Differential Privacy for Count Data

Concern about how to aggregate sensitive user data without compromising ...