Statistical modelling under differential privacy constraints: A case study in fine-scale geographical analysis with Australian Bureau of Statistics TableBuilder data

07/12/2023
by   Ewan Cameron, et al.
0

Guided by the principles of differential privacy protection the Australian Bureau of Statistics modifies the data summaries from the Australian Census provided through TableBuilder to researchers at approved institutions. This modification algorithm includes the injection of a small degree of artificial noise to every nonzero cell count followed by the suppression of very small cell counts to zero. Researchers working with small area TableBuilder outputs with a high suppression fraction have proposed various algorithmic solutions to reconciling these with less suppressed outputs from larger enclosing areas. Here we propose that a Bayesian, likelihood-based statistical approach in which the perturbation algorithm itself is explicitly represented is well suited to analyses with such randomly perturbed data. Using both real (TableBuilder) and mock datasets representing dwelling classifications in the Perth Greater Capital City Area we demonstrate the feasibility and utility of multi-scale Bayesian reconstruction of modified cell counts in a spatial setting.

READ FULL TEXT

page 16

page 17

page 19

page 20

page 21

page 24

research
08/05/2021

Perturbed M-Estimation: A Further Investigation of Robust Statistics for Differential Privacy

Differential Privacy (DP) provides an elegant mathematical framework for...
research
01/29/2021

N-grams Bayesian Differential Privacy

Differential privacy has gained popularity in machine learning as a stro...
research
09/11/2018

Usable Differential Privacy: A Case Study with PSI

Differential privacy is a promising framework for addressing the privacy...
research
09/07/2018

Differentially Private Continual Release of Graph Statistics

Motivated by understanding the dynamics of sensitive social networks ove...
research
01/24/2023

Database Reconstruction Is Not So Easy and Is Different from Reidentification

In recent years, it has been claimed that releasing accurate statistical...
research
12/17/2020

Differential privacy and noisy confidentiality concepts for European population statistics

The paper aims to give an overview of various approaches to statistical ...
research
03/21/2022

Distributed non-disclosive validation of predictive models by a modified ROC-GLM

Distributed statistical analyses provide a promising approach for privac...

Please sign up or login with your details

Forgot password? Click here to reset