Evaluating Bias and Noise Induced by the U.S. Census Bureau's Privacy Protection Methods

06/13/2023
by   Christopher T. Kenny, et al.
0

The United States Census Bureau faces a difficult trade-off between the accuracy of Census statistics and the protection of individual information. We conduct the first independent evaluation of bias and noise induced by the Bureau's two main disclosure avoidance systems: the TopDown algorithm employed for the 2020 Census and the swapping algorithm implemented for the 1990, 2000, and 2010 Censuses. Our evaluation leverages the recent release of the Noisy Measure File (NMF) as well as the availability of two independent runs of the TopDown algorithm applied to the 2010 decennial Census. We find that the NMF contains too much noise to be directly useful alone, especially for Hispanic and multiracial populations. TopDown's post-processing dramatically reduces the NMF noise and produces similarly accurate data to swapping in terms of bias and noise. These patterns hold across census geographies with varying population sizes and racial diversity. While the estimated errors for both TopDown and swapping are generally no larger than other sources of Census error, they can be relatively substantial for geographies with small total populations.

READ FULL TEXT
research
05/12/2023

Making Differential Privacy Work for Census Data Users

The U.S. Census Bureau collects and publishes detailed demographic data ...
research
10/09/2020

Bias and Variance of Post-processing in Differential Privacy

Post-processing immunity is a fundamental property of differential priva...
research
01/30/2019

Distributionally Robust and Multi-Objective Nonnegative Matrix Factorization

Nonnegative matrix factorization (NMF) is a linear dimensionality reduct...
research
07/16/2019

Towards Adapting NMF Dictionaries Using Total Variability Modeling for Noise-Robust Acoustic Features

We propose an algorithm to extract noise-robust acoustic features from n...
research
12/11/2018

Faster-than-fast NMF using random projections and Nesterov iterations

Random projections have been recently implemented in Nonnegative Matrix ...
research
09/09/2022

Impacts of Census Differential Privacy for Small-Area Disease Mapping to Monitor Health Inequities

US Census Bureau (USCB) has implemented a new privacy-preserving disclos...
research
04/19/2022

The 2020 Census Disclosure Avoidance System TopDown Algorithm

The Census TopDown Algorithm (TDA) is a disclosure avoidance system usin...

Please sign up or login with your details

Forgot password? Click here to reset