A Note on the Misinterpretation of the US Census Re-identification Attack

02/10/2022
by   Paul Francis, et al.
0

In 2018, the US Census Bureau designed a new data reconstruction and re-identification attack and tested it against their 2010 data release. The specific attack executed by the Bureau allows an attacker to infer the race and ethnicity of respondents with average 75 assuming that the attacker knows the correct age, sex, and address of the respondents. They interpreted the attack as exceeding the Bureau's privacy standards, and so introduced stronger privacy protections for the 2020 Census in the form of the TopDown Algorithm (TDA). This paper demonstrates that race and ethnicity can be inferred from the TDA-protected census data with substantially better precision and recall, using less prior knowledge: only the respondents' address. Race and ethnicity can be inferred with average 75 inferred with 100 by simply assuming that the race/ethnicity of the respondent is that of the majority race/ethnicity for the respondent's census block. The conclusion to draw from this simple demonstration is NOT that the Bureau's data releases lack adequate privacy protections. Indeed it is the purpose of the data releases to allow this kind of inference. The problem, rather, is that the Bureau's criteria for measuring privacy is flawed and overly pessimistic.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/14/2019

A note on the Uniformed Patroller Game

Patrolling Games were introduced by Alpern, Morton and Papadaki (2011) t...
research
10/28/2019

Ready, set, Go! Data-race detection and the Go language

Data races are often discussed in the context of lock acquisition and re...
research
05/21/2020

Everything is a Race and Nakamoto Always Wins

Nakamoto invented the longest chain protocol, and claimed its security b...
research
05/05/2018

Predicting Race and Ethnicity From the Sequence of Characters in a Name

To answer questions about racial inequality, we often need a way to infe...
research
10/31/2019

Reducing audio membership inference attack accuracy to chance: 4 defenses

It is critical to understand the privacy and robustness vulnerabilities ...
research
05/29/2021

The Impact of the U.S. Census Disclosure Avoidance System on Redistricting and Voting Rights Analysis

The US Census Bureau plans to protect the privacy of 2020 Census respond...
research
08/23/2022

Towards cumulative race time regression in sports: I3D ConvNet transfer learning in ultra-distance running events

Predicting an athlete's performance based on short footage is highly cha...

Please sign up or login with your details

Forgot password? Click here to reset