Random (Un)rounding : Vulnerabilities in Discrete Attribute Disclosure in the 2021 Canadian Census

07/25/2023
by   Christopher West, et al.
0

The 2021 Canadian census is notable for using a unique form of privacy, random rounding, which independently and probabilistically rounds discrete numerical attribute values. In this work, we explore how hierarchical summative correlation between discrete variables allows for both probabilistic and exact solutions to attribute values in the 2021 Canadian Census disclosure. We demonstrate that, in some cases, it is possible to "unround" and extract the original private values before rounding, both in the presence and absence of provided population invariants. Using these methods, we expose the exact value of 624 previously private attributes in the 2021 Canadian census disclosure. We also infer the potential values of more than 1000 private attributes with a high probability of correctness. Finally, we propose how a simple solution based on unbounded discrete noise can effectively negate exact unrounding while maintaining high utility in the final product.

READ FULL TEXT

page 1

page 5

page 11

research
09/15/2020

Multimodal Joint Attribute Prediction and Value Extraction for E-commerce Product

Product attribute values are essential in many e-commerce scenarios, suc...
research
08/15/2016

Attribute Extraction from Product Titles in eCommerce

This paper presents a named entity extraction system for detecting attri...
research
10/13/2022

Consistency and Accuracy of CelebA Attribute Values

We report the first analysis of the experimental foundations of facial a...
research
05/13/2018

AttriGuard: A Practical Defense Against Attribute Inference Attacks via Adversarial Machine Learning

Users in various web and mobile applications are vulnerable to attribute...
research
10/04/2019

Energy Resource Control via Privacy Preserving Data

Although the frequent monitoring of smart meters enables granular contro...
research
06/28/2022

Simple and Effective Knowledge-Driven Query Expansion for QA-Based Product Attribute Extraction

A key challenge in attribute value extraction (AVE) from e-commerce site...
research
09/15/2021

Random Sampling Plus Fake Data: Multidimensional Frequency Estimates With Local Differential Privacy

With local differential privacy (LDP), users can privatize their data an...

Please sign up or login with your details

Forgot password? Click here to reset