A simple algorithm for estimating distribution parameters from n-dimensional randomized binary responses

03/11/2018
by   Staal A. Vinterbo, et al.
0

Randomized response for privacy protection is attractive as provided disclosure control can be quantified by means such as differential privacy. However, recovering statistics involving multiple dependent binary attributes can be difficult, posing a barrier to the use of randomized response for privacy protection. In this work, we identify a family of randomizers for which we are able to present a simple and efficient algorithm for obtaining unbiased maximum likelihood estimates for k-way marginal distributions from the randomized data. We also provide theoretical bounds on the statistical efficiency of these estimates, allowing the assessment of sample sizes for these randomizers. The identified family consists of randomizers generated by an iterated Kronecker product of an invertible and bisymmetric 2 x 2 matrix. This family includes modes of Google's Rappor randomizer, as well as applications of two well-known classical randomized response methods: Warner's original method, and Simmons' unrelated question method. We find that randomizers in this family can also be considered to be equivalent to each other with respect to the efficiency -- differential privacy tradeoff. Importantly, the estimation algorithm is simple to implement, an aspect critical to technologies for privacy protection and security.

READ FULL TEXT

Authors

page 1

page 2

page 3

page 4

03/06/2018

Connecting Randomized Response, Post-Randomization, Differential Privacy and t-Closeness via Deniability and Permutation

We explore some novel connections between the main privacy models in use...
11/08/2021

Distribution-Invariant Differential Privacy

Differential privacy is becoming one gold standard for protecting the pr...
06/10/2020

Learning With Differential Privacy

The leakage of data might have been an extreme effect on the personal le...
10/21/2020

Multi-Dimensional Randomized Response

In our data world, a host of not necessarily trusted controllers gather ...
10/31/2019

Context-Aware Local Differential Privacy

Local differential privacy (LDP) is a strong notion of privacy for indiv...
05/24/2020

Successive Refinement of Privacy

This work examines a novel question: how much randomness is needed to ac...
07/24/2019

Privacy Parameter Variation Using RAPPOR on a Malware Dataset

Stricter data protection regulations and the poor application of privacy...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.