Smooth Anonymity for Sparse Binary Matrices

07/13/2022
by   Hossein Esfandiari, et al.
0

When working with user data providing well-defined privacy guarantees is paramount. In this work we aim to manipulate and share an entire sparse dataset with a third party privately. In fact, differential privacy has emerged as the gold standard of privacy, however, when it comes to sharing sparse datasets, as one of our main results, we prove that any differentially private mechanism that maintains a reasonable similarity with the initial dataset is doomed to have a very weak privacy guarantee. Hence we need to opt for other privacy notions such as k-anonymity are better at preserving utility in this context. In this work we present a variation of k-anonymity, which we call smooth k-anonymity and design simple algorithms that efficiently provide smooth k-anonymity. We further perform an empirical evaluation to back our theoretical guarantees, and show that our algorithm improves the performance in downstream machine learning tasks on anonymized data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/26/2019

Automatic Discovery of Privacy-Utility Pareto Fronts

Differential privacy is a mathematical framework for privacy-preserving ...
research
01/06/2014

Differentially Private Data Releasing for Smooth Queries with Synthetic Database Output

We consider accurately answering smooth queries while preserving differe...
research
09/06/2021

Differentially-Private Fingerprinting of Relational Databases

When sharing sensitive databases with other parties, a database owner ai...
research
09/19/2019

Differentially Private Regression and Classification with Sparse Gaussian Processes

A continuing challenge for machine learning is providing methods to perf...
research
02/10/2020

Guidelines for Implementing and Auditing Differentially Private Systems

Differential privacy is an information theoretic constraint on algorithm...
research
07/04/2018

Privacy Amplification by Subsampling: Tight Analyses via Couplings and Divergences

Differential privacy comes equipped with multiple analytical tools for t...
research
03/02/2020

Generating Higher-Fidelity Synthetic Datasets with Privacy Guarantees

This paper considers the problem of enhancing user privacy in common mac...

Please sign up or login with your details

Forgot password? Click here to reset