Attribute Privacy: Framework and Mechanisms

09/08/2020
by   Wanrong Zhang, et al.
0

Ensuring the privacy of training data is a growing concern since many machine learning models are trained on confidential and potentially sensitive data. Much attention has been devoted to methods for protecting individual privacy during analyses of large datasets. However in many settings, global properties of the dataset may also be sensitive (e.g., mortality rate in a hospital rather than presence of a particular patient in the dataset). In this work, we depart from individual privacy to initiate the study of attribute privacy, where a data owner is concerned about revealing sensitive properties of a whole dataset during analysis. We propose definitions to capture attribute privacy in two relevant cases where global attributes may need to be protected: (1) properties of a specific dataset and (2) parameters of the underlying distribution from which dataset is sampled. We also provide two efficient mechanisms and one inefficient mechanism that satisfy attribute privacy for these settings. We base our results on a novel use of the Pufferfish framework to account for correlations across attributes in the data, thus addressing "the challenging problem of developing Pufferfish instantiations and algorithms for general aggregate secrets" that was left open by <cit.>.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/04/2022

Dikaios: Privacy Auditing of Algorithmic Fairness via Attribute Inference Attacks

Machine learning (ML) models have been deployed for high-stakes applicat...
research
04/02/2019

Data Disclosure under Perfect Sample Privacy

Perfect data privacy seems to be in fundamental opposition to the econom...
research
09/14/2022

Data Privacy and Trustworthy Machine Learning

The privacy risks of machine learning models is a major concern when tra...
research
12/12/2017

Topology of Privacy: Lattice Structures and Information Bubbles for Inference and Obfuscation

Information has intrinsic geometric and topological structure, arising f...
research
12/01/2019

Preserving Patient Privacy while Training a Predictive Model of In-hospital Mortality

Machine learning models can be used for pattern recognition in medical d...
research
08/16/2021

NeuraCrypt is not private

NeuraCrypt (Yara et al. arXiv 2021) is an algorithm that converts a sens...
research
04/04/2020

Privacy Shadow: Measuring Node Predictability and Privacy Over Time

The structure of network data enables simple predictive models to levera...

Please sign up or login with your details

Forgot password? Click here to reset