General Confidentiality and Utility Metrics for Privacy-Preserving Data Publishing Based on the Permutation Model

10/07/2020
by   Josep Domingo-Ferrer, et al.
0

Anonymization for privacy-preserving data publishing, also known as statistical disclosure control (SDC), can be viewed under the lens of the permutation model. According to this model, any SDC method for individual data records is functionally equivalent to a permutation step plus a noise addition step, where the noise added is marginal, in the sense that it does not alter ranks. Here, we propose metrics to quantify the data confidentiality and utility achieved by SDC methods based on the permutation model. We distinguish two privacy notions: in our work, anonymity refers to subjects and hence mainly to protection against record re-identification, whereas confidentiality refers to the protection afforded to attribute values against attribute disclosure. Thus, our confidentiality metrics are useful even if using a privacy model ensuring an anonymity level ex ante. The utility metric is a general-purpose metric that can be conveniently traded off against the confidentiality metrics, because all of them are bounded between 0 and 1. As an application, we compare the utility-confidentiality trade-offs achieved by several anonymization approaches, including privacy models (k-anonymity and ϵ-differential privacy) as well as SDC methods (additive noise, multiplicative noise and synthetic data) used without privacy models.

READ FULL TEXT
research
12/29/2019

Privacy-Preserving Public Release of Datasets for Support Vector Machine Classification

We consider the problem of publicly releasing a dataset for support vect...
research
12/07/2017

A general cipher for individual data anonymization

Over the years, the literature on individual data anonymization has burg...
research
01/17/2023

Binary Mechanisms under Privacy-Preserving Noise

We study mechanism design for public-good provision under a noisy privac...
research
09/18/2018

Model-Protected Multi-Task Learning

Multi-task learning (MTL) refers to the paradigm of learning multiple re...
research
02/03/2023

Private, fair and accurate: Training large-scale, privacy-preserving AI models in radiology

Artificial intelligence (AI) models are increasingly used in the medical...
research
04/14/2020

Distributed Privacy Preserving Iterative Summation Protocols

In this paper, we study the problem of summation evaluation of secrets. ...
research
07/08/2022

Bistochastic privacy

We introduce a new privacy model relying on bistochastic matrices, that ...

Please sign up or login with your details

Forgot password? Click here to reset