A general cipher for individual data anonymization

12/07/2017
by   Nicolas Ruiz, et al.
0

Over the years, the literature on individual data anonymization has burgeoned in many directions. Borrowing from several areas of other sciences, the current diversity of concepts, models and tools available contributes to understanding and fostering individual data dissemination in a privacy-preserving way, as well as unleashing new sources of information for the benefits of society at large. However, such diversity doesn't come without some difficulties. Currently, the task of selecting the optimal analytical environment to conduct anonymization is complicated by the multitude of available choices. Based on recent contributions from the literature and inspired by cryptography, this paper proposes the first cipher for data anonymization. The functioning of this cipher shows that, in fact, every anonymization method can be viewed as a general form of rank swapping with unconstrained permutation structures. Beyond all the currently existing methods that it can mimic, this cipher offers a new way to practice data anonymization, notably by performing anonymization in an ex ante way, instead of being engaged in several ex post evaluations and iterations to reach the protection and information properties sought after. Moreover, the properties of this cipher point to some previously unknown general insights into the task of data anonymization considered at a general level of functioning. Finally, and to make the cipher operational, this paper proposes the introduction of permutation menus in data anonymization, where recently developed universal measures of disclosure risk and information loss are used ex ante for the calibration of permutation keys. To justify the relevance of their uses, a theoretical characterization of these measures is also proposed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2020

General Confidentiality and Utility Metrics for Privacy-Preserving Data Publishing Based on the Permutation Model

Anonymization for privacy-preserving data publishing, also known as stat...
research
12/04/2018

Hybrid Microaggregation for Privacy-Preserving Data Mining

k-Anonymity by microaggregation is one of the most commonly used anonymi...
research
09/18/2018

Model-Protected Multi-Task Learning

Multi-task learning (MTL) refers to the paradigm of learning multiple re...
research
01/04/2022

A note on the paper arXiv:2112.14547

We give historical remarks related to arXiv:2112.14547 ("A New Method of...
research
03/06/2018

Connecting Randomized Response, Post-Randomization, Differential Privacy and t-Closeness via Deniability and Permutation

We explore some novel connections between the main privacy models in use...
research
06/04/2022

A privacy preserving querying mechanism with high utility for electric vehicles

With the recent rise in awareness about advancing towards a sustainable ...
research
07/01/2015

Evaluation of Genotypic Diversity Measurements Exploited in Real-Coded Representation

Numerous genotypic diversity measures (GDMs) are available in the litera...

Please sign up or login with your details

Forgot password? Click here to reset