Developing Non-Stochastic Privacy-Preserving Policies Using Agglomerative Clustering

11/12/2019
by Ni Ding, et al.

We consider a non-stochastic privacy-preserving problem in which an adversary aims to infer sensitive information S from publicly accessible data X without using statistics. The task is to generate and release a quantization X̂ of X that minimizes the privacy leakage of S to X̂ while maintaining a certain level of utility (or, inversely, bounding the quantization loss). The variables S and X are treated as bounded and non-probabilistic, but are otherwise general. We consider two existing non-stochastic privacy measures, namely the maximum uncertainty reduction L_0(S → X̂) and the refined information I_*(S; X̂) (also called the maximin information) of S. For each privacy measure, we propose a corresponding agglomerative clustering algorithm that converges to a locally optimal quantization solution X̂ by iteratively merging elements in the alphabet of X. To instantiate the solution to this problem, we consider two specific utility measures: the worst-case resolution of X by observing X̂ and the maximal distortion of the released data X̂. We show that the value of the maximin information I_*(S; X̂) can be determined by dividing the confusability graph into connected subgraphs. Hence, I_*(S; X̂) can be reduced by merging nodes that connect subgraphs. The relation to probabilistic information-theoretic privacy is also studied by noting that the Gács-Körner common information is the stochastic version of I_* and indicates the attainability of statistical indistinguishability.
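
The connected-subgraph view of I_*(S; X̂) can be illustrated with a minimal sketch. It is not the paper's algorithm. It assumes that the set of feasible (s, x) pairs is given explicitly, that a quantization is a plain mapping from each x to a released symbol, and that I_* equals log2 of the number of connected components of the bipartite graph linking each s to the released symbols it can co-occur with. The names `maximin_information`, `joint_range`, `identity` and `merged` are hypothetical and chosen only for this illustration.

```python
from math import log2


def maximin_information(joint_range, quantizer):
    """Return log2 of the number of connected components of the graph
    that links each s to every released symbol it can co-occur with.

    joint_range: iterable of feasible (s, x) pairs (assumed given).
    quantizer:   dict mapping each x to its released symbol x_hat.
    """
    parent = {}

    def find(node):
        # Union-find lookup with path halving.
        parent.setdefault(node, node)
        while parent[node] != node:
            parent[node] = parent[parent[node]]
            node = parent[node]
        return node

    def union(a, b):
        root_a, root_b = find(a), find(b)
        if root_a != root_b:
            parent[root_a] = root_b

    # Tag nodes so that an s value and an x_hat value never collide.
    for s, x in joint_range:
        union(("s", s), ("xhat", quantizer[x]))

    roots = {find(node) for node in list(parent)}
    return log2(len(roots))


# Toy example: s = 0 is compatible with a and b, s = 1 with c and d.
joint_range = {(0, "a"), (0, "b"), (1, "c"), (1, "d")}

# Releasing X unchanged leaves two disconnected subgraphs.
identity = {"a": "a", "b": "b", "c": "c", "d": "d"}

# Merging b and c into one released symbol bridges the two subgraphs.
merged = {"a": "a", "b": "bc", "c": "bc", "d": "d"}

print(maximin_information(joint_range, identity))  # 1.0
print(maximin_information(joint_range, merged))    # 0.0
```

In this toy case, merging the two symbols that bridge the subgraphs lowers the leakage from one bit to zero, at the cost of a coarser release. An agglomerative procedure in the spirit of the abstract would pick such merges iteratively subject to a utility constraint, which this sketch does not model.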

research
10/26/2018

Development and Analysis of Deterministic Privacy-Preserving Policies Using Non-Stochastic Information Theory

A non-stochastic privacy metric using non-stochastic information theory ...
research
01/20/2019

A Submodularity-based Agglomerative Clustering Algorithm for the Privacy Funnel

For the privacy funnel (PF) problem, we propose an efficient iterative a...
research
02/01/2019

Privacy Against Brute-Force Inference Attacks

Privacy-preserving data release is about disclosing information about us...
research
01/27/2023

Information-Theoretic Privacy-Preserving Schemes Based On Perfect Privacy

Consider a pair of random variables (X,Y) distributed according to a giv...
research
12/29/2019

Privacy-Preserving Public Release of Datasets for Support Vector Machine Classification

We consider the problem of publicly releasing a dataset for support vect...
research
01/19/2020

Privacy-Utility Tradeoff in a Guessing Framework Inspired by Index Coding

This paper studies the tradeoff in privacy and utility in a single-trial...
research
10/10/2021

Enhancing Utility in the Watchdog Privacy Mechanism

This paper is concerned with enhancing data utility in the privacy watch...