Distributed Differentially Private Mutual Information Ranking and Its Applications

09/22/2020
by   Ankit Srivastava, et al.
0

Computation of Mutual Information (MI) helps understand the amount of information shared between a pair of random variables. Automated feature selection techniques based on MI ranking are regularly used to extract information from sensitive datasets exceeding petabytes in size, over millions of features and classes. Series of one-vs-all MI computations can be cascaded to produce n-fold MI results, rapidly pinpointing informative relationships. This ability to quickly pinpoint the most informative relationships from datasets of billions of users creates privacy concerns. In this paper, we present Distributed Differentially Private Mutual Information (DDP-MI), a privacy-safe fast batch MI, across various scenarios such as feature selection, segmentation, ranking, and query expansion. This distributed implementation is protected with global model differential privacy to provide strong assurances against a wide range of privacy attacks. We also show that our DDP-MI can substantially improve the efficiency of MI calculations compared to standard implementations on a large-scale public dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/13/2021

"I need a better description”: An Investigation Into User Expectations For Differential Privacy

Despite recent widespread deployment of differential privacy, relatively...
research
06/01/2023

Better Private Linear Regression Through Better Private Feature Selection

Existing work on differentially private linear regression typically assu...
research
06/07/2023

Differentially Private Selection from Secure Distributed Computing

Given a collection of vectors x^(1),…,x^(n)∈{0,1}^d, the selection probl...
research
04/05/2023

PrivGraph: Differentially Private Graph Data Publication by Exploiting Community Information

Graph data is used in a wide range of applications, while analyzing grap...
research
01/27/2022

Plume: Differential Privacy at Scale

Differential privacy has become the standard for private data analysis, ...
research
12/30/2020

Privacy-Constrained Policies via Mutual Information Regularized Policy Gradients

As reinforcement learning techniques are increasingly applied to real-wo...

Please sign up or login with your details

Forgot password? Click here to reset