Differentially Private Tree-Based Redescription Mining

12/13/2022
by   Matej Mihelčić, et al.
0

Differential privacy provides a strong form of privacy and allows preserving most of the original characteristics of the dataset. Utilizing these benefits requires one to design specific differentially private data analysis algorithms. In this work, we present three tree-based algorithms for mining redescriptions while preserving differential privacy. Redescription mining is an exploratory data analysis method for finding connections between two views over the same entities, such as phenotypes and genotypes of medical patients, for example. It has applications in many fields, including some, like health care informatics, where privacy-preserving access to data is desired. Our algorithms are the first differentially private redescription mining algorithms, and we show via experiments that, despite the inherent noise in differential privacy, it can return trustworthy results even in smaller datasets where noise typically has a stronger effect.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/13/2020

Auditing Differentially Private Machine Learning: How Private is Private SGD?

We investigate whether Differentially Private SGD offers better privacy ...
research
03/31/2020

Differentially Private Naïve Bayes Classifier using Smooth Sensitivity

With the increasing collection of users' data, protecting individual pri...
research
07/14/2021

Towards Quantifying the Carbon Emissions of Differentially Private Machine Learning

In recent years, machine learning techniques utilizing large-scale datas...
research
01/27/2022

Plume: Differential Privacy at Scale

Differential privacy has become the standard for private data analysis, ...
research
03/18/2023

The Challenge of Differentially Private Screening Rules

Linear L_1-regularized models have remained one of the simplest and most...
research
03/16/2018

Differential Privacy for Growing Databases

We study the design of differentially private algorithms for adaptive an...
research
06/22/2020

P3GM: Private High-Dimensional Data Release via Privacy Preserving Phased Generative Model

How can we release a massive volume of sensitive data while mitigating p...

Please sign up or login with your details

Forgot password? Click here to reset