Differentially Private Median Forests for Regression and Classification

06/15/2020
by   Shorya Consul, et al.
0

Random forests are a popular method for classification and regression due to their versatility. However, this flexibility can come at the cost of user privacy, since training random forests requires multiple data queries, often on small, identifiable subsets of the training data. Differentially private approaches based on extremely random trees reduce the number of queries, but can lead to low-occupancy leaf nodes which require the addition of large amounts of noise. In this paper, we propose DiPriMe forests, a novel tree-based ensemble method for regression and classification problems, that ensures differential privacy while maintaining high utility. We construct trees based on a privatized version of the median value of attributes, obtained via the exponential mechanism. The use of the noisy median encourages balanced leaf nodes, ensuring that the noise added to the parameter estimate at each leaf is not overly large. The resulting algorithm, which is appropriate for real or categorical covariates, exhibits high utility while ensuring differential privacy.

READ FULL TEXT
research
08/21/2020

Low Influence, Utility, and Independence in Differential Privacy: A Curious Case of 3 2

We study the relationship between randomized low influence functions and...
research
06/27/2019

Differentially private sub-Gaussian location estimators

We tackle the problem of estimating a location parameter with differenti...
research
05/24/2023

Differentially-Private Decision Trees with Probabilistic Robustness to Data Poisoning

Decision trees are interpretable models that are well-suited to non-line...
research
12/10/2020

Research Challenges in Designing Differentially Private Text Generation Mechanisms

Accurately learning from user data while ensuring quantifiable privacy g...
research
01/26/2020

Boosted and Differentially Private Ensembles of Decision Trees

Boosted ensemble of decision tree (DT) classifiers are extremely popular...
research
06/30/2022

Imputation under Differential Privacy

The literature on differential privacy almost invariably assumes that th...
research
03/27/2018

Privacy-preserving Prediction

Ensuring differential privacy of models learned from sensitive user data...

Please sign up or login with your details

Forgot password? Click here to reset