Scalable and Provably Accurate Algorithms for Differentially Private Distributed Decision Tree Learning

12/19/2020
by   Kaiwen Wang, et al.
0

This paper introduces the first provably accurate algorithms for differentially private, top-down decision tree learning in the distributed setting (Balcan et al., 2012). We propose DP-TopDown, a general privacy preserving decision tree learning algorithm, and present two distributed implementations. Our first method NoisyCounts naturally extends the single machine algorithm by using the Laplace mechanism. Our second method LocalRNM significantly reduces communication and added noise by performing local optimization at each data holder. We provide the first utility guarantees for differentially private top-down decision tree learning in both the single machine and distributed settings. These guarantees show that the error of the privately-learned decision tree quickly goes to zero provided that the dataset is sufficiently large. Our extensive experiments on real datasets illustrate the trade-offs of privacy, accuracy and generalization when learning private decision trees in the distributed setting.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/24/2023

Differentially-Private Decision Trees with Probabilistic Robustness to Data Poisoning

Decision trees are interpretable models that are well-suited to non-line...
research
04/22/2019

Distributed Differentially Private Computation of Functions with Correlated Noise

Many applications of machine learning, such as human health research, in...
research
10/26/2014

Differentially- and non-differentially-private random decision trees

We consider supervised learning with random decision trees, where the tr...
research
10/06/2022

Federated Boosted Decision Trees with Differential Privacy

There is great demand for scalable, secure, and efficient privacy-preser...
research
09/21/2023

S-GBDT: Frugal Differentially Private Gradient Boosting Decision Trees

Privacy-preserving learning of gradient boosting decision trees (GBDT) h...
research
06/24/2023

Zero-Concentrated Private Distributed Learning for Nonsmooth Objective Functions

This paper develops a fully distributed differentially-private learning ...
research
01/26/2020

Boosted and Differentially Private Ensembles of Decision Trees

Boosted ensemble of decision tree (DT) classifiers are extremely popular...

Please sign up or login with your details

Forgot password? Click here to reset