Privacy-Preserving Boosting in the Local Setting

02/06/2020
by   Sen Wang, et al.
0

In machine learning, boosting is one of the most popular methods that designed to combine multiple base learners to a superior one. The well-known Boosted Decision Tree classifier, has been widely adopted in many areas. In the big data era, the data held by individual and entities, like personal images, browsing history and census information, are more likely to contain sensitive information. The privacy concern raises when such data leaves the hand of the owners and be further explored or mined. Such privacy issue demands that the machine learning algorithm should be privacy aware. Recently, Local Differential Privacy is proposed as an effective privacy protection approach, which offers a strong guarantee to the data owners, as the data is perturbed before any further usage, and the true values never leave the hands of the owners. Thus the machine learning algorithm with the private data instances is of great value and importance. In this paper, we are interested in developing the privacy-preserving boosting algorithm that a data user is allowed to build a classifier without knowing or deriving the exact value of each data samples. Our experiments demonstrate the effectiveness of the proposed boosting algorithm and the high utility of the learned classifiers.

READ FULL TEXT

page 9

page 12

research
01/25/2019

SecureBoost: A Lossless Federated Learning Framework

The protection of user privacy is an important concern in machine learni...
research
02/09/2020

Privacy-Preserving Image Classification in the Local Setting

Image data has been greatly produced by individuals and commercial vendo...
research
11/11/2019

Privacy-Preserving Gradient Boosting Decision Trees

The Gradient Boosting Decision Tree (GBDT) is a popular machine learning...
research
07/30/2019

Privacy-preserving Distributed Machine Learning via Local Randomization and ADMM Perturbation

With the proliferation of training data, distributed machine learning (D...
research
05/09/2020

Utility-aware Privacy-preserving Data Releasing

In the big data era, more and more cloud-based data-driven applications ...
research
02/22/2018

Privacy-Preserving Boosting with Random Linear Classifiers for Learning from User-Generated Data

User-generated data is crucial to predictive modeling in many applicatio...
research
10/20/2021

Privacy in Open Search: A Review of Challenges and Solutions

Privacy is of worldwide concern regarding activities and processes that ...

Please sign up or login with your details

Forgot password? Click here to reset