Privacy-Preserving Boosting with Random Linear Classifiers for Learning from User-Generated Data

by   Sagar Sharma, et al.

User-generated data is crucial to predictive modeling in many applications. With a web/mobile/wearable interface, an online service provider (SP) can continuously record user-generated data and depend on various predictive models learned from the data to improve their services and revenue. SPs owning the large collection of user-generated data has raised privacy concerns. We present a privacy-preserving framework, SecureBoost, which allows users to submit encrypted or randomly masked data to SP who learn only prediction models but nothing else. Our framework utilizes random linear classifiers (RLCs) as the base classifiers in the boosting framework to simplify the design of privacy-preserving protocol. A Cryptographic Service Provider (CSP) is used to assist SP's processing, reducing the complexity of the protocol constructions while the leakage of information to CSP is limited. We present two constructions of SecureBoost: HE+GC and SecSh+GC, using combinations of homomorphic encryption, garbled circuits, and random masking to achieve both security and efficiency. We have conducted extensive experiments to understand the quality of the RLC-based boosting and the cost distribution of the constructions. The result shows that SecureBoost efficiently learns high-quality boosting models from protected user-generated data.



page 1

page 2

page 3

page 4


Boosting Privately: Privacy-Preserving Federated Extreme Boosting for Mobile Crowdsensing

The state-of-the-art federated learning brings a new direction for the d...

Enabling Strong Privacy Preservation and Accurate Task Allocation for Mobile Crowdsensing

Mobile crowdsensing engages a crowd of individuals to use their mobile d...

Safepaths: Vaccine Diary Protocol and Decentralized Vaccine Coordination System using a Privacy Preserving User Centric Experience

In this early draft, we present an end-to-end decentralized protocol for...

Privacy-Preserving Boosting in the Local Setting

In machine learning, boosting is one of the most popular methods that de...

Cloud-based Federated Boosting for Mobile Crowdsensing

The application of federated extreme gradient boosting to mobile crowdse...

UPPRESSO: Untraceable and Unlinkable Privacy-PREserving Single Sign-On Services

Single sign-on (SSO) allows a user to maintain only the credential at th...

Multimodal Privacy-preserving Mood Prediction from Mobile Data: A Preliminary Study

Mental health conditions remain under-diagnosed even in countries with c...
This week in AI

Get the week's most popular data science and artificial intelligence research sent straight to your inbox every Saturday.