Federated Forest

05/24/2019
by   Yang Liu, et al.
0

Most real-world data are scattered across different companies or government organizations, and cannot be easily integrated under data privacy and related regulations such as the European Union's General Data Protection Regulation (GDPR) and China' Cyber Security Law. Such data islands situation and data privacy & security are two major challenges for applications of artificial intelligence. In this paper, we tackle these challenges and propose a privacy-preserving machine learning model, called Federated Forest, which is a lossless learning model of the traditional random forest method, i.e., achieving the same level of accuracy as the non-privacy-preserving approach. Based on it, we developed a secure cross-regional machine learning system that allows a learning process to be jointly trained over different regions' clients with the same user samples but different attribute sets, processing the data stored in each of them without exchanging their raw data. A novel prediction algorithm was also proposed which could largely reduce the communication overhead. Experiments on both real-world and UCI data sets demonstrate the performance of the Federated Forest is as accurate as the non-federated version. The efficiency and robustness of our proposed system had been verified. Overall, our model is practical, scalable and extensible for real-life tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2020

Federated Extra-Trees with Privacy Preserving

It is commonly observed that the data are scattered everywhere and diffi...
research
01/25/2019

SecureBoost: A Lossless Federated Learning Framework

The protection of user privacy is an important concern in machine learni...
research
01/26/2022

An Efficient and Robust System for Vertically Federated Random Forest

As there is a growing interest in utilizing data across multiple resourc...
research
12/30/2019

Quantifying the Performance of Federated Transfer Learning

The scarcity of data and isolated data islands encourage different organ...
research
05/14/2023

Privacy-Preserving Taxi-Demand Prediction Using Federated Learning

Taxi-demand prediction is an important application of machine learning t...
research
09/05/2020

FLFE: A Communication-Efficient and Privacy-Preserving Federated Feature Engineering Framework

Feature engineering is the process of using domain knowledge to extract ...
research
08/20/2023

Federated Statistical Analysis: Non-parametric Testing and Quantile Estimation

The age of big data has fueled expectations for accelerating learning. T...

Please sign up or login with your details

Forgot password? Click here to reset