The Tradeoff Between Privacy and Accuracy in Anomaly Detection Using Federated XGBoost

07/16/2019
by   Mengwei Yang, et al.
0

Privacy has raised considerable concerns recently, especially with the advent of information explosion and numerous data mining techniques to explore the information inside large volumes of data. In this context, a new distributed learning paradigm termed federated learning becomes prominent recently to tackle the privacy issues in distributed learning, where only learning models will be transmitted from the distributed nodes to servers without revealing users' own data and hence protecting the privacy of users. In this paper, we propose a horizontal federated XGBoost algorithm to solve the federated anomaly detection problem, where the anomaly detection aims to identify abnormalities from extremely unbalanced datasets and can be considered as a special classification problem. Our proposed federated XGBoost algorithm incorporates data aggregation and sparse federated update processes to balance the tradeoff between privacy and learning performance. In particular, we introduce the virtual data sample by aggregating a group of users' data together at a single distributed node. We compute parameters based on these virtual data samples in the local nodes and aggregate the learning model in the central server. In the learning model upgrading process, we focus more on the wrongly classified data before in the virtual sample and hence to generate sparse learning model parameters. By carefully controlling the size of these groups of samples, we can achieve a tradeoff between privacy and learning performance. Our experimental results show the effectiveness of our proposed scheme by comparing with existing state-of-the-arts.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/16/2022

Federated Anomaly Detection over Distributed Data Streams

Sharing of telecommunication network data, for example, even at high agg...
research
01/09/2022

Meta-Generalization for Multiparty Privacy Learning to Identify Anomaly Multimedia Traffic in Graynet

Identifying anomaly multimedia traffic in cyberspace is a big challenge ...
research
11/04/2021

A Personalized Federated Learning Algorithm: an Application in Anomaly Detection

Federated Learning (FL) has recently emerged as a promising method that ...
research
02/04/2021

SAFELearning: Enable Backdoor Detectability In Federated Learning With Secure Aggregation

For model privacy, local model parameters in federated learning shall be...
research
10/06/2021

Two-Bit Aggregation for Communication Efficient and Differentially Private Federated Learning

In federated learning (FL), a machine learning model is trained on multi...
research
03/17/2023

Multi-Task Model Personalization for Federated Supervised SVM in Heterogeneous Networks

In this paper, we design an efficient distributed iterative learning met...
research
05/02/2023

Federated Neural Radiance Fields

The ability of neural radiance fields or NeRFs to conduct accurate 3D mo...

Please sign up or login with your details

Forgot password? Click here to reset