Anonymizing Data for Privacy-Preserving Federated Learning

02/21/2020
by   Olivia Choudhury, et al.
0

Federated learning enables training a global machine learning model from data distributed across multiple sites, without having to move the data. This is particularly relevant in healthcare applications, where data is rife with personal, highly-sensitive information, and data analysis methods must provably comply with regulatory guidelines. Although federated learning prevents sharing raw data, it is still possible to launch privacy attacks on the model parameters that are exposed during the training process, or on the generated machine learning model. In this paper, we propose the first syntactic approach for offering privacy in the context of federated learning. Unlike the state-of-the-art differential privacy-based frameworks, our approach aims to maximize utility or model performance, while supporting a defensible level of privacy, as demanded by GDPR and HIPAA. We perform a comprehensive empirical evaluation on two important problems in the healthcare domain, using real-world electronic health data of 1 million patients. The results demonstrate the effectiveness of our approach in achieving high model performance, while offering the desired level of privacy. Through comparative studies, we also show that, for varying datasets, experimental setups, and privacy budgets, our approach offers higher model performance than differential privacy-based techniques in federated learning.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/07/2019

Differential Privacy-enabled Federated Learning for Sensitive Health Data

Leveraging real-world health data for machine learning tasks requires ad...
research
10/04/2019

Confederated Machine Learning on Horizontally and Vertically Separated Medical Data for Large-Scale Health System Intelligence

A patient's health information is generally fragmented across silos. Tho...
research
06/23/2020

Security and Privacy Preserving Deep Learning

Commercial companies that collect user data on a large scale have been t...
research
08/30/2021

Private Multi-Task Learning: Formulation and Applications to Federated Learning

Many problems in machine learning rely on multi-task learning (MTL), in ...
research
02/08/2023

Exploratory Analysis of Federated Learning Methods with Differential Privacy on MIMIC-III

Background: Federated learning methods offer the possibility of training...
research
10/02/2019

Privacy-preserving Federated Brain Tumour Segmentation

Due to medical data privacy regulations, it is often infeasible to colle...
research
09/21/2020

Federated Learning for Computational Pathology on Gigapixel Whole Slide Images

Deep Learning-based computational pathology algorithms have demonstrated...

Please sign up or login with your details

Forgot password? Click here to reset