A Privacy-Preserving Federated Learning Approach for Kernel methods

06/05/2023
by   Anika Hannemann, et al.
0

It is challenging to implement Kernel methods, if the data sources are distributed and cannot be joined at a trusted third party for privacy reasons. It is even more challenging, if the use case rules out privacy-preserving approaches that introduce noise. An example for such a use case is machine learning on clinical data. To realize exact privacy preserving computation of kernel methods, we propose FLAKE, a Federated Learning Approach for KErnel methods on horizontally distributed data. With FLAKE, the data sources mask their data so that a centralized instance can compute a Gram matrix without compromising privacy. The Gram matrix allows to calculate many kernel matrices, which can be used to train kernel-based machine learning algorithms such as Support Vector Machines. We prove that FLAKE prevents an adversary from learning the input data or the number of input features under a semi-honest threat model. Experiments on clinical and synthetic data confirm that FLAKE is outperforming the accuracy and efficiency of comparable methods. The time needed to mask the data and to compute the Gram matrix is several orders of magnitude less than the time a Support Vector Machine needs to be trained. Thus, FLAKE can be applied to many use cases.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/13/2022

Towards Privacy-Aware Causal Structure Learning in Federated Setting

Causal structure learning has been extensively studied and widely used i...
research
02/07/2022

CECILIA: Comprehensive Secure Machine Learning Framework

Since machine learning algorithms have proven their success in data mini...
research
03/29/2021

Privacy and Trust Redefined in Federated Machine Learning

A common privacy issue in traditional machine learning is that data need...
research
10/25/2019

Substra: a framework for privacy-preserving, traceable and collaborative Machine Learning

Machine learning is promising, but it often needs to process vast amount...
research
03/17/2023

Multi-Task Model Personalization for Federated Supervised SVM in Heterogeneous Networks

In this paper, we design an efficient distributed iterative learning met...
research
12/29/2019

Privacy-Preserving Public Release of Datasets for Support Vector Machine Classification

We consider the problem of publicly releasing a dataset for support vect...
research
07/08/2019

Privacy-Preserving Classification with Secret Vector Machines

Today, large amounts of valuable data are distributed among millions of ...

Please sign up or login with your details

Forgot password? Click here to reset