Privacy-preserving Data Filtering in Federated Learning Using Influence Approximation

05/23/2022
by   Ljubomir Rokvic, et al.
0

Federated Learning by nature is susceptible to low-quality, corrupted, or even malicious data that can severely degrade the quality of the learned model. Traditional techniques for data valuation cannot be applied as the data is never revealed. We present a novel technique for filtering, and scoring data based on a practical influence approximation that can be implemented in a privacy-preserving manner. Each agent uses his own data to evaluate the influence of another agent's batch, and reports to the center an obfuscated score using differential privacy. Our technique allows for almost perfect (>92% recall) filtering of corrupted data in a variety of applications using real-data. Importantly, the accuracy does not degrade significantly, even under really strong privacy guarantees (ε≤ 1), especially under realistic percentages of mislabeled data (for 15% mislabeled data we only lose 10% in accuracy).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/30/2023

Vision Through the Veil: Differential Privacy in Federated Learning for Medical Image Classification

The proliferation of deep learning applications in healthcare calls for ...
research
02/19/2020

PrivacyFL: A simulator for privacy-preserving and secure federated learning

Federated learning is a technique that enables distributed clients to co...
research
11/10/2020

Compression Boosts Differentially Private Federated Learning

Federated Learning allows distributed entities to train a common model c...
research
04/29/2021

Privacy-Preserving Federated Learning on Partitioned Attributes

Real-world data is usually segmented by attributes and distributed acros...
research
03/01/2023

FedScore: A privacy-preserving framework for federated scoring system development

We propose FedScore, a privacy-preserving federated learning framework f...
research
10/18/2019

Privacy-preserving Federated Bayesian Learning of a Generative Model for Imbalanced Classification of Clinical Data

In clinical research, the lack of events of interest often necessitates ...
research
01/08/2022

Attacking Vertical Collaborative Learning System Using Adversarial Dominating Inputs

Vertical collaborative learning system also known as vertical federated ...

Please sign up or login with your details

Forgot password? Click here to reset