Data Selection for Federated Learning with Relevant and Irrelevant Data at Clients

01/22/2020
by   Tiffany Tuor, et al.
0

Federated learning is an effective way of training a machine learning model from data collected by client devices. A challenge is that among the large variety of data collected at each client, it is likely that only a subset is relevant for a learning task while the rest of data has a negative impact on model training. Therefore, before starting the learning process, it is important to select the subset of data that is relevant to the given federated learning task. In this paper, we propose a method for distributedly selecting relevant data, where we use a benchmark model trained on a small benchmark dataset that is task-specific, to evaluate the relevance of individual data samples at each client and select the data with sufficiently high relevance. Then, each client only uses the selected subset of its data in the federated learning process. The effectiveness of our proposed approach is evaluated on multiple real-world datasets in a simulated system with a large number of clients, showing up to 25% improvement in model accuracy compared to training with all data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/13/2023

FilFL: Accelerating Federated Learning via Client Filtering

Federated learning is an emerging machine learning paradigm that enables...
research
04/14/2022

Learning Task-Aware Energy Disaggregation: a Federated Approach

We consider the problem of learning the energy disaggregation signals fo...
research
12/05/2022

Unexpectedly Useful: Convergence Bounds And Real-World Distributed Learning

Convergence bounds are one of the main tools to obtain information on th...
research
10/23/2021

Game of Gradients: Mitigating Irrelevant Clients in Federated Learning

The paradigm of Federated learning (FL) deals with multiple clients part...
research
05/03/2022

FedRN: Exploiting k-Reliable Neighbors Towards Robust Federated Learning

Robustness is becoming another important challenge of federated learning...
research
07/14/2021

IFedAvg: Interpretable Data-Interoperability for Federated Learning

Recently, the ever-growing demand for privacy-oriented machine learning ...
research
08/14/2018

Mitigating Sybils in Federated Learning Poisoning

Machine learning (ML) over distributed data is relevant to a variety of ...

Please sign up or login with your details

Forgot password? Click here to reset