Benchmarking FedAvg and FedCurv for Image Classification Tasks

03/31/2023
by   Bruno Casella, et al.
0

Classic Machine Learning techniques require training on data available in a single data lake. However, aggregating data from different owners is not always convenient for different reasons, including security, privacy and secrecy. Data carry a value that might vanish when shared with others; the ability to avoid sharing the data enables industrial applications where security and privacy are of paramount importance, making it possible to train global models by implementing only local policies which can be run independently and even on air-gapped data centres. Federated Learning (FL) is a distributed machine learning approach which has emerged as an effective way to address privacy concerns by only sharing local AI models while keeping the data decentralized. Two critical challenges of Federated Learning are managing the heterogeneous systems in the same federated network and dealing with real data, which are often not independently and identically distributed (non-IID) among the clients. In this paper, we focus on the second problem, i.e., the problem of statistical heterogeneity of the data in the same federated network. In this setting, local models might be strayed far from the local optimum of the complete dataset, thus possibly hindering the convergence of the federated model. Several Federated Learning algorithms, such as FedAvg, FedProx and Federated Curvature (FedCurv), aiming at tackling the non-IID setting, have already been proposed. This work provides an empirical assessment of the behaviour of FedAvg and FedCurv in common non-IID scenarios. Results show that the number of epochs per round is an important hyper-parameter that, when tuned appropriately, can lead to significant performance gains while reducing the communication cost. As a side product of this work, we release the non-IID version of the datasets we used so to facilitate further comparisons from the FL community.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/26/2023

A Generalized Look at Federated Learning: Survey and Perspectives

Federated learning (FL) refers to a distributed machine learning framewo...
research
10/14/2019

Federated Learning for Coalition Operations

Machine Learning in coalition settings requires combining insights avail...
research
10/15/2021

FedSLD: Federated Learning with Shared Label Distribution for Medical Image Classification

Machine learning in medical research, by nature, needs careful attention...
research
05/05/2023

Data Station: Delegated, Trustworthy, and Auditable Computation to Enable Data-Sharing Consortia with a Data Escrow

Pooling and sharing data increases and distributes its value. But since ...
research
06/03/2022

Supernet Training for Federated Image Classification under System Heterogeneity

Efficient deployment of deep neural networks across many devices and res...
research
02/20/2023

Federated Gradient Matching Pursuit

Traditional machine learning techniques require centralizing all trainin...
research
03/19/2023

Experimenting with Normalization Layers in Federated Learning on non-IID scenarios

Training Deep Learning (DL) models require large, high-quality datasets,...

Please sign up or login with your details

Forgot password? Click here to reset