Mitigating Data Heterogeneity in Federated Learning with Data Augmentation

06/20/2022
by   Artur Back de Luca, et al.
0

Federated Learning (FL) is a prominent framework that enables training a centralized model while securing user privacy by fusing local, decentralized models. In this setting, one major obstacle is data heterogeneity, i.e., each client having non-identically and independently distributed (non-IID) data. This is analogous to the context of Domain Generalization (DG), where each client can be treated as a different domain. However, while many approaches in DG tackle data heterogeneity from the algorithmic perspective, recent evidence suggests that data augmentation can induce equal or greater performance. Motivated by this connection, we present federated versions of popular DG algorithms, and show that by applying appropriate data augmentation, we can mitigate data heterogeneity in the federated setting, and obtain higher accuracy on unseen clients. Equipped with data augmentation, we can achieve state-of-the-art performance using even the most basic Federated Averaging algorithm, with much sparser communication.

READ FULL TEXT
research
04/27/2021

Towards Fair Federated Learning with Zero-Shot Data Augmentation

Federated learning has emerged as an important distributed learning para...
research
06/14/2023

A Simple Data Augmentation for Feature Distribution Skewed Federated Learning

Federated learning (FL) facilitates collaborative learning among multipl...
research
07/11/2023

Benchmarking Algorithms for Federated Domain Generalization

While prior domain generalization (DG) benchmarks consider train-test da...
research
07/01/2021

FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Federated learning (FL) allows edge devices to collectively learn a mode...
research
09/03/2023

A Comparative Evaluation of FedAvg and Per-FedAvg Algorithms for Dirichlet Distributed Heterogeneous Data

In this paper, we investigate Federated Learning (FL), a paradigm of mac...
research
11/22/2022

Fed-TDA: Federated Tabular Data Augmentation on Non-IID Data

Non-independent and identically distributed (non-IID) data is a key chal...
research
08/30/2023

Federated Two Stage Decoupling With Adaptive Personalization Layers

Federated learning has gained significant attention due to its groundbre...

Please sign up or login with your details

Forgot password? Click here to reset