On Pre-Training for Federated Learning

06/23/2022
by   Hong-You Chen, et al.
0

In most of the literature on federated learning (FL), neural networks are initialized with random weights. In this paper, we present an empirical study on the effect of pre-training on FL. Specifically, we aim to investigate if pre-training can alleviate the drastic accuracy drop when clients' decentralized data are non-IID. We focus on FedAvg, the fundamental and most widely used FL algorithm. We found that pre-training does largely close the gap between FedAvg and centralized learning under non-IID data, but this does not come from alleviating the well-known model drifting problem in FedAvg's local training. Instead, how pre-training helps FedAvg is by making FedAvg's global aggregation more stable. When pre-training using real data is not feasible for FL, we propose a novel approach to pre-train with synthetic data. On various image datasets (including one for segmentation), our approach with synthetic pre-training leads to a notable gain, essentially a critical step toward scaling up federated learning for real-world applications.

READ FULL TEXT

page 7

page 15

research
01/28/2023

CyclicFL: A Cyclic Model Pre-Training Approach to Efficient Federated Learning

Since random initial models in Federated Learning (FL) can easily result...
research
07/12/2023

FDAPT: Federated Domain-adaptive Pre-training for Language Models

Combining Domain-adaptive Pre-training (DAPT) with Federated Learning (F...
research
05/04/2023

Can Fair Federated Learning reduce the need for Personalisation?

Federated Learning (FL) enables training ML models on edge clients witho...
research
02/04/2021

FedAUX: Leveraging Unlabeled Auxiliary Data in Federated Learning

Federated Distillation (FD) is a popular novel algorithmic paradigm for ...
research
09/03/2023

FedFwd: Federated Learning without Backpropagation

In federated learning (FL), clients with limited resources can disrupt t...
research
05/22/2023

Federated Learning of Medical Concepts Embedding using BEHRT

Electronic Health Records (EHR) data contains medical records such as di...
research
06/30/2022

Where to Begin? Exploring the Impact of Pre-Training and Initialization in Federated Learning

An oft-cited challenge of federated learning is the presence of data het...

Please sign up or login with your details

Forgot password? Click here to reset