Heterogeneous Ensemble Knowledge Transfer for Training Large Models in Federated Learning

04/27/2022
by   Yae Jee Cho, et al.
0

Federated learning (FL) enables edge-devices to collaboratively learn a model without disclosing their private data to a central aggregating server. Most existing FL algorithms require models of identical architecture to be deployed across the clients and server, making it infeasible to train large models due to clients' limited system resources. In this work, we propose a novel ensemble knowledge transfer method named Fed-ET in which small models (different in architecture) are trained on clients, and used to train a larger model at the server. Unlike in conventional ensemble learning, in FL the ensemble can be trained on clients' highly heterogeneous data. Cognizant of this property, Fed-ET uses a weighted consensus distillation scheme with diversity regularization that efficiently extracts reliable consensus from the ensemble while improving generalization by exploiting the diversity within the ensemble. We show the generalization bound for the ensemble of weighted models trained on heterogeneous datasets that supports the intuition of Fed-ET. Our experiments on image and language tasks show that Fed-ET significantly outperforms other state-of-the-art FL algorithms with fewer communicated parameters, and is also robust against high data-heterogeneity.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2021

Personalized Federated Learning for Heterogeneous Clients with Clustered Knowledge Transfer

Personalized federated learning (FL) aims to train model(s) that can per...
research
07/21/2021

Fed-ensemble: Improving Generalization through Model Ensembling in Federated Learning

In this paper we propose Fed-ensemble: a simple approach that bringsmode...
research
11/20/2022

FedDCT: Federated Learning of Large Convolutional Neural Networks on Resource Constrained Devices using Divide and Co-Training

We introduce FedDCT, a novel distributed learning paradigm that enables ...
research
10/27/2022

Exploiting Features and Logits in Heterogeneous Federated Learning

Due to the rapid growth of IoT and artificial intelligence, deploying ne...
research
01/13/2023

Contrast with Major Classifier Vectors for Federated Medical Relation Extraction with Heterogeneous Label Distribution

Federated medical relation extraction enables multiple clients to train ...
research
01/20/2022

Federated Learning with Heterogeneous Architectures using Graph HyperNetworks

Standard Federated Learning (FL) techniques are limited to clients with ...
research
04/14/2022

Exploring the Distributed Knowledge Congruence in Proxy-data-free Federated Distillation

Federated learning (FL) is a distributed machine learning paradigm in wh...

Please sign up or login with your details

Forgot password? Click here to reset