The Prospect of Enhancing Large-Scale Heterogeneous Federated Learning with Transformers

08/07/2023
by   Yulan Gao, et al.
0

Federated learning (FL) addresses data privacy concerns by enabling collaborative training of AI models across distributed data owners. Wide adoption of FL faces the fundamental challenges of data heterogeneity and the large scale of data owners involved. In this paper, we investigate the prospect of Transformer-based FL models for achieving generalization and personalization in this setting. We conduct extensive comparative experiments involving FL with Transformers, ResNet, and personalized ResNet-based FL approaches under various scenarios. These experiments consider varying numbers of data owners to demonstrate Transformers' advantages over deep neural networks in large-scale heterogeneous FL tasks. In addition, we analyze the superior performance of Transformers by comparing the Centered Kernel Alignment (CKA) representation similarity across different layers and FL models to gain insight into the reasons behind their promising capabilities.

READ FULL TEXT

page 6

page 8

research
10/10/2022

A Survey on Heterogeneous Federated Learning

Federated learning (FL) has been proposed to protect data privacy and vi...
research
06/12/2020

Heterogeneity-Aware Federated Learning

Federated learning (FL) is an emerging distributed machine learning para...
research
05/29/2023

Deep Equilibrium Models Meet Federated Learning

In this study the problem of Federated Learning (FL) is explored under a...
research
09/16/2023

UNIDEAL: Curriculum Knowledge Distillation Federated Learning

Federated Learning (FL) has emerged as a promising approach to enable co...
research
08/10/2022

FedOBD: Opportunistic Block Dropout for Efficiently Training Large-scale Neural Networks through Federated Learning

Large-scale neural networks possess considerable expressive power. They ...
research
03/23/2023

Federated Learning on Heterogenous Data using Chest CT

Large data have accelerated advances in AI. While it is well known that ...
research
09/21/2023

Enabling Quartile-based Estimated-Mean Gradient Aggregation As Baseline for Federated Image Classifications

Federated Learning (FL) has revolutionized how we train deep neural netw...

Please sign up or login with your details

Forgot password? Click here to reset