CADIS: Handling Cluster-skewed Non-IID Data in Federated Learning with Clustered Aggregation and Knowledge DIStilled Regularization

02/21/2023
by   Nang Hung Nguyen, et al.
0

Federated learning enables edge devices to train a global model collaboratively without exposing their data. Despite achieving outstanding advantages in computing efficiency and privacy protection, federated learning faces a significant challenge when dealing with non-IID data, i.e., data generated by clients that are typically not independent and identically distributed. In this paper, we tackle a new type of Non-IID data, called cluster-skewed non-IID, discovered in actual data sets. The cluster-skewed non-IID is a phenomenon in which clients can be grouped into clusters with similar data distributions. By performing an in-depth analysis of the behavior of a classification model's penultimate layer, we introduce a metric that quantifies the similarity between two clients' data distributions without violating their privacy. We then propose an aggregation scheme that guarantees equality between clusters. In addition, we offer a novel local training regularization based on the knowledge-distillation technique that reduces the overfitting problem at clients and dramatically boosts the training scheme's performance. We theoretically prove the superiority of the proposed aggregation over the benchmark FedAvg. Extensive experimental results on both standard public datasets and our in-house real-world dataset demonstrate that the proposed approach improves accuracy by up to 16 algorithm.

READ FULL TEXT

page 2

page 3

research
08/04/2022

FedDRL: Deep Reinforcement Learning-based Adaptive Aggregation for Non-IID Data in Federated Learning

The uneven distribution of local data across different edge devices (cli...
research
04/14/2021

Towards Causal Federated Learning For Enhanced Robustness and Privacy

Federated Learning is an emerging privacy-preserving distributed machine...
research
05/02/2022

Performance Weighting for Robust Federated Learning Against Corrupted Sources

Federated Learning has emerged as a dominant computational paradigm for ...
research
08/20/2022

FLIS: Clustered Federated Learning via Inference Similarity for Non-IID Data Distribution

Classical federated learning approaches yield significant performance de...
research
06/25/2023

Private Aggregation in Wireless Federated Learning with Heterogeneous Clusters

Federated learning collaboratively trains a neural network on privately ...
research
11/04/2022

Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis

In diverse fields ranging from finance to omics, it is increasingly comm...
research
04/26/2021

Semi-Decentralized Federated Edge Learning for Fast Convergence on Non-IID Data

Federated edge learning (FEEL) has emerged as an effective alternative t...

Please sign up or login with your details

Forgot password? Click here to reset