Orchestra: Unsupervised Federated Learning via Globally Consistent Clustering

05/23/2022
by   Ekdeep Singh Lubana, et al.
0

Federated learning is generally used in tasks where labels are readily available (e.g., next word prediction). Relaxing this constraint requires design of unsupervised learning techniques that can support desirable properties for federated training: robustness to statistical/systems heterogeneity, scalability with number of participants, and communication efficiency. Prior work on this topic has focused on directly extending centralized self-supervised learning techniques, which are not designed to have the properties listed above. To address this situation, we propose Orchestra, a novel unsupervised federated learning technique that exploits the federation's hierarchy to orchestrate a distributed clustering task and enforce a globally consistent partitioning of clients' data into discriminable clusters. We show the algorithmic pipeline in Orchestra guarantees good generalization performance under a linear probe, allowing it to outperform alternative techniques in a broad range of conditions, including variation in heterogeneity, number of clients, participation ratio, and local epochs.

READ FULL TEXT

page 7

page 18

research
06/17/2022

Federated learning with incremental clustering for heterogeneous data

Federated learning enables different parties to collaboratively build a ...
research
08/16/2022

HetVis: A Visual Analysis Approach for Identifying Data Heterogeneity in Horizontal Federated Learning

Horizontal federated learning (HFL) enables distributed clients to train...
research
07/17/2022

Federated Self-Supervised Learning in Heterogeneous Settings: Limits of a Baseline Approach on HAR

Federated Learning is a new machine learning paradigm dealing with distr...
research
07/10/2022

FedSS: Federated Learning with Smart Selection of clients

Federated learning provides the ability to learn over heterogeneous user...
research
11/30/2022

Federated deep clustering with GAN-based data synthesis

Clustering has been extensively studied in centralized settings, but rel...
research
11/04/2022

Heterogeneity-aware Clustered Distributed Learning for Multi-source Data Analysis

In diverse fields ranging from finance to omics, it is increasingly comm...

Please sign up or login with your details

Forgot password? Click here to reset