Measuring the Effects of Non-Identical Data Distribution for Federated Visual Classification

09/13/2019
by Tzu Ming Harry Hsu, et al.

Federated Learning enables visual models to be trained in a privacy-preserving way using real-world data from mobile devices. Given their distributed nature, the statistics of the data across these devices are likely to differ significantly. In this work, we examine the effect such non-identical data distributions have on visual classification via Federated Learning. We propose a way to synthesize datasets with a continuous range of identicalness and provide performance measures for the Federated Averaging algorithm. We show that performance degrades as distributions differ more, and propose a mitigation strategy via server momentum. Experiments on CIFAR-10 demonstrate improved classification performance over a range of non-identicalness, with classification accuracy improved from 30.1 settings.
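The abstract names server momentum as the mitigation but does not spell out the update rule. The following is a minimal sketch of one common formulation, under stated assumptions: the server averages the client model differences weighted by client dataset size, accumulates them in a velocity vector, and applies that velocity to the global weights. The function names, the `beta` and `server_lr` parameters, and the flat-list weight representation are all hypothetical and chosen for illustration; they are not taken from the paper.

```python
from typing import List, Sequence, Tuple


def weighted_average(vectors: Sequence[Sequence[float]],
                     weights: Sequence[float]) -> List[float]:
    """Average the vectors elementwise, weighting each by its entry in weights."""
    total = float(sum(weights))
    dim = len(vectors[0])
    return [
        sum(w * vec[i] for vec, w in zip(vectors, weights)) / total
        for i in range(dim)
    ]


def server_momentum_step(global_weights: Sequence[float],
                         velocity: Sequence[float],
                         client_weights: Sequence[Sequence[float]],
                         client_sizes: Sequence[float],
                         beta: float = 0.9,
                         server_lr: float = 1.0) -> Tuple[List[float], List[float]]:
    """One server round (assumed formulation, not confirmed by the abstract).

    With beta = 0 and server_lr = 1 this reduces to plain weighted
    averaging of the client models.
    """
    # Each client's update, expressed as (global - local) so that it acts
    # like a gradient the server descends along.
    deltas = [
        [g - c for g, c in zip(global_weights, local)]
        for local in client_weights
    ]
    avg_delta = weighted_average(deltas, client_sizes)

    # Accumulate the averaged update into the velocity, then apply it.
    new_velocity = [beta * v + d for v, d in zip(velocity, avg_delta)]
    new_global = [g - server_lr * v for g, v in zip(global_weights, new_velocity)]
    return new_global, new_velocity


# Usage: two equally sized clients that disagree on both coordinates.
w, v = server_momentum_step([1.0, 2.0], [0.0, 0.0],
                            [[0.0, 2.0], [1.0, 0.0]], [1, 1], beta=0.5)
```

In this sketch the velocity carries information across rounds, so a round in which clients pull in conflicting directions is smoothed by the direction accumulated in earlier rounds.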

research
03/18/2020

Federated Visual Classification with Real-World Data Distribution

Federated Learning enables visual models to be trained on-device, bringi...
research
03/07/2020

Ternary Compression for Communication-Efficient Federated Learning

Learning over massive data stored in different locations is essential in...
research
08/01/2023

Data Collaboration Analysis applied to Compound Datasets and the Introduction of Projection data to Non-IID settings

Given the time and expense associated with bringing a drug to market, nu...
research
06/17/2020

FedCD: Improving Performance in non-IID Federated Learning

Federated learning has been widely applied to enable decentralized devic...
research
07/30/2019

A Federated Learning Approach for Mobile Packet Classification

In order to improve mobile data transparency, a number of network-based ...
research
03/20/2021

Demystifying the Effects of Non-Independence in Federated Learning

Federated Learning (FL) enables statistical models to be built on user-g...
research
09/28/2021

Federated Learning Algorithms for Generalized Mixed-effects Model (GLMM) on Horizontally Partitioned Data from Distributed Sources

Objectives: This paper develops two algorithms to achieve federated gene...
