Handling Data Heterogeneity with Generative Replay in Collaborative Learning for Medical Imaging

06/24/2021
by   Liangqiong Qu, et al.
0

Collaborative learning, which enables collaborative and decentralized training of deep neural networks at multiple institutions in a privacy-preserving manner, is rapidly emerging as a valuable technique in healthcare applications. However, its distributed nature often leads to significant heterogeneity in data distributions across institutions. Existing collaborative learning approaches generally do not account for the presence of heterogeneity in data among institutions, or only mildly skewed label distributions are studied. In this paper, we present a novel generative replay strategy to address the challenge of data heterogeneity in collaborative learning methods. Instead of directly training a model for task performance, we leverage recent image synthesis techniques to develop a novel dual model architecture: a primary model learns the desired task, and an auxiliary "generative replay model" either synthesizes images that closely resemble the input images or helps extract latent variables. The generative replay strategy is flexible to use, can either be incorporated into existing collaborative learning methods to improve their capability of handling data heterogeneity across institutions, or be used as a novel and individual collaborative learning framework (termed FedReplay) to reduce communication cost. Experimental results demonstrate the capability of the proposed method in handling heterogeneous data across institutions. On highly heterogeneous data partitions, our model achieves  4.88 a diabetic retinopathy classification dataset, and  49.8 absolution value on a Bone Age prediction dataset, respectively, compared to the state-of-the art collaborative learning methods.

READ FULL TEXT
research
07/18/2021

An Experimental Study of Data Heterogeneity in Federated Learning Methods for Medical Imaging

Federated learning enables multiple institutions to collaboratively trai...
research
08/19/2016

Large-scale Collaborative Imaging Genetics Studies of Risk Genetic Factors for Alzheimer's Disease Across Multiple Institutions

Genome-wide association studies (GWAS) offer new opportunities to identi...
research
07/04/2023

SelfFed: Self-supervised Federated Learning for Data Heterogeneity and Label Scarcity in IoMT

Self-supervised learning in federated learning paradigm has been gaining...
research
05/16/2023

Trustworthy Privacy-preserving Hierarchical Ensemble and Federated Learning in Healthcare 4.0 with Blockchain

The advancement of Internet and Communication Technologies (ICTs) has le...
research
05/14/2021

Privacy-Preserving Constrained Domain Generalization for Medical Image Classification

Deep neural networks (DNN) have demonstrated unprecedented success for m...
research
07/20/2020

Learning latent representations across multiple data domains using Lifelong VAEGAN

The problem of catastrophic forgetting occurs in deep learning models tr...

Please sign up or login with your details

Forgot password? Click here to reset