XOR Mixup: Privacy-Preserving Data Augmentation for One-Shot Federated Learning

by MyungJae Shin, et al.

User-generated data distributions are often imbalanced across devices and labels, hampering the performance of federated learning (FL). To remedy this non-independent and identically distributed (non-IID) data problem, in this work we develop a privacy-preserving XOR-based mixup data augmentation technique, coined XorMixup, and thereby propose a novel one-shot FL framework, termed XorMixFL. The core idea is to collect other devices' encoded data samples, which can be decoded only using each device's own data samples. The decoding provides synthetic-but-realistic samples until an IID dataset is induced, which is then used for model training. Both the encoding and decoding procedures rely on bit-wise XOR operations that intentionally distort raw samples, thereby preserving data privacy. Simulation results corroborate that XorMixFL achieves up to 17.6
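The encode/decode idea described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the function names (`xor_bytes`, `encode`, `decode`), the toy 4-byte "samples", and the choice of which sample acts as the mask are all assumptions made for illustration. It only shows the property the abstract relies on: XOR with the same mask recovers the original exactly, while XOR with a different but similar sample yields a distorted, synthetic one.

```python
def xor_bytes(a: bytes, b: bytes) -> bytes:
    """Bit-wise XOR of two equal-length byte strings."""
    if len(a) != len(b):
        raise ValueError("samples must have the same length")
    return bytes(x ^ y for x, y in zip(a, b))


def encode(sample: bytes, mask: bytes) -> bytes:
    """Hypothetical encoder: distort a raw sample by XOR-ing it with another sample."""
    return xor_bytes(sample, mask)


def decode(encoded: bytes, own_sample: bytes) -> bytes:
    """Hypothetical decoder: XOR the encoded sample with the receiver's own sample."""
    return xor_bytes(encoded, own_sample)


# Toy data (assumed): tiny 4-byte "images".
sender_sample = bytes([12, 200, 37, 90])    # raw sample the sender wants to keep private
sender_mask = bytes([5, 180, 33, 91])       # second raw sample held by the sender
receiver_sample = bytes([5, 181, 32, 91])   # receiver's own sample, similar to the mask

encoded = encode(sender_sample, sender_mask)

# XOR is its own inverse, so only the exact mask recovers the raw sample.
exact = decode(encoded, sender_mask)

# Decoding with a different sample gives a distorted, synthetic sample instead.
synthetic = decode(encoded, receiver_sample)

print(exact == sender_sample)      # the mask holder recovers the original
print(synthetic == sender_sample)  # the receiver does not
```

In this toy setting the receiver never sees `sender_sample` or `sender_mask` directly, only their XOR, and its decoded output differs from the raw sample wherever its own sample differs from the mask.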


FedMix: Approximation of Mixup under Mean Augmented Federated Learning

Federated learning (FL) allows edge devices to collectively learn a mode...

Federated Learning on Heterogeneous and Long-Tailed Data via Classifier Re-Training with Federated Features

Federated learning (FL) provides a privacy-preserving solution for distr...

Multi-hop Federated Private Data Augmentation with Sample Compression

On-device machine learning (ML) has brought about the accessibility to a...

Privacy Sensitive Speech Analysis Using Federated Learning to Assess Depression

Recent studies have used speech signals to assess depression. However, s...

Fed-TDA: Federated Tabular Data Augmentation on Non-IID Data

Non-independent and identically distributed (non-IID) data is a key chal...

Privacy-preserving Federated Bayesian Learning of a Generative Model for Imbalanced Classification of Clinical Data

In clinical research, the lack of events of interest often necessitates ...

Zero-Shot Federated Learning with New Classes for Audio Classification

Federated learning is an effective way of extracting insights from diffe...