Sharing Models or Coresets: A Study based on Membership Inference Attack

07/06/2020
by   Hanlin Lu, et al.
0

Distributed machine learning generally aims at training a global model based on distributed data without collecting all the data to a centralized location, where two different approaches have been proposed: collecting and aggregating local models (federated learning) and collecting and training over representative data summaries (coreset). While each approach preserves data privacy to some extent thanks to not sharing the raw data, the exact extent of protection is unclear under sophisticated attacks that try to infer the raw data from the shared information. We present the first comparison between the two approaches in terms of target model accuracy, communication cost, and data privacy, where the last is measured by the accuracy of a state-of-the-art attack strategy called the membership inference attack. Our experiments quantify the accuracy-privacy-cost tradeoff of each approach, and reveal a nontrivial comparison that can be used to guide the design of model training processes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/15/2019

Reconciling Utility and Membership Privacy via Knowledge Distillation

Large capacity machine learning models are prone to membership inference...
research
03/27/2023

PADME-SoSci: A Platform for Analytics and Distributed Machine Learning for the Social Sciences

Data privacy and ownership are significant in social data science, raisi...
research
12/15/2022

White-box Inference Attacks against Centralized Machine Learning and Federated Learning

With the development of information science and technology, various indu...
research
05/06/2021

Membership Inference Attacks on Deep Regression Models for Neuroimaging

Ensuring the privacy of research participants is vital, even more so in ...
research
08/01/2023

Enhanced Security with Encrypted Vision Transformer in Federated Learning

Federated learning is a learning method for training models over multipl...
research
02/10/2022

PPA: Preference Profiling Attack Against Federated Learning

Federated learning (FL) trains a global model across a number of decentr...
research
11/09/2021

Data privacy protection in microscopic image analysis for material data mining

Recent progress in material data mining has been driven by high-capacity...

Please sign up or login with your details

Forgot password? Click here to reset