Privacy Risks of Explaining Machine Learning Models

06/29/2019
by   Reza Shokri, et al.
10

Can we trust black-box machine learning with its decisions? Can we trust algorithms to train machine learning models on sensitive data? Transparency and privacy are two fundamental elements of trust for adopting machine learning. In this paper, we investigate the relation between interpretability and privacy. In particular we analyze if an adversary can exploit transparent machine learning to infer sensitive information about its training set. To this end, we perform membership inference as well as reconstruction attacks on two popular classes of algorithms for explaining machine learning models: feature-based and record-based influence measures. We empirically show that an attacker, that only observes the feature-based explanations, has the same power as the state of the art membership inference attacks on model predictions. We also demonstrate that record-based explanations can be effectively exploited to reconstruct significant parts of the training set. Finally, our results indicate that minorities and special cases are more vulnerable to these type of attacks than majority groups.

READ FULL TEXT

page 14

page 15

page 16

research
10/18/2016

Membership Inference Attacks against Machine Learning Models

We quantitatively investigate how machine learning models leak informati...
research
06/28/2018

Towards Demystifying Membership Inference Attacks

Membership inference attacks seek to infer membership of individual trai...
research
02/07/2022

Deletion Inference, Reconstruction, and Compliance in Machine (Un)Learning

Privacy attacks on machine learning models aim to identify the data that...
research
09/05/2017

The Unintended Consequences of Overfitting: Training Data Inference Attacks

Machine learning algorithms that are applied to sensitive data pose a di...
research
01/26/2021

Better sampling in explanation methods can prevent dieselgate-like deception

Machine learning models are used in many sensitive areas where besides p...
research
12/06/2022

Achieving Transparency in Distributed Machine Learning with Explainable Data Collaboration

Transparency of Machine Learning models used for decision support in var...
research
06/28/2022

On the amplification of security and privacy risks by post-hoc explanations in machine learning models

A variety of explanation methods have been proposed in recent years to h...

Please sign up or login with your details

Forgot password? Click here to reset