Approximate Data Deletion in Generative Models

06/29/2022
by   Zhifeng Kong, et al.

Users have the right to have their data deleted by third-party learned systems, as codified by recent legislation such as the General Data Protection Regulation (GDPR) and the California Consumer Privacy Act (CCPA). Such data deletion can be accomplished by full re-training, but this incurs a high computational cost for modern machine learning models. To avoid this cost, many approximate data deletion methods have been developed for supervised learning. For unsupervised learning, in contrast, efficient data deletion (whether approximate or exact) remains largely an open problem. In this paper, we propose a density-ratio-based framework for generative models. Using this framework, we introduce a fast method for approximate data deletion and a statistical test for estimating whether training points have been deleted. We provide theoretical guarantees under various learner assumptions and empirically demonstrate our methods across a variety of generative models.
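To make the density-ratio idea concrete, here is a minimal Python sketch of one way such a deletion could work on a toy 1-D problem. It is an illustration under stated assumptions, not the procedure from the paper, which the abstract does not spell out. The assumptions are: the "generative model" is a simple kernel-density sampler, the density ratio between the retained data and the full data is estimated with histograms, and samples from the original model are rejection-sampled according to that ratio. All function names (`histogram_density_ratio`, `approx_delete_sample`, and so on) are hypothetical.

```python
import numpy as np

def make_sampler(train, bandwidth=0.2):
    """Toy generative model (assumption): resample training points with Gaussian jitter."""
    def sampler(n, rng):
        return rng.choice(train, size=n) + bandwidth * rng.normal(size=n)
    return sampler

def histogram_density_ratio(retained, full, edges, eps=1e-12):
    """Estimate rho = p_retained / p_full on fixed bins (a crude stand-in estimator)."""
    p_ret, _ = np.histogram(retained, bins=edges, density=True)
    p_full, _ = np.histogram(full, bins=edges, density=True)
    return p_ret / np.maximum(p_full, eps)

def lookup(ratio, edges, x):
    """Return the estimated ratio for each x, clipping out-of-range x to the end bins."""
    idx = np.clip(np.searchsorted(edges, x, side="right") - 1, 0, len(ratio) - 1)
    return ratio[idx]

def approx_delete_sample(sampler, ratio, edges, n, rng):
    """Rejection-sample the original model so accepted draws follow ratio * p_original."""
    bound = ratio.max()
    chunks, total = [], 0
    while total < n:
        x = sampler(n, rng)
        accepted = x[rng.random(n) * bound < lookup(ratio, edges, x)]
        chunks.append(accepted)
        total += len(accepted)
    return np.concatenate(chunks)[:n]

def demo(seed=0):
    rng = np.random.default_rng(seed)
    keep = rng.normal(0.0, 1.0, size=1000)    # data that stays
    forget = rng.normal(5.0, 0.3, size=200)   # data whose deletion is requested
    full = np.concatenate([keep, forget])
    edges = np.linspace(-5.0, 8.0, 27)        # bins of width 0.5

    original = make_sampler(full)             # model trained on all data
    ratio = histogram_density_ratio(keep, full, edges)
    before = original(2000, rng)
    after = approx_delete_sample(original, ratio, edges, 2000, rng)

    # Fraction of generated samples that land in the region of the deleted cluster.
    return float(np.mean(before > 3.5)), float(np.mean(after > 3.5))

frac_before, frac_after = demo()
```

In this sketch the original model is never retrained: only the ratio estimate and a sampling filter are computed, which is where the potential speed-up over full re-training would come from. Comparing `frac_before` with `frac_after` gives a rough check that the deleted cluster no longer shows up in generated samples; a proper statistical test for deletion, as the abstract describes, would need more than this heuristic.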

research
02/24/2020

Approximate Data Deletion from Machine Learning Models: Algorithms and Evaluations

Deleting data from a trained machine learning (ML) model is a critical t...
research
09/25/2022

Algorithms that Approximate Data Removal: New Results and Limitations

We study the problem of deleting user data from machine learning models ...
research
03/09/2020

Towards Probabilistic Verification of Machine Unlearning

Right to be forgotten, also known as the right to erasure, is the right ...
research
10/17/2022

Forget Unlearning: Towards True Data-Deletion in Machine Learning

Unlearning has emerged as a technique to efficiently erase information o...
research
06/26/2020

DeltaGrad: Rapid retraining of machine learning models

Machine learning models are not static and may need to be retrained on s...
research
06/08/2021

Adaptive Machine Unlearning

Data deletion algorithms aim to remove the influence of deleted data poi...
research
09/17/2021

Hard to Forget: Poisoning Attacks on Certified Machine Unlearning

The right to erasure requires removal of a user's information from data ...
