On the Necessity of Auditable Algorithmic Definitions for Machine Unlearning

10/22/2021
by   Anvith Thudi, et al.
0

Machine unlearning, i.e. having a model forget about some of its training data, has become increasingly more important as privacy legislation promotes variants of the right-to-be-forgotten. In the context of deep learning, approaches for machine unlearning are broadly categorized into two classes: exact unlearning methods, where an entity has formally removed the data point's impact on the model by retraining the model from scratch, and approximate unlearning, where an entity approximates the model parameters one would obtain by exact unlearning to save on compute costs. In this paper we first show that the definition that underlies approximate unlearning, which seeks to prove the approximately unlearned model is close to an exactly retrained model, is incorrect because one can obtain the same model using different datasets. Thus one could unlearn without modifying the model at all. We then turn to exact unlearning approaches and ask how to verify their claims of unlearning. Our results show that even for a given training trajectory one cannot formally prove the absence of certain data points used during training. We thus conclude that unlearning is only well-defined at the algorithmic level, where an entity's only possible auditable claim to unlearning is that they used a particular algorithm designed to allow for external scrutiny during an audit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/21/2018

Looking for Bird Nests: Identifying Stay Points with Bounded Gaps

A stay point of a moving entity is a region in which it spends a signifi...
research
05/10/2018

Inference Attacks Against Collaborative Learning

Collaborative machine learning and related techniques such as distribute...
research
09/27/2021

Unrolling SGD: Understanding Factors Influencing Machine Unlearning

Machine unlearning is the process through which a deployed machine learn...
research
01/07/2019

Approximate Discontinuous Trajectory Hotspots

A hotspot is an axis-aligned square of fixed side length s, the duration...
research
07/07/2022

Challenges and Pitfalls of Bayesian Unlearning

Machine unlearning refers to the task of removing a subset of training d...
research
10/19/2022

Attaining Class-level Forgetting in Pretrained Model using Few Samples

In order to address real-world problems, deep learning models are jointl...

Please sign up or login with your details

Forgot password? Click here to reset