Algorithms that Approximate Data Removal: New Results and Limitations

09/25/2022
by   Vinith M. Suriyakumar, et al.
0

We study the problem of deleting user data from machine learning models trained using empirical risk minimization. Our focus is on learning algorithms which return the empirical risk minimizer and approximate unlearning algorithms that comply with deletion requests that come streaming minibatches. Leveraging the infintesimal jacknife, we develop an online unlearning algorithm that is both computationally and memory efficient. Unlike prior memory efficient unlearning algorithms, we target models that minimize objectives with non-smooth regularizers, such as the commonly used ℓ_1, elastic net, or nuclear norm penalties. We also provide generalization, deletion capacity, and unlearning guarantees that are consistent with state of the art methods. Across a variety of benchmark datasets, our algorithm empirically improves upon the runtime of prior methods while maintaining the same memory requirements and test accuracy. Finally, we open a new direction of inquiry by proving that all approximate unlearning algorithms introduced so far fail to unlearn in problem settings where common hyperparameter tuning methods, such as cross-validation, have been used to select models.

READ FULL TEXT
research
06/29/2022

Approximate Data Deletion in Generative Models

Users have the right to have their data deleted by third-party learned s...
research
02/24/2020

Approximate Data Deletion from Machine Learning Models: Algorithms and Evaluations

Deleting data from a trained machine learning (ML) model is a critical t...
research
03/02/2020

Approximate Cross-validation: Guarantees for Model Assessment and Selection

Cross-validation (CV) is a popular approach for assessing and selecting ...
research
03/05/2023

Iterative Approximate Cross-Validation

Cross-validation (CV) is one of the most popular tools for assessing and...
research
05/21/2023

Random Relabeling for Efficient Machine Unlearning

Learning algorithms and data are the driving forces for machine learning...
research
06/08/2021

Adaptive Machine Unlearning

Data deletion algorithms aim to remove the influence of deleted data poi...

Please sign up or login with your details

Forgot password? Click here to reset