Online Forgetting Process for Linear Regression Models

12/03/2020
by Yuantong Li, et al.

Motivated by the EU's "Right To Be Forgotten" regulation, we initiate a study of statistical data deletion problems in which users' data are accessible only for a limited period of time. We formulate this setting as an online supervised learning task with a constant memory limit. We propose a deletion-aware algorithm, FIFD-OLS, for the low-dimensional case and observe a catastrophic rank swinging phenomenon caused by the data deletion operation, which leads to statistical inefficiency. As a remedy, we propose the FIFD-Adaptive Ridge algorithm with a novel online regularization scheme that effectively offsets the uncertainty from deletion. In theory, we provide cumulative regret upper bounds for both online forgetting algorithms. In experiments, we show that FIFD-Adaptive Ridge outperforms ridge regression with a fixed regularization level, and we hope this work sheds some light on more complex statistical models.
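To make the constant-memory setting concrete, here is a minimal Python sketch of a least-squares learner that keeps only the most recent observations and deletes the oldest one whenever a new one arrives. It is an illustration under stated assumptions, not the paper's FIFD-OLS or FIFD-Adaptive Ridge: the class name `WindowedRidge`, the helper `solve`, and the parameters `window` and `lam` are hypothetical, and `lam` is a fixed regularization level, not the adaptive scheme the abstract refers to.

```python
from collections import deque


def solve(A, b):
    """Solve A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for col in range(n):
        pivot = max(range(col, n), key=lambda r: abs(M[r][col]))
        if abs(M[pivot][col]) < 1e-12:
            raise ValueError("singular system (rank-deficient window)")
        M[col], M[pivot] = M[pivot], M[col]
        for r in range(col + 1, n):
            f = M[r][col] / M[col][col]
            for c in range(col, n + 1):
                M[r][c] -= f * M[col][c]
    x = [0.0] * n
    for i in range(n - 1, -1, -1):
        s = sum(M[i][j] * x[j] for j in range(i + 1, n))
        x[i] = (M[i][n] - s) / M[i][i]
    return x


class WindowedRidge:
    """Hypothetical sketch: least squares over the last `window` points only."""

    def __init__(self, dim, window, lam=0.0):
        self.dim = dim
        self.window = window      # constant memory limit (number of points kept)
        self.lam = lam            # fixed ridge level (assumption, not adaptive)
        self.buffer = deque()     # stored (x, y) pairs, oldest first
        self.gram = [[0.0] * dim for _ in range(dim)]  # sum of x x^T
        self.xty = [0.0] * dim                         # sum of x * y

    def _update(self, x, y, sign):
        # Add (sign=+1) or delete (sign=-1) one point's contribution.
        for i in range(self.dim):
            self.xty[i] += sign * x[i] * y
            for j in range(self.dim):
                self.gram[i][j] += sign * x[i] * x[j]

    def observe(self, x, y):
        # Delete the oldest point first once the memory limit is reached.
        if len(self.buffer) == self.window:
            old_x, old_y = self.buffer.popleft()
            self._update(old_x, old_y, -1.0)
        self.buffer.append((list(x), y))
        self._update(x, y, 1.0)

    def coefficients(self):
        A = [[self.gram[i][j] + (self.lam if i == j else 0.0)
              for j in range(self.dim)] for i in range(self.dim)]
        return solve(A, self.xty)

    def predict(self, x):
        return sum(c * xi for c, xi in zip(self.coefficients(), x))
```

With `lam=0.0` the estimate is plain least squares on the current window, so `coefficients()` raises `ValueError` whenever the retained points do not span all `dim` directions. A positive `lam` keeps the system solvable in that case, at the cost of shrinking the estimate.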

