PrIU: A Provenance-Based Approach for Incrementally Updating Regression Models

02/26/2020
by   Yinjun Wu, et al.
0

The ubiquitous use of machine learning algorithms brings new challenges to traditional database problems such as incremental view update. Much effort is being put in better understanding and debugging machine learning models, as well as in identifying and repairing errors in training datasets. Our focus is on how to assist these activities when they have to retrain the machine learning model after removing problematic training samples in cleaning or selecting different subsets of training data for interpretability. This paper presents an efficient provenance-based approach, PrIU, and its optimized version, PrIU-opt, for incrementally updating model parameters without sacrificing prediction accuracy. We prove the correctness and convergence of the incrementally updated model parameters, and validate it experimentally. Experimental results show that up to two orders of magnitude speed-ups can be achieved by PrIU-opt compared to simply retraining the model from scratch, yet obtaining highly similar models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/10/2019

Learning a Behavior Model of Hybrid Systems Through Combining Model-Based Testing and Machine Learning (Full Version)

Models play an essential role in the design process of cyber-physical sy...
research
02/17/2020

Subset Sampling For Progressive Neural Network Learning

Progressive Neural Network Learning is a class of algorithms that increm...
research
01/28/2023

Adversarial Learning Networks: Source-free Unsupervised Domain Incremental Learning

This work presents an approach for incrementally updating deep neural ne...
research
12/11/2022

ezDPS: An Efficient and Zero-Knowledge Machine Learning Inference Pipeline

Machine Learning as a service (MLaaS) permits resource-limited clients t...
research
09/11/2020

DART: Data Addition and Removal Trees

How can we update data for a machine learning model after it has already...
research
06/08/2023

A Gradient-based Approach for Online Robust Deep Neural Network Training with Noisy Labels

Learning with noisy labels is an important topic for scalable training i...
research
02/13/2023

Incremental Consistent Updating of Incomplete Databases

Efficient consistency maintenance of incomplete and dynamic real-life da...

Please sign up or login with your details

Forgot password? Click here to reset