Maintaining Stability and Plasticity for Predictive Churn Reduction

05/06/2023
by   George Adam, et al.
0

Deployed machine learning models should be updated to take advantage of a larger sample size to improve performance, as more data is gathered over time. Unfortunately, even when model updates improve aggregate metrics such as accuracy, they can lead to errors on samples that were correctly predicted by the previous model causing per-sample regression in performance known as predictive churn. Such prediction flips erode user trust thereby reducing the effectiveness of the human-AI team as a whole. We propose a solution called Accumulated Model Combination (AMC) based keeping the previous and current model version, and generating a meta-output using the prediction of the two models. AMC is a general technique and we propose several instances of it, each having their own advantages depending on the model and data properties. AMC requires minimal additional computation and changes to training procedures. We motivate the need for AMC by showing the difficulty of making a single model consistent with its own predictions throughout training thereby revealing an implicit stability-plasticity tradeoff when training a single model. We demonstrate the effectiveness of AMC on a variety of modalities including computer vision, text, and tabular datasets comparing against state-of-the-art churn reduction methods, and showing superior churn reduction ability compared to all existing methods while being more efficient than ensembles.

READ FULL TEXT

page 8

page 16

page 17

page 21

page 22

page 23

research
08/14/2020

How little data do we need for patient-level prediction?

Objective: Provide guidance on sample size considerations for developing...
research
06/04/2019

A Case for Backward Compatibility for Human-AI Teams

AI systems are being deployed to support human decision making in high-s...
research
11/03/2009

Feature-Weighted Linear Stacking

Ensemble methods, such as stacking, are designed to boost predictive acc...
research
01/18/2023

Performance-Preserving Event Log Sampling for Predictive Monitoring

Predictive process monitoring is a subfield of process mining that aims ...
research
06/25/2019

A comparison of apartment rent price prediction using a large dataset: Kriging versus DNN

The hedonic approach based on a regression model has been widely adopted...
research
01/26/2021

Better sampling in explanation methods can prevent dieselgate-like deception

Machine learning models are used in many sensitive areas where besides p...

Please sign up or login with your details

Forgot password? Click here to reset