Positive-Congruent Training: Towards Regression-Free Model Updates

11/18/2020
by   Sijie Yan, et al.

Reducing inconsistencies in the behavior of different versions of an AI system can be as important in practice as reducing its overall error. In image classification, sample-wise inconsistencies appear as "negative flips": a new model incorrectly predicts the output for a test sample that was correctly classified by the old (reference) model. Positive-congruent (PC) training aims to reduce the error rate and the number of negative flips at the same time. Unlike model distillation, it therefore maximizes congruency with the reference model only on that model's positive (correct) predictions. We propose a simple approach to PC training, Focal Distillation, which enforces congruence with the reference model by giving more weight to samples that it classified correctly. We also found that, if the reference model itself can be chosen as an ensemble of multiple deep neural networks, negative flips can be further reduced without affecting the new model's accuracy.
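To make the two ideas in the abstract concrete, here is a minimal sketch in plain Python. It counts negative flips between an old and a new model and builds per-sample weights that emphasize samples the old model classified correctly. The function names, the additive weighting form `alpha + beta * [old model correct]`, and the default values of `alpha` and `beta` are illustrative assumptions; the abstract only states that correctly classified samples receive more weight, so this should not be read as the authors' exact formulation.

```python
from typing import List, Sequence


def negative_flip_rate(
    labels: Sequence[int],
    old_preds: Sequence[int],
    new_preds: Sequence[int],
) -> float:
    """Fraction of samples the old model got right and the new model gets wrong."""
    if not (len(labels) == len(old_preds) == len(new_preds)):
        raise ValueError("labels and predictions must have the same length")
    if len(labels) == 0:
        raise ValueError("at least one sample is required")
    flips = sum(
        1
        for y, old, new in zip(labels, old_preds, new_preds)
        if old == y and new != y
    )
    return flips / len(labels)


def focal_weights(
    labels: Sequence[int],
    old_preds: Sequence[int],
    alpha: float = 1.0,  # assumed base weight applied to every sample
    beta: float = 5.0,   # assumed extra weight where the old model was correct
) -> List[float]:
    """Per-sample weights for a distillation term (hypothetical weighting form)."""
    return [alpha + (beta if old == y else 0.0) for y, old in zip(labels, old_preds)]


if __name__ == "__main__":
    labels = [0, 1, 2, 1, 0]
    old_preds = [0, 1, 1, 1, 2]  # old model: 3 of 5 correct
    new_preds = [0, 2, 2, 1, 0]  # new model: 4 of 5 correct, but sample 1 regressed

    print(negative_flip_rate(labels, old_preds, new_preds))  # 0.2
    print(focal_weights(labels, old_preds))  # [6.0, 6.0, 1.0, 6.0, 1.0]
```

In this toy data the new model has a lower error rate than the old one (1/5 versus 2/5) yet still introduces one negative flip, which is the situation PC training is meant to address. In a training loop, weights like these would multiply a per-sample distillation distance between the new and old model outputs.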

research
05/07/2021

Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates

Behavior of deep neural networks can be inconsistent between different v...
research
01/25/2023

Backward Compatibility During Data Updates by Weight Interpolation

Backward compatibility of model predictions is a desired property when u...
research
03/16/2022

Reducing Flipping Errors in Deep Neural Networks

Deep neural networks (DNNs) have been widely applied in various domains ...
research
01/24/2022

Hot-Refresh Model Upgrades with Regression-Alleviating Compatible Training in Image Retrieval

The task of hot-refresh model upgrades of image retrieval systems plays ...
research
05/14/2022

Practical Insights of Repairing Model Problems on Image Classification

Additional training of a deep learning model can cause negative effects ...
research
02/03/2023

Towards a responsible machine learning approach to identify forced labor in fisheries

Many fishing vessels use forced labor, but identifying vessels that enga...
research
02/07/2022

Measuring and Reducing Model Update Regression in Structured Prediction for NLP

Recent advance in deep learning has led to rapid adoption of machine lea...
