Improving Prediction Backward-Compatiblility in NLP Model Upgrade with Gated Fusion

02/04/2023
by   Yi-An Lai, et al.
0

When upgrading neural models to a newer version, new errors that were not encountered in the legacy version can be introduced, known as regression errors. This inconsistent behavior during model upgrade often outweighs the benefits of accuracy gain and hinders the adoption of new models. To mitigate regression errors from model upgrade, distillation and ensemble have proven to be viable solutions without significant compromise in performance. Despite the progress, these approaches attained an incremental reduction in regression which is still far from achieving backward-compatible model upgrade. In this work, we propose a novel method, Gated Fusion, that promotes backward compatibility via learning to mix predictions between old and new models. Empirical results on two distinct model upgrade scenarios show that our method reduces the number of regression errors by 62 strongest baseline by an average of 25

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/07/2022

Measuring and Reducing Model Update Regression in Structured Prediction for NLP

Recent advance in deep learning has led to rapid adoption of machine lea...
research
08/07/2021

Neighborhood Consensus Contrastive Learning for Backward-Compatible Representation

In object re-identification (ReID), the development of deep learning tec...
research
05/07/2021

Regression Bugs Are In Your Model! Measuring, Reducing and Analyzing Regressions In NLP Model Updates

Behavior of deep neural networks can be inconsistent between different v...
research
01/25/2023

Backward Compatibility During Data Updates by Weight Interpolation

Backward compatibility of model predictions is a desired property when u...
research
10/13/2022

Darwinian Model Upgrades: Model Evolving with Selective Compatibility

The traditional model upgrading paradigm for retrieval requires recomput...
research
06/07/2022

Learning Backward Compatible Embeddings

Embeddings, low-dimensional vector representation of objects, are fundam...
research
06/23/2022

Backward baselines: Is your model predicting the past?

When does a machine learning model predict the future of individuals and...

Please sign up or login with your details

Forgot password? Click here to reset