The Lifecycle of a Statistical Model: Model Failure Detection, Identification, and Refitting

02/08/2022
by   Alnur Ali, et al.
0

The statistical machine learning community has demonstrated considerable resourcefulness over the years in developing highly expressive tools for estimation, prediction, and inference. The bedrock assumptions underlying these developments are that the data comes from a fixed population and displays little heterogeneity. But reality is significantly more complex: statistical models now routinely fail when released into real-world systems and scientific applications, where such assumptions rarely hold. Consequently, we pursue a different path in this paper vis-a-vis the well-worn trail of developing new methodology for estimation and prediction. In this paper, we develop tools and theory for detecting and identifying regions of the covariate space (subpopulations) where model performance has begun to degrade, and study intervening to fix these failures through refitting. We present empirical results with three real-world data sets – including a time series involving forecasting the incidence of COVID-19 – showing that our methodology generates interpretable results, is useful for tracking model performance, and can boost model performance through refitting. We complement these empirical results with theory proving that our methodology is minimax optimal for recovering anomalous subpopulations as well as refitting to improve accuracy in a structured normal means setting.

READ FULL TEXT
research
12/15/2020

Learning Prediction Intervals for Model Performance

Understanding model performance on unlabeled data is a fundamental chall...
research
09/03/2020

Statistical characterization and time-series modeling of seismic noise

Developing statistical models for seismic noise is an exercise of high v...
research
05/16/2020

Forecasting with sktime: Designing sktime's New Forecasting API and Applying It to Replicate and Extend the M4 Study

We present a new open-source framework for forecasting in Python. Our fr...
research
08/03/2016

Empirical Evaluation of Real World Tournaments

Computational Social Choice (ComSoc) is a rapidly developing field at th...
research
02/02/2020

Detecting Anomalous Time Series by GAMLSS-Akaike-Weights-Scoring

An extensible statistical framework for detecting anomalous time series ...
research
06/02/2020

Interpretable Meta-Measure for Model Performance

Measures for evaluation of model performance play an important role in M...
research
06/05/2020

Anomaly detection on streamed data

We introduce powerful but simple methodology for identifying anomalous o...

Please sign up or login with your details

Forgot password? Click here to reset