Mind the Performance Gap: Examining Dataset Shift During Prospective Validation

07/23/2021
by   Erkin Ötleş, et al.
0

Once integrated into clinical care, patient risk stratification models may perform worse compared to their retrospective performance. To date, it is widely accepted that performance will degrade over time due to changes in care processes and patient populations. However, the extent to which this occurs is poorly understood, in part because few researchers report prospective validation performance. In this study, we compare the 2020-2021 ('20-'21) prospective performance of a patient risk stratification model for predicting healthcare-associated infections to a 2019-2020 ('19-'20) retrospective validation of the same model. We define the difference in retrospective and prospective performance as the performance gap. We estimate how i) "temporal shift", i.e., changes in clinical workflows and patient populations, and ii) "infrastructure shift", i.e., changes in access, extraction and transformation of data, both contribute to the performance gap. Applied prospectively to 26,864 hospital encounters during a twelve-month period from July 2020 to June 2021, the model achieved an area under the receiver operating characteristic curve (AUROC) of 0.767 (95 score of 0.189 (95 slightly compared to '19-'20 retrospective performance, in which the model achieved an AUROC of 0.778 (95 (95 infrastructure shift and not temporal shift. So long as we continue to develop and validate models using data stored in large research data warehouses, we must consider differences in how and when data are accessed, measure how these differences may affect prospective performance, and work to mitigate those differences.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/05/2021

Predicting Antimicrobial Resistance in the Intensive Care Unit

Antimicrobial resistance (AMR) is a risk for patients and a burden for t...
research
06/09/2020

A Machine Learning Early Warning System: Multicenter Validation in Brazilian Hospitals

Early recognition of clinical deterioration is one of the main steps for...
research
08/17/2022

Extracting Medication Changes in Clinical Narratives using Pre-trained Language Models

An accurate and detailed account of patient medications, including medic...
research
11/19/2014

Designing Optimal Mortality Risk Prediction Scores that Preserve Clinical Knowledge

Many in-hospital mortality risk prediction scores dichotomize predictive...
research
05/08/2023

Large-Scale Study of Temporal Shift in Health Insurance Claims

Most machine learning models for predicting clinical outcomes are develo...
research
06/18/2019

Integrated Visualization of Patient Data

The efficient and timely access to patient data is essential for success...
research
05/05/2023

All models are local: time to replace external validation with recurrent local validation

External validation is often recommended to ensure the generalizability ...

Please sign up or login with your details

Forgot password? Click here to reset