Deployment of Image Analysis Algorithms under Prevalence Shifts

03/22/2023
by   Patrick Godau, et al.
2

Domain gaps are among the most relevant roadblocks in the clinical translation of machine learning (ML)-based solutions for medical image analysis. While current research focuses on new training paradigms and network architectures, little attention is given to the specific effect of prevalence shifts on an algorithm deployed in practice. Such discrepancies between class frequencies in the data used for a method's development/validation and that in its deployment environment(s) are of great importance, for example in the context of artificial intelligence (AI) democratization, as disease prevalences may vary widely across time and location. Our contribution is twofold. First, we empirically demonstrate the potentially severe consequences of missing prevalence handling by analyzing (i) the extent of miscalibration, (ii) the deviation of the decision threshold from the optimum, and (iii) the ability of validation metrics to reflect neural network performance on the deployment population as a function of the discrepancy between development and deployment prevalence. Second, we propose a workflow for prevalence-aware image classification that uses estimated deployment prevalences to adjust a trained classifier to a new environment, without requiring additional annotated deployment data. Comprehensive experiments based on a diverse set of 30 medical classification tasks showcase the benefit of the proposed workflow in generating better classifier decisions and more reliable performance estimates compared to current practice.

READ FULL TEXT

page 2

page 3

research
12/29/2022

Current State of Community-Driven Radiological AI Deployment in Medical Imaging

Artificial Intelligence (AI) has become commonplace to solve routine eve...
research
03/06/2023

Evaluating the Fairness of Deep Learning Uncertainty Estimates in Medical Image Analysis

Although deep learning (DL) models have shown great success in many medi...
research
07/02/2022

Test-time Adaptation with Calibration of Medical Image Classification Nets for Label Distribution Shift

Class distribution plays an important role in learning deep classifiers....
research
03/08/2023

Deep Hypothesis Tests Detect Clinically Relevant Subgroup Shifts in Medical Images

Distribution shifts remain a fundamental problem for the safe applicatio...
research
02/01/2023

Model Monitoring and Robustness of In-Use Machine Learning Models: Quantifying Data Distribution Shifts Using Population Stability Index

Safety goes first. Meeting and maintaining industry safety standards for...
research
11/19/2019

A Framework for Challenge Design: Insight and Deployment Challenges to Address Medical Image Analysis Problems

In this paper we aim to refine the concept of grand challenges in medica...
research
07/31/2021

Bayesian analysis of the prevalence bias: learning and predicting from imbalanced data

Datasets are rarely a realistic approximation of the target population. ...

Please sign up or login with your details

Forgot password? Click here to reset