Robustness Stress Testing in Medical Image Classification

08/14/2023
by   Mobarakol Islam, et al.
0

Deep neural networks have shown impressive performance for image-based disease detection. Performance is commonly evaluated through clinical validation on independent test sets to demonstrate clinically acceptable accuracy. Reporting good performance metrics on test sets, however, is not always a sufficient indication of the generalizability and robustness of an algorithm. In particular, when the test data is drawn from the same distribution as the training data, the iid test set performance can be an unreliable estimate of the accuracy on new data. In this paper, we employ stress testing to assess model robustness and subgroup performance disparities in disease detection models. We design progressive stress testing using five different bidirectional and unidirectional image perturbations with six different severity levels. As a use case, we apply stress tests to measure the robustness of disease detection models for chest X-ray and skin lesion images, and demonstrate the importance of studying class and domain-specific model behaviour. Our experiments indicate that some models may yield more robust and equitable performance than others. We also find that pretraining characteristics play an important role in downstream robustness. We conclude that progressive stress testing is a viable and important tool and should become standard practice in the clinical validation of image-based disease detection models.

READ FULL TEXT
research
04/15/2021

Out-of-Distribution Detection for Dermoscopic Image Classification

Medical image diagnosis can be achieved by deep neural networks, provide...
research
04/25/2022

Robust inference for non-destructive one-shot device testing under step-stress model with exponential lifetimes

One-shot devices analysis involves an extreme case of interval censoring...
research
04/04/2022

Feature robustness and sex differences in medical imaging: a case study in MRI-based Alzheimer's disease detection

Convolutional neural networks have enabled significant improvements in m...
research
04/23/2020

Local Adaptation Improves Accuracy of Deep Learning Model for Automated X-Ray Thoracic Disease Detection : A Thai Study

Despite much promising research in the area of artificial intelligence f...
research
07/02/2022

Test-time Adaptation with Calibration of Medical Image Classification Nets for Label Distribution Shift

Class distribution plays an important role in learning deep classifiers....
research
04/08/2020

The Adaptive Stress Testing Formulation

Validation is a key challenge in the search for safe autonomy. Simulatio...
research
12/28/2022

Anxolotl, an Anxiety Companion App – Stress Detection

Stress has a great effect on people's lives that can not be understated....

Please sign up or login with your details

Forgot password? Click here to reset