Evaluating the Fairness of Deep Learning Uncertainty Estimates in Medical Image Analysis

03/06/2023
by   Raghav Mehta, et al.
0

Although deep learning (DL) models have shown great success in many medical image analysis tasks, deployment of the resulting models into real clinical contexts requires: (1) that they exhibit robustness and fairness across different sub-populations, and (2) that the confidence in DL model predictions be accurately expressed in the form of uncertainties. Unfortunately, recent studies have indeed shown significant biases in DL models across demographic subgroups (e.g., race, sex, age) in the context of medical image analysis, indicating a lack of fairness in the models. Although several methods have been proposed in the ML literature to mitigate a lack of fairness in DL models, they focus entirely on the absolute performance between groups without considering their effect on uncertainty estimation. In this work, we present the first exploration of the effect of popular fairness models on overcoming biases across subgroups in medical image analysis in terms of bottom-line performance, and their effects on uncertainty quantification. We perform extensive experiments on three different clinically relevant tasks: (i) skin lesion classification, (ii) brain tumour segmentation, and (iii) Alzheimer's disease clinical score regression. Our results indicate that popular ML methods, such as data-balancing and distributionally robust optimization, succeed in mitigating fairness issues in terms of the model performances for some of the tasks. However, this can come at the cost of poor uncertainty estimates associated with the model predictions. This tradeoff must be mitigated if fairness models are to be adopted in medical image analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2022

Trustworthy clinical AI solutions: a unified review of uncertainty quantification in deep learning models for medical image analysis

The full acceptance of Deep Learning (DL) models in the clinical field i...
research
07/04/2023

Mitigating Calibration Bias Without Fixed Attribute Grouping for Improved Fairness in Medical Imaging Analysis

Trustworthy deployment of deep learning medical imaging models into real...
research
09/27/2022

A Survey of Fairness in Medical Image Analysis: Concepts, Algorithms, Evaluations, and Challenges

Fairness, a criterion focuses on evaluating algorithm performance on dif...
research
05/09/2023

Towards unraveling calibration biases in medical image analysis

In recent years the development of artificial intelligence (AI) systems ...
research
03/22/2023

Deployment of Image Analysis Algorithms under Prevalence Shifts

Domain gaps are among the most relevant roadblocks in the clinical trans...
research
08/08/2023

Learning Unbiased Image Segmentation: A Case Study with Plain Knee Radiographs

Automatic segmentation of knee bony anatomy is essential in orthopedics,...
research
08/14/2023

The Performance of Transferability Metrics does not Translate to Medical Tasks

Transfer learning boosts the performance of medical image analysis by en...

Please sign up or login with your details

Forgot password? Click here to reset