(De)Constructing Bias on Skin Lesion Datasets

04/18/2019
by   Alceu Bissoto, et al.
22

Melanoma is the deadliest form of skin cancer. Automated skin lesion analysis plays an important role for early detection. Nowadays, the ISIC Archive and the Atlas of Dermoscopy dataset are the most employed skin lesion sources to benchmark deep-learning based tools. However, all datasets contain biases, often unintentional, due to how they were acquired and annotated. Those biases distort the performance of machine-learning models, creating spurious correlations that the models can unfairly exploit, or, contrarily destroying cogent correlations that the models could learn. In this paper, we propose a set of experiments that reveal both types of biases, positive and negative, in existing skin lesion datasets. Our results show that models can correctly classify skin lesion images without clinically-meaningful information: disturbingly, the machine-learning model learned over images where no information about the lesion remains, presents an accuracy above the AI benchmark curated with dermatologists' performances. That strongly suggests spurious correlations guiding the models. We fed models with additional clinically meaningful information, which failed to improve the results even slightly, suggesting the destruction of cogent correlations. Our main findings raise awareness of the limitations of models trained and evaluated in small datasets such as the ones we evaluated, and may suggest future guidelines for models intended for real-world deployment.

READ FULL TEXT

page 4

page 6

page 7

page 8

research
10/04/2020

Improving Lesion Detection by exploring bias on Skin Lesion dataset

All datasets contain some biases, often unintentional, due to how they w...
research
04/23/2020

Debiasing Skin Lesion Datasets and Models? Not So Fast

Data-driven models are now deployed in a plethora of real-world applicat...
research
02/08/2019

Skin Lesion Synthesis with Generative Adversarial Networks

Skin cancer is by far the most common type of cancer. Early detection is...
research
08/10/2023

Test-Time Selection for Robust Skin Lesion Analysis

Skin lesion analysis models are biased by artifacts placed during image ...
research
08/20/2022

Artifact-Based Domain Generalization of Skin Lesion Models

Deep Learning failure cases are abundant, particularly in the medical ar...
research
10/05/2018

Model Cards for Model Reporting

Trained machine learning models are increasingly used to perform high-im...
research
06/13/2022

Revisiting the Shape-Bias of Deep Learning for Dermoscopic Skin Lesion Classification

It is generally believed that the human visual system is biased towards ...

Please sign up or login with your details

Forgot password? Click here to reset