The HAM10000 Dataset: A Large Collection of Multi-Source Dermatoscopic Images of Common Pigmented Skin Lesions

by Philipp Tschandl, et al.

Training of neural networks for automated diagnosis of pigmented skin lesions is hampered by the small size and lack of diversity of available datasets of dermatoscopic images. We tackle this problem by releasing the HAM10000 ("Human Against Machine with 10000 training images") dataset. We collected dermatoscopic images from different populations, acquired and stored by different modalities. Given this diversity, we had to apply different acquisition and cleaning methods and developed semi-automatic workflows utilizing specifically trained neural networks. The final dataset consists of 11788 dermatoscopic images, of which 10010 will be released as a training set for academic machine learning purposes and will be publicly available through the ISIC archive. This benchmark dataset can be used for machine learning and for comparisons with human experts. Cases include a representative collection of all important diagnostic categories in the realm of pigmented lesions. More than 50% of lesions have been confirmed by pathology, while the ground truth for the rest of the cases was either follow-up, expert consensus, or confirmation by in-vivo confocal microscopy.




BCN20000: Dermoscopic Lesions in the Wild

This article summarizes the BCN20000 dataset, composed of 19424 dermosco...

Estimating Skin Tone and Effects on Classification Performance in Dermatology Datasets

Recent advances in computer vision and deep learning have led to breakth...

Detection and Longitudinal Tracking of Pigmented Skin Lesions in 3D Total-Body Skin Textured Meshes

We present an automated approach to detect and longitudinally track skin...

Melanoma Diagnosis with Spatio-Temporal Feature Learning on Sequential Dermoscopic Images

Existing studies for automated melanoma diagnosis are based on single-ti...

Risk of Training Diagnostic Algorithms on Data with Demographic Bias

One of the critical challenges in machine learning applications is to ha...

Lesion Border Detection in Dermoscopy Images

Background: Dermoscopy is one of the major imaging modalities used in th...

Deep Learning Ensembles for Melanoma Recognition in Dermoscopy Images

Melanoma is the deadliest form of skin cancer. While curable with early ...