The HAM10000 Dataset: A Large Collection of Multi-Source Dermatoscopic Images of Common Pigmented Skin Lesions

03/28/2018
by   Philipp Tschandl, et al.
0

Training of neural networks for automated diagnosis of pigmented skin lesions is hampered by the small size and lack of diversity of available datasets of dermatoscopic images. We tackle this problem by releasing the HAM10000 ("Human Against Machine with 10000 training images") dataset. We collected dermatoscopic images from different populations acquired and stored by different modalities. Given this diversity we had to apply different acquisition and cleaning methods and developed semi-automatic workflows utilizing specifically trained neural networks. The final dataset consists of 11788 dermatoscopic images, of which 10010 will be released as a training set for academic machine learning purposes and will be publicly available through the ISIC archive. This benchmark dataset can be used for machine learning and for comparisons with human experts. Cases include a representative collection of all important diagnostic categories in the realm of pigmented lesions. More than 50 rest of the cases was either follow-up, expert consensus, or confirmation by in-vivo confocal microscopy.

READ FULL TEXT

page 3

page 4

page 11

08/06/2019

BCN20000: Dermoscopic Lesions in the Wild

This article summarizes the BCN20000 dataset, composed of 19424 dermosco...
10/29/2019

Estimating Skin Tone and Effects on Classification Performance in Dermatology Datasets

Recent advances in computer vision and deep learning have led to breakth...
05/02/2021

Detection and Longitudinal Tracking of Pigmented Skin Lesions in 3D Total-Body Skin Textured Meshes

We present an automated approach to detect and longitudinally track skin...
06/19/2020

Melanoma Diagnosis with Spatio-Temporal Feature Learning on Sequential Dermoscopic Images

Existing studies for automated melanoma diagnosis are based on single-ti...
05/20/2020

Risk of Training Diagnostic Algorithms on Data with Demographic Bias

One of the critical challenges in machine learning applications is to ha...
10/30/2010

Lesion Border Detection in Dermoscopy Images

Background: Dermoscopy is one of the major imaging modalities used in th...
10/14/2016

Deep Learning Ensembles for Melanoma Recognition in Dermoscopy Images

Melanoma is the deadliest form of skin cancer. While curable with early ...