CheXpert: A Large Chest Radiograph Dataset with Uncertainty Labels and Expert Comparison

by Jeremy Irvin, et al.

Large, labeled datasets have driven deep learning methods to achieve expert-level performance on a variety of medical imaging tasks. We present CheXpert, a large dataset that contains 224,316 chest radiographs of 65,240 patients. We design a labeler to automatically detect the presence of 14 observations in radiology reports, capturing uncertainties inherent in radiograph interpretation. We investigate different approaches to using the uncertainty labels for training convolutional neural networks that output the probability of these observations given the available frontal and lateral radiographs. On a validation set of 200 chest radiographic studies which were manually annotated by 3 board-certified radiologists, we find that different uncertainty approaches are useful for different pathologies. We then evaluate our best model on a test set composed of 500 chest radiographic studies annotated by a consensus of 5 board-certified radiologists, and compare the performance of our model to that of 3 additional radiologists in the detection of 5 selected pathologies. On Cardiomegaly, Edema, and Pleural Effusion, the model ROC and PR curves lie above all 3 radiologist operating points. We release the dataset to the public as a standard benchmark to evaluate performance of chest radiograph interpretation models. The dataset is freely available at .
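To make the idea of "different approaches to using the uncertainty labels" concrete, here is a minimal sketch of three simple policies: masking uncertain labels out of the loss, mapping them to negative, or mapping them to positive. It is an illustration under stated assumptions, not the authors' implementation. The encoding (1.0 positive, 0.0 negative, -1.0 uncertain, None for an unmentioned observation), the policy names "ignore", "zeros", and "ones", the choice to treat unmentioned observations as negative, and the helper names are all hypothetical choices made for this example.

```python
import math
from typing import List, Optional, Tuple

UNCERTAIN = -1.0  # assumed encoding for an uncertain mention in a report
POLICIES = ("ignore", "zeros", "ones")  # illustrative policy names


def apply_uncertainty_policy(
    labels: List[Optional[float]], policy: str
) -> Tuple[List[float], List[float]]:
    """Turn raw labels into (targets, mask) for a masked binary loss.

    mask is 0.0 where a label should not contribute to the loss.
    """
    if policy not in POLICIES:
        raise ValueError(f"unknown policy: {policy!r}")
    targets: List[float] = []
    mask: List[float] = []
    for y in labels:
        if y is None:
            # Assumption: an unmentioned observation is treated as negative.
            targets.append(0.0)
            mask.append(1.0)
        elif y == UNCERTAIN:
            if policy == "ignore":
                targets.append(0.0)  # placeholder value, masked out below
                mask.append(0.0)
            else:
                targets.append(1.0 if policy == "ones" else 0.0)
                mask.append(1.0)
        else:
            targets.append(float(y))
            mask.append(1.0)
    return targets, mask


def masked_bce(probs: List[float], targets: List[float], mask: List[float]) -> float:
    """Mean binary cross-entropy over the unmasked entries only."""
    eps = 1e-7
    total, count = 0.0, 0.0
    for p, t, m in zip(probs, targets, mask):
        p = min(max(p, eps), 1.0 - eps)  # clamp to avoid log(0)
        total += m * -(t * math.log(p) + (1.0 - t) * math.log(1.0 - p))
        count += m
    return total / count if count else 0.0


if __name__ == "__main__":
    raw = [1.0, 0.0, UNCERTAIN, None]
    for policy in POLICIES:
        print(policy, apply_uncertainty_policy(raw, policy))
```

Because the policy only changes the targets and the mask, the same network and loss can be reused while the handling of uncertain labels is varied per observation, which fits the abstract's finding that different approaches suit different pathologies.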




VinDr-PCXR: An open, large-scale chest radiograph dataset for interpretation of common thoracic diseases in children

Computer-aided diagnosis systems in adult chest radiography (CXR) have r...

MURA Dataset: Towards Radiologist-Level Abnormality Detection in Musculoskeletal Radiographs

We introduce MURA, a large dataset of musculoskeletal radiographs contai...

Cardiologist-Level Arrhythmia Detection with Convolutional Neural Networks

We develop an algorithm which exceeds the performance of board certified...

On the use of Mahalanobis distance for out-of-distribution detection with neural networks for medical imaging

Implementing neural networks for clinical use in medical applications ne...

Interpretation of Mammogram and Chest X-Ray Reports Using Deep Neural Networks - Preliminary Results

Radiology reports are an important means of communication between radiol...

Learning to detect chest radiographs containing lung nodules using visual attention networks

Machine learning approaches hold great potential for the automated detec...

Interpreting Chest X-rays via CNNs that Exploit Hierarchical Disease Dependencies and Uncertainty Labels

The chest X-rays (CXRs) is one of the views most commonly ordered by rad...
