Confounding variables can degrade generalization performance of radiological deep learning models

07/02/2018
by   John R. Zech, et al.
0

Early results in using convolutional neural networks (CNNs) on x-rays to diagnose disease have been promising, but it has not yet been shown that models trained on x-rays from one hospital or one group of hospitals will work equally well at different hospitals. Before these tools are used for computer-aided diagnosis in real-world clinical settings, we must verify their ability to generalize across a variety of hospital systems. A cross-sectional design was used to train and evaluate pneumonia screening CNNs on 158,323 chest x-rays from NIH (n=112,120 from 30,805 patients), Mount Sinai (42,396 from 12,904 patients), and Indiana (n=3,807 from 3,683 patients). In 3 / 5 natural comparisons, performance on chest x-rays from outside hospitals was significantly lower than on held-out x-rays from the original hospital systems. CNNs were able to detect where an x-ray was acquired (hospital system, hospital department) with extremely high accuracy and calibrate predictions accordingly. The performance of CNNs in diagnosing diseases on x-rays may reflect not only their ability to identify disease-specific imaging findings on x-rays, but also their ability to exploit confounding information. Estimates of CNN performance based on test data from hospital systems used for model training may overstate their likely real-world performance.

READ FULL TEXT

page 5

page 10

research
01/13/2020

An Adversarial Approach for the Robust Classification of Pneumonia from Chest Radiographs

While deep learning has shown promise in the domain of disease classific...
research
05/05/2022

Multi-confound regression adversarial network for deep learning-based diagnosis on highly heterogenous clinical data

Automated disease detection in medical images using deep learning holds ...
research
11/08/2018

Deep Learning Predicts Hip Fracture using Confounding Patient and Healthcare Variables

Hip fractures are a leading cause of death and disability among older ad...
research
04/06/2021

A clinical validation of VinDr-CXR, an AI system for detecting abnormal chest radiographs

Computer-Aided Diagnosis (CAD) systems for chest radiographs using artif...
research
11/13/2022

Early Diagnosis of Chronic Obstructive Pulmonary Disease from Chest X-Rays using Transfer Learning and Fusion Strategies

Chronic obstructive pulmonary disease (COPD) is one of the most common c...
research
12/01/2021

MOMO – Deep Learning-driven classification of external DICOM studies for PACS archivation

Patients regularly continue assessment or treatment in other facilities ...

Please sign up or login with your details

Forgot password? Click here to reset