The reliability of a deep learning model in clinical out-of-distribution MRI data: a multicohort study

11/01/2019
by Gustav Mårtensson, et al.

Deep learning (DL) methods have in recent years yielded impressive results in medical imaging, with the potential to function as a clinical aid to radiologists. However, DL models in medical imaging are often trained on public research cohorts with images acquired with a single scanner or with strict protocol harmonization, which is not representative of a clinical setting. The aim of this study was to investigate how well a DL model performs in unseen clinical data sets (collected with different scanners, protocols and disease populations) and whether more heterogeneous training data improves generalization. In total, 3117 MRI scans of brains from multiple dementia research cohorts and memory clinics, which had been visually rated by a neuroradiologist according to Scheltens' scale of medial temporal atrophy (MTA), were included in this study. By training multiple versions of a convolutional neural network on different subsets of this data to predict MTA ratings, we assessed the impact that including images from a wider distribution during training had on performance in external memory clinic data. Our results showed that our model generalized well to data sets acquired with protocols similar to those of the training data, but substantially worse to clinical cohorts with visibly different tissue contrasts in the images. This implies that future DL studies investigating performance in out-of-distribution (OOD) MRI data need to assess multiple external cohorts for reliable results. Further, including data from a wider range of scanners and protocols improved performance in OOD data, which suggests that more heterogeneous training data makes the model generalize better. To conclude, this is the most comprehensive study to date investigating the domain shift in deep learning on MRI data, and we advocate rigorous evaluation of DL models on clinical data before they are certified for deployment.
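The recommendation to assess multiple external cohorts separately can be illustrated with a minimal sketch. It is not the study's evaluation code: the cohort names, the toy ratings, the use of exact-match accuracy as the metric, and the helper names `per_cohort_accuracy` and `pooled_accuracy` are all assumptions made for illustration. It only shows how a single pooled score can hide a cohort on which the model performs poorly.

```python
from collections import defaultdict


def per_cohort_accuracy(records):
    """Return {cohort: fraction of scans where prediction == rating}."""
    hits = defaultdict(int)
    totals = defaultdict(int)
    for cohort, rating, prediction in records:
        totals[cohort] += 1
        hits[cohort] += int(rating == prediction)
    return {cohort: hits[cohort] / totals[cohort] for cohort in totals}


def pooled_accuracy(records):
    """Fraction of exact matches over all scans, ignoring cohort."""
    return sum(int(r == p) for _, r, p in records) / len(records)


# Made-up toy records (hypothetical, not data from the study):
# (cohort, radiologist MTA rating, model prediction)
toy_records = [
    ("clinic_a", 0, 0), ("clinic_a", 1, 1), ("clinic_a", 2, 2),
    ("clinic_a", 3, 3), ("clinic_a", 1, 1), ("clinic_a", 2, 2),
    ("clinic_b", 0, 1), ("clinic_b", 2, 1), ("clinic_b", 3, 3),
    ("clinic_b", 4, 2),
]

if __name__ == "__main__":
    print("pooled:", pooled_accuracy(toy_records))
    for cohort, acc in per_cohort_accuracy(toy_records).items():
        print(cohort, acc)
```

In this toy setup the pooled score looks moderate while the hypothetical `clinic_b` cohort is far below `clinic_a`, which is the kind of gap that only per-cohort reporting reveals.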

