Data-Efficient Vision Transformers for Multi-Label Disease Classification on Chest Radiographs

08/17/2022
by Finn Behrendt, et al.

Radiographs are a versatile diagnostic tool for the detection and assessment of pathologies, for treatment planning, and for navigation and localization in clinical interventions. However, their interpretation and assessment by radiologists can be tedious and error-prone. Thus, a wide variety of deep learning methods have been proposed to support radiologists in interpreting radiographs. Most of these approaches rely on convolutional neural networks (CNNs) to extract features from images. CNNs have proven especially well suited to the multi-label classification of pathologies on chest radiographs (chest X-rays, CXR). In contrast, Vision Transformers (ViTs) have not been applied to this task, despite their high classification performance on generic images and their interpretable local saliency maps, which could add value to clinical interventions. ViTs rely on patch-based self-attention rather than convolutions and, unlike CNNs, incorporate no prior knowledge of local connectivity. While this leads to increased capacity, ViTs typically require very large amounts of training data, which is a hurdle in the medical domain because collecting large medical data sets is costly. In this work, we systematically compare the classification performance of ViTs and CNNs for different data set sizes and evaluate more data-efficient ViT variants (DeiT). Our results show that ViTs and CNNs perform on par, with a small benefit for ViTs, while DeiTs outperform both if a reasonably large data set is available for training.
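As a rough illustration of the patch-based self-attention mentioned in the abstract, the following NumPy sketch splits a grayscale image into non-overlapping patches and applies one single-head self-attention step over the resulting tokens. It is not the authors' implementation: the function names, the 224x224 input, the 16x16 patch size, the 64-dimensional projection, and the random weights are all assumptions chosen for illustration, and a real ViT adds a learned patch embedding, position embeddings, multiple heads, and stacked layers.

```python
import numpy as np


def image_to_patches(image, patch_size):
    """Split an (H, W) image into flattened, non-overlapping square patches."""
    h, w = image.shape
    if h % patch_size or w % patch_size:
        raise ValueError("image sides must be divisible by patch_size")
    rows, cols = h // patch_size, w // patch_size
    # (rows, p, cols, p) -> (rows, cols, p, p) -> (rows * cols, p * p)
    patches = image.reshape(rows, patch_size, cols, patch_size)
    patches = patches.transpose(0, 2, 1, 3)
    return patches.reshape(rows * cols, patch_size * patch_size)


def self_attention(tokens, w_q, w_k, w_v):
    """Single-head scaled dot-product self-attention over patch tokens."""
    q, k, v = tokens @ w_q, tokens @ w_k, tokens @ w_v
    scores = q @ k.T / np.sqrt(k.shape[-1])
    # Subtract the row maximum for a numerically stable softmax.
    scores = scores - scores.max(axis=-1, keepdims=True)
    weights = np.exp(scores)
    weights = weights / weights.sum(axis=-1, keepdims=True)
    return weights @ v, weights


# Hypothetical input: a random array standing in for a chest radiograph.
rng = np.random.default_rng(0)
image = rng.random((224, 224))
tokens = image_to_patches(image, patch_size=16)

# Hypothetical, randomly initialised projection matrices (assumed dim = 64).
dim = 64
w_q, w_k, w_v = (
    rng.normal(scale=0.02, size=(tokens.shape[1], dim)) for _ in range(3)
)
output, attention = self_attention(tokens, w_q, w_k, w_v)
```

Each row of `attention` weights every patch against every other patch, including distant ones, which is one way to see why no local-connectivity prior is built in.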


research
03/03/2023

Attention-based Saliency Maps Improve Interpretability of Pneumothorax Classification

Purpose: To investigate chest radiograph (CXR) classification performanc...
research
10/23/2022

Delving into Masked Autoencoders for Multi-Label Thorax Disease Classification

Vision Transformer (ViT) has become one of the most popular neural archi...
research
11/12/2019

Exploiting Clinically Available Delineations for CNN-based Segmentation in Radiotherapy Treatment Planning

Convolutional neural networks (CNNs) have been widely and successfully u...
research
05/08/2022

Preservation of High Frequency Content for Deep Learning-Based Medical Image Classification

Chest radiographs are used for the diagnosis of multiple critical illnes...
research
11/25/2020

Using Radiomics as Prior Knowledge for Abnormality Classification and Localization in Chest X-rays

Chest X-rays become one of the most common medical diagnoses due to its ...
research
09/29/2020

Trustworthy Convolutional Neural Networks: A Gradient Penalized-based Approach

Convolutional neural networks (CNNs) are commonly used for image classif...
research
07/16/2023

SHAMSUL: Simultaneous Heatmap-Analysis to investigate Medical Significance Utilizing Local interpretability methods

The interpretability of deep neural networks has become a subject of gre...
