Transformer-based end-to-end classification of variable-length volumetric data

07/13/2023
by   Marzieh Oghbaie, et al.
0

The automatic classification of 3D medical data is memory-intensive. Also, variations in the number of slices between samples is common. Naive solutions such as subsampling can solve these problems, but at the cost of potentially eliminating relevant diagnosis information. Transformers have shown promising performance for sequential data analysis. However, their application for long-sequences is data, computationally, and memory demanding. In this paper, we propose an end-to-end Transformer-based framework that allows to classify volumetric data of variable length in an efficient fashion. Particularly, by randomizing the input slice-wise resolution during training, we enhance the capacity of the learnable positional embedding assigned to each volume slice. Consequently, the accumulated positional information in each positional embedding can be generalized to the neighbouring slices, even for high resolution volumes at the test time. By doing so, the model will be more robust to variable volume length and amenable to different computational budgets. We evaluated the proposed approach in retinal OCT volume classification and achieved 21.96 diagnostic task, compared to state-of-the-art video transformers. Our findings show that varying the slice-wise resolution of the input during training results in more informative volume representation as compared to training with fixed number of slices per volume. Our code is available at: https://github.com/marziehoghbaie/VLFAT.

READ FULL TEXT
research
06/22/2022

SVoRT: Iterative Transformer for Slice-to-Volume Registration in Fetal Brain MRI

Volumetric reconstruction of fetal brains from multiple stacks of MR sli...
research
07/26/2020

Uniformizing Techniques to Process CT scans with 3D CNNs for Tuberculosis Prediction

A common approach to medical image analysis on volumetric data uses deep...
research
08/24/2023

IP-UNet: Intensity Projection UNet Architecture for 3D Medical Volume Segmentation

CNNs have been widely applied for medical image analysis. However, limit...
research
02/22/2021

Do We Really Need Explicit Position Encodings for Vision Transformers?

Almost all visual transformers such as ViT or DeiT rely on predefined po...
research
09/01/2023

On the Localization of Ultrasound Image Slices within Point Distribution Models

Thyroid disorders are most commonly diagnosed using high-resolution Ultr...
research
07/13/2023

Explainable 2D Vision Models for 3D Medical Data

Training Artificial Intelligence (AI) models on three-dimensional image ...
research
08/18/2023

A Lightweight Transformer for Faster and Robust EBSD Data Collection

Three dimensional electron back-scattered diffraction (EBSD) microscopy ...

Please sign up or login with your details

Forgot password? Click here to reset