ConvTransSeg: A Multi-resolution Convolution-Transformer Network for Medical Image Segmentation

10/13/2022
by   Zhendi Gong, et al.
0

Convolutional neural networks (CNNs) achieved the state-of-the-art performance in medical image segmentation due to their ability to extract highly complex feature representations. However, it is argued in recent studies that traditional CNNs lack the intelligence to capture long-term dependencies of different image regions. Following the success of applying Transformer models on natural language processing tasks, the medical image segmentation field has also witnessed growing interest in utilizing Transformers, due to their ability to capture long-range contextual information. However, unlike CNNs, Transformers lack the ability to learn local feature representations. Thus, to fully utilize the advantages of both CNNs and Transformers, we propose a hybrid encoder-decoder segmentation model (ConvTransSeg). It consists of a multi-layer CNN as the encoder for feature learning and the corresponding multi-level Transformer as the decoder for segmentation prediction. The encoder and decoder are interconnected in a multi-resolution manner. We compared our method with many other state-of-the-art hybrid CNN and Transformer segmentation models on binary and multiple class image segmentation tasks using several public medical image datasets, including skin lesion, polyp, cell and brain tissue. The experimental results show that our method achieves overall the best performance in terms of Dice coefficient and average symmetric surface distance measures with low model complexity and memory consumption. In contrast to most Transformer-based methods that we compared, our method does not require the use of pre-trained models to achieve similar or better performance. The code is freely available for research purposes on Github: (the link will be added upon acceptance).

READ FULL TEXT

page 5

page 10

research
07/18/2022

HiFormer: Hierarchical Multi-scale Representations Using Transformers for Medical Image Segmentation

Convolutional neural networks (CNNs) have been the consensus for medical...
research
08/14/2023

Large-kernel Attention for Efficient and Robust Brain Lesion Segmentation

Vision transformers are effective deep learning models for vision tasks,...
research
01/21/2022

Improving Across-Dataset Brain Tissue Segmentation Using Transformer

Brain tissue segmentation has demonstrated great utility in quantifying ...
research
08/21/2023

Switched auxiliary loss for robust training of transformer models for histopathological image segmentation

Functional tissue Units (FTUs) are cell population neighborhoods local t...
research
06/12/2023

AerialFormer: Multi-resolution Transformer for Aerial Image Segmentation

Aerial Image Segmentation is a top-down perspective semantic segmentatio...
research
02/24/2022

Factorizer: A Scalable Interpretable Approach to Context Modeling for Medical Image Segmentation

Convolutional Neural Networks (CNNs) with U-shaped architectures have do...
research
03/23/2023

A Permutable Hybrid Network for Volumetric Medical Image Segmentation

The advent of Vision Transformer (ViT) has brought substantial advanceme...

Please sign up or login with your details

Forgot password? Click here to reset