Pyramid Medical Transformer for Medical Image Segmentation

04/29/2021
by   Zhuangzhuang Zhang, et al.
0

Deep neural networks have been a prevailing technique in the field of medical image processing. However, the most popular convolutional neural networks (CNNs) based methods for medical image segmentation are imperfect because they cannot adequately model long-range pixel relations. Transformers and the self-attention mechanism are recently proposed to effectively learn long-range dependencies by modeling all pairs of word-to-word attention regardless of their positions. The idea has also been extended to the computer vision field by creating and treating image patches as embeddings. Considering the computation complexity for whole image self-attention, current transformer-based models settle for a rigid partitioning scheme that would potentially lose informative relations. Besides, current medical transformers model global context on full resolution images, leading to unnecessary computation costs. To address these issues, we developed a novel method to integrate multi-scale attention and CNN feature extraction using a pyramidal network architecture, namely Pyramid Medical Transformer (PMTrans). The PMTrans captured multi-range relations by working on multi-resolution images. An adaptive partitioning scheme was implemented to retain informative relations and to access different receptive fields efficiently. Experimental results on two medical image datasets, gland segmentation and MoNuSeg datasets, showed that PMTrans outperformed the latest CNN-based and transformer-based models for medical image segmentation.

READ FULL TEXT
research
01/03/2022

D-Former: A U-shaped Dilated Transformer for 3D Medical Image Segmentation

Computer-aided medical image segmentation has been applied widely in dia...
research
08/07/2023

Improving FHB Screening in Wheat Breeding Using an Efficient Transformer Model

Fusarium head blight is a devastating disease that causes significant ec...
research
01/26/2022

Class-Aware Generative Adversarial Transformers for Medical Image Segmentation

Transformers have made remarkable progress towards modeling long-range d...
research
03/03/2023

PPCR: Learning Pyramid Pixel Context Recalibration Module for Medical Image Classification

Spatial attention mechanism has been widely incorporated into deep convo...
research
02/24/2022

Factorizer: A Scalable Interpretable Approach to Context Modeling for Medical Image Segmentation

Convolutional Neural Networks (CNNs) with U-shaped architectures have do...
research
05/21/2022

Transformer based Generative Adversarial Network for Liver Segmentation

Automated liver segmentation from radiology scans (CT, MRI) can improve ...
research
04/13/2021

ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration

In the last decade, convolutional neural networks (ConvNets) have domina...

Please sign up or login with your details

Forgot password? Click here to reset