MaxViT-UNet: Multi-Axis Attention for Medical Image Segmentation

05/15/2023
by   Abdul Rehman, et al.

Convolutional neural networks (CNNs) have made significant strides in medical image analysis in recent years. However, the local nature of the convolution operator prevents CNNs from capturing global and long-range interactions. Recently, Transformers have gained popularity in the computer vision community, including in medical image segmentation. But the scalability issues of the self-attention mechanism and the lack of CNN-like inductive bias have limited their adoption. In this work, we present MaxViT-UNet, an encoder-decoder based hybrid vision transformer for medical image segmentation. The proposed hybrid decoder, also based on the MaxViT block, is designed to harness the power of convolution and self-attention at each decoding stage with minimal computational burden. The multi-axis self-attention in each decoder stage helps differentiate between object and background regions much more efficiently. The hybrid decoder block first fuses the lower-level features, upsampled via transpose convolution, with the skip-connection features coming from the hybrid encoder; the fused features are then refined using the multi-axis attention mechanism. The proposed decoder block is repeated multiple times to accurately segment the nuclei regions. Experimental results on the MoNuSeg dataset prove the effectiveness of the proposed technique. Our MaxViT-UNet outperformed the previous CNN-only (UNet) and Transformer-only (Swin-UNet) techniques by a large margin of 2.36
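
The abstract does not spell out how multi-axis attention groups pixels, so the following is only a minimal sketch, assuming the block/grid partitioning commonly described for MaxViT: local attention within non-overlapping windows, followed by sparse global attention over a strided grid. The function names `block_partition` and `grid_partition` are hypothetical and are not taken from the authors' code; the sketch only shows which pixel coordinates would attend to one another.

```python
def block_partition(height, width, p):
    """Group pixel coordinates into non-overlapping p x p windows.

    Assumption: models the local ("block") attention axis; every
    returned group is one window whose pixels attend to each other.
    """
    groups = {}
    for y in range(height):
        for x in range(width):
            groups.setdefault((y // p, x // p), []).append((y, x))
    return list(groups.values())


def grid_partition(height, width, g):
    """Group pixel coordinates into strided g x g grids.

    Assumption: models the sparse global ("grid") attention axis;
    pixels in a group are spaced height // g and width // g apart,
    so each group spans the whole feature map.
    """
    stride_y, stride_x = height // g, width // g
    groups = {}
    for y in range(height):
        for x in range(width):
            groups.setdefault((y % stride_y, x % stride_x), []).append((y, x))
    return list(groups.values())


if __name__ == "__main__":
    # On a 4 x 4 feature map, compare the first group of each axis.
    print(block_partition(4, 4, 2)[0])  # neighbouring pixels
    print(grid_partition(4, 4, 2)[0])   # pixels spread across the map
```

In both cases each group holds a fixed number of tokens regardless of image size, which is one way to read the abstract's claim of "minimal computational burden": attention cost grows with the number of groups rather than quadratically with the number of pixels.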

research
07/02/2021

UTNet: A Hybrid Transformer Architecture for Medical Image Segmentation

Transformer architecture has emerged to be successful in a number of nat...
research
10/12/2021

MEDUSA: Multi-scale Encoder-Decoder Self-Attention Deep Neural Network Architecture for Medical Image Analysis

Medical image analysis continues to hold interesting challenges given th...
research
09/02/2021

Studying the Effects of Self-Attention for Medical Image Analysis

When the trained physician interprets medical images, they understand th...
research
12/31/2021

CSformer: Bridging Convolution and Transformer for Compressive Sensing

Convolution neural networks (CNNs) have succeeded in compressive image s...
research
06/19/2023

SegT: A Novel Separated Edge-guidance Transformer Network for Polyp Segmentation

Accurate segmentation of colonoscopic polyps is considered a fundamental...
research
11/17/2022

Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation

Transformers have achieved remarkable success in medical image analysis ...
research
02/19/2023

MedViT: A Robust Vision Transformer for Generalized Medical Image Classification

Convolutional Neural Networks (CNNs) have advanced existing medical syst...
