ConvFormer: Combining CNN and Transformer for Medical Image Segmentation

11/15/2022
by   Pengfei Gu, et al.

Convolutional neural network (CNN) based methods have achieved great success in medical image segmentation, but their ability to learn global representations remains limited because convolution operations have small effective receptive fields. Transformer-based methods can model long-range dependencies to capture global representations, yet their ability to model local context is lacking. Integrating CNNs and Transformers to learn both local and global representations while exploring multi-scale features is instrumental in further improving medical image segmentation. In this paper, we propose a hierarchical CNN and Transformer hybrid architecture, called ConvFormer, for medical image segmentation. ConvFormer is based on several simple yet effective designs. (1) The feed-forward module of the Deformable Transformer (DeTrans) is redesigned to introduce local information; we call the result Enhanced DeTrans. (2) A residual-shaped hybrid stem that combines convolutions with Enhanced DeTrans is developed to capture both local and global representations, enhancing representation ability. (3) Our encoder applies the residual-shaped hybrid stem hierarchically to generate feature maps at different scales, and an additional Enhanced DeTrans encoder with residual connections takes these feature maps as input to exploit multi-scale features. Experiments on several datasets show that ConvFormer, trained from scratch, outperforms various CNN- or Transformer-based architectures, achieving state-of-the-art performance.

research
09/15/2021

MISSFormer: An Effective Medical Image Segmentation Transformer

The CNN-based methods have achieved impressive results in medical image ...
research
02/18/2023

Hyneter: Hybrid Network Transformer for Object Detection

In this paper, we point out that the essential differences between CNN-b...
research
03/09/2022

PHTrans: Parallelly Aggregating Global and Local Representations for Medical Image Segmentation

The success of Transformer in computer vision has attracted increasing a...
research
12/19/2022

Focal-UNet: UNet-like Focal Modulation for Medical Image Segmentation

Recently, many attempts have been made to construct a transformer base U...
research
10/11/2022

UGformer for Robust Left Atrium and Scar Segmentation Across Scanners

Thanks to the capacity for long-range dependencies and robustness to irr...
research
04/14/2022

3D Shuffle-Mixer: An Efficient Context-Aware Vision Learner of Transformer-MLP Paradigm for Dense Prediction in Medical Volume

Dense prediction in medical volume provides enriched guidance for clinic...
research
07/17/2022

Defect Transformer: An Efficient Hybrid Transformer Architecture for Surface Defect Detection

Surface defect detection is an extremely crucial step to ensure the qual...
