U-Net vs Transformer: Is U-Net Outdated in Medical Image Registration?

08/07/2022
by   Xi Jia, et al.
0

Due to their extreme long-range modeling capability, vision transformer-based networks have become increasingly popular in deformable image registration. We believe, however, that the receptive field of a 5-layer convolutional U-Net is sufficient to capture accurate deformations without needing long-range dependencies. The purpose of this study is therefore to investigate whether U-Net-based methods are outdated compared to modern transformer-based approaches when applied to medical image registration. For this, we propose a large kernel U-Net (LKU-Net) by embedding a parallel convolutional block to a vanilla U-Net in order to enhance the effective receptive field. On the public 3D IXI brain dataset for atlas-based registration, we show that the performance of the vanilla U-Net is already comparable with that of state-of-the-art transformer-based networks (such as TransMorph), and that the proposed LKU-Net outperforms TransMorph by using only 1.12 mult-adds operations. We further evaluate LKU-Net on a MICCAI Learn2Reg 2021 challenge dataset for inter-subject registration, our LKU-Net also outperforms TransMorph on this dataset and ranks first on the public leaderboard as of the submission of this work. With only modest modifications to the vanilla U-Net, we show that U-Net can outperform transformer-based architectures on inter-subject and atlas-based 3D medical image registration. Code is available at https://github.com/xi-jia/LKU-Net.

READ FULL TEXT
research
04/28/2022

Symmetric Transformer-based Network for Unsupervised Image Registration

Medical image registration is a fundamental and critical task in medical...
research
11/08/2021

Mixed Transformer U-Net For Medical Image Segmentation

Though U-Net has achieved tremendous success in medical image segmentati...
research
06/09/2023

ModeT: Learning Deformable Image Registration via Motion Decomposition Transformer

The Transformer structures have been widely used in computer vision and ...
research
07/06/2023

Fourier-Net+: Leveraging Band-Limited Representation for Efficient 3D Medical Image Registration

U-Net style networks are commonly utilized in unsupervised image registr...
research
04/13/2021

ViT-V-Net: Vision Transformer for Unsupervised Volumetric Medical Image Registration

In the last decade, convolutional neural networks (ConvNets) have domina...
research
06/15/2022

XMorpher: Full Transformer for Deformable Medical Image Registration via Cross Attention

An effective backbone network is important to deep learning-based Deform...
research
11/29/2022

Fourier-Net: Fast Image Registration with Band-limited Deformation

Unsupervised image registration commonly adopts U-Net style networks to ...

Please sign up or login with your details

Forgot password? Click here to reset