Reinventing 2D Convolutions for 3D Medical Images

11/24/2019
by   Jiancheng Yang, et al.
0

There has been considerable debate over 2D and 3D representation learning on 3D medical images. 2D approaches could benefit from large-scale 2D pretraining, whereas they are generally weak in capturing large 3D contexts. 3D approaches are natively strong in 3D contexts, however few publicly available 3D medical dataset is large and diverse enough for universal 3D pretraining. Even for hybrid (2D + 3D) approaches, the intrinsic disadvantages within the 2D / 3D parts still exist. In this study, we bridge the gap between 2D and 3D convolutions by reinventing the 2D convolutions. We propose ACS (axial-coronal-sagittal) convolutions to perform natively 3D representation learning, while utilizing the pretrained weights from 2D counterparts. In ACS convolutions, 2D convolution kernels are split by channel into three parts, and convoluted separately on the three views (axial, coronal and sagittal) of 3D representations. Theoretically, ANY 2D CNN (ResNet, DenseNet, or DeepLab) is able to be converted into a 3D ACS CNN, with pretrained weights of same parameter sizes. Extensive experiments on proof-of-concept dataset and several medical benchmarks validate the consistent superiority of the pretrained ACS CNNs, over the 2D / 3D CNN counterparts with / without pretraining. Even without pretraining, the ACS convolution can be used as a plug-and-play replacement of standard 3D convolution, with smaller model size.

READ FULL TEXT
research
07/15/2020

Comparing to Learn: Surpassing ImageNet Pretraining on Radiographs By Comparing Image Representations

In deep learning era, pretrained models play an important role in medica...
research
04/02/2023

Video Pretraining Advances 3D Deep Learning on Chest CT Tasks

Pretraining on large natural image classification datasets such as Image...
research
11/25/2020

Effective Sample Pair Generation for Ultrasound Video Contrastive Representation Learning

Most deep neural networks (DNNs) based ultrasound (US) medical image ana...
research
09/17/2021

Asymmetric 3D Context Fusion for Universal Lesion Detection

Modeling 3D context is essential for high-performance 3D medical image a...
research
05/05/2022

Segmentation with Super Images: A New 2D Perspective on 3D Medical Image Analysis

Deep learning is showing an increasing number of audience in medical ima...
research
10/23/2019

Speech-XLNet: Unsupervised Acoustic Model Pretraining For Self-Attention Networks

Self-attention network (SAN) can benefit significantly from the bi-direc...

Please sign up or login with your details

Forgot password? Click here to reset