How Can CNNs Use Image Position for Segmentation?

05/07/2020
by   Rito Murase, et al.
0

Convolution is an equivariant operation, and image position does not affect its result. A recent study shows that the zero-padding employed in convolutional layers of CNNs provides position information to the CNNs. The study further claims that the position information enables accurate inference for several tasks, such as object recognition, segmentation, etc. However, there is a technical issue with the design of the experiments of the study, and thus the correctness of the claim is yet to be verified. Moreover, the absolute image position may not be essential for the segmentation of natural images, in which target objects will appear at any image position. In this study, we investigate how positional information is and can be utilized for segmentation tasks. Toward this end, we consider positional encoding (PE) that adds channels embedding image position to the input images and compare PE with several padding methods. Considering the above nature of natural images, we choose medical image segmentation tasks, in which the absolute position appears to be relatively important, as the same organs (of different patients) are captured in similar sizes and positions. We draw a mixed conclusion from the experimental results; the positional encoding certainly works in some cases, but the absolute image position may not be so important for segmentation tasks as we think.

READ FULL TEXT
research
01/28/2021

Position, Padding and Predictions: A Deeper Look at Position Information in CNNs

In contrast to fully connected networks, Convolutional Neural Networks (...
research
10/13/2020

Exploring Efficient Volumetric Medical Image Segmentation Using 2.5D Method: An Empirical Study

With the unprecedented developments in deep learning, many methods are p...
research
03/16/2022

CapsNet for Medical Image Segmentation

Convolutional Neural Networks (CNNs) have been successful in solving tas...
research
10/23/2022

The Curious Case of Absolute Position Embeddings

Transformer language models encode the notion of word order using positi...
research
11/17/2022

Parameter-Efficient Transformer with Hybrid Axial-Attention for Medical Image Segmentation

Transformers have achieved remarkable success in medical image analysis ...
research
09/13/2021

SHAPE: Shifted Absolute Position Embedding for Transformers

Position representation is crucial for building position-aware represent...
research
07/02/2020

PGD-UNet: A Position-Guided Deformable Network for Simultaneous Segmentation of Organs and Tumors

Precise segmentation of organs and tumors plays a crucial role in clinic...

Please sign up or login with your details

Forgot password? Click here to reset