INSIDE: Steering Spatial Attention with Non-Imaging Information in CNNs

08/21/2020
by Grzegorz Jacenkow, et al.

We consider the problem of integrating non-imaging information into segmentation networks to improve performance. Conditioning layers such as FiLM provide the means to selectively amplify or suppress the contribution of different feature maps in a linear fashion. However, spatial dependency is difficult to learn within a convolutional paradigm. In this paper, we propose a mechanism to allow for spatial localisation conditioned on non-imaging information, using a feature-wise attention mechanism comprising a differentiable parametrised function (e.g. Gaussian), prior to applying the feature-wise modulation. We name our method INstance modulation with SpatIal DEpendency (INSIDE). The conditioning information might comprise any factors that relate to spatial or spatio-temporal information such as lesion location, size, and cardiac cycle phase. Our method can be trained end-to-end and does not require additional supervision. We evaluate the method on two datasets: a new CLEVR-Seg dataset where we segment objects based on location, and the ACDC dataset conditioned on cardiac phase and slice location within the volume. Code and the CLEVR-Seg dataset are available at https://github.com/jacenkow/inside.
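
To make the mechanism in the abstract concrete, here is a minimal sketch of Gaussian spatial attention followed by a FiLM-style affine modulation, applied to each feature map independently. It is an illustration written for this summary, not the authors' implementation (see the linked repository for that). The function names, the normalisation of coordinates to [0, 1], the axis-aligned (separable) Gaussian, and the choice to pass the per-channel parameters in directly are all assumptions; in the method as described, those parameters would be predicted from the non-imaging information by a learned network.

```python
import math


def gaussian_attention(height, width, mu_x, mu_y, sigma_x, sigma_y):
    """Return a height x width attention map with values in (0, 1].

    Assumption of this sketch: pixel coordinates are normalised to [0, 1]
    along each axis, and the Gaussian is axis-aligned, so it factorises
    into a product of two 1-D Gaussians.
    """
    def coord(i, n):
        return i / (n - 1) if n > 1 else 0.0

    rows = [math.exp(-((coord(r, height) - mu_y) ** 2) / (2 * sigma_y ** 2))
            for r in range(height)]
    cols = [math.exp(-((coord(c, width) - mu_x) ** 2) / (2 * sigma_x ** 2))
            for c in range(width)]
    return [[ry * cx for cx in cols] for ry in rows]


def inside_modulate(feature_maps, params):
    """Apply spatial attention, then a FiLM-style scale and shift, per channel.

    feature_maps: list of 2-D lists (one per channel).
    params: list of dicts with hypothetical keys gamma, beta, mu_x, mu_y,
            sigma_x, sigma_y (one dict per channel).
    """
    out = []
    for fmap, p in zip(feature_maps, params):
        h, w = len(fmap), len(fmap[0])
        att = gaussian_attention(h, w, p["mu_x"], p["mu_y"],
                                 p["sigma_x"], p["sigma_y"])
        # Attention is applied first; the affine modulation follows.
        out.append([[p["gamma"] * fmap[r][c] * att[r][c] + p["beta"]
                     for c in range(w)] for r in range(h)])
    return out


# Example: one 5x5 channel of ones, attended around the image centre.
features = [[[1.0] * 5 for _ in range(5)]]
channel_params = [{"gamma": 2.0, "beta": 0.0,
                   "mu_x": 0.5, "mu_y": 0.5,
                   "sigma_x": 0.2, "sigma_y": 0.2}]
modulated = inside_modulate(features, channel_params)
```

In this example the response is kept at the centre of the map and suppressed towards the corners, which is the kind of location-dependent behaviour a purely channel-wise scale and shift cannot express. Because every operation is differentiable with respect to the Gaussian and affine parameters, the same computation in an autodiff framework could be trained end-to-end.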
