Dynamic deformable attention (DDANet) for semantic segmentation

08/25/2020
by   Kumar Rajamani, et al.
0

Deep learning based medical image segmentation is an important step within diagnosis, which relies strongly on capturing sufficient spatial context without requiring too complex models that are hard to train with limited labelled data. Training data is in particular scarce for segmenting infection regions of CT images of COVID-19 patients. Attention models help gather contextual information within deep networks and benefit semantic segmentation tasks. The recent criss-cross-attention module aims to approximate global self-attention while remaining memory and time efficient by separating horizontal and vertical self-similarity computations. However, capturing attention from all non-local locations can adversely impact the accuracy of semantic segmentation networks. We propose a new Dynamic Deformable Attention Network (DDANet) that enables a more accurate contextual information computation in a similarly efficient way. Our novel technique is based on a deformable criss-cross attention block that learns both attention coefficients and attention offsets in a continuous way. A deep segmentation network (in our case a U-Net \cite{Jo2019}) that employs this attention mechanism is able to capture attention from pertinent non-local locations and also improves the performance on semantic segmentation tasks compared to criss-cross attention within a U-Net on a challenging COVID-19 lesion segmentation task. Our validation experiments show that the performance gain of the recursively applied dynamic deformable attention blocks comes from their ability to capture dynamic and precise (wider) attention context. Our DDANet achieves Dice scores of 73.4% and 61.3% for Ground-Glass-Opacity and Consolidation lesions for COVID-19 segmentation and improves the accuracy by 4.9% points compared to a baseline U-Net.

READ FULL TEXT

page 2

page 3

page 7

page 11

page 12

research
09/04/2021

Sparse Spatial Attention Network for Semantic Segmentation

The spatial attention mechanism captures long-range dependencies by aggr...
research
07/01/2019

Permutohedral Attention Module for Efficient Non-Local Neural Networks

Medical image processing tasks such as segmentation often require captur...
research
04/19/2019

Deep Q Learning Driven CT Pancreas Segmentation with Geometry-Aware U-Net

Segmentation of pancreas is important for medical image analysis, yet it...
research
08/31/2023

Self-supervised Semantic Segmentation: Consistency over Transformation

Accurate medical image segmentation is of utmost importance for enabling...
research
03/10/2021

AttaNet: Attention-Augmented Network for Fast and Accurate Scene Parsing

Two factors have proven to be very important to the performance of seman...
research
04/16/2020

Contextual Two-Stage U-Nets for Robust Pulmonary Lobe Segmentation in CT Scans of COVID-19 and COPD Patients

Pulmonary lobe segmentation in computed tomography scans is essential fo...
research
01/19/2021

Comparative Evaluation of 3D and 2D Deep Learning Techniques for Semantic Segmentation in CT Scans

Image segmentation plays a pivotal role in several medical-imaging appli...

Please sign up or login with your details

Forgot password? Click here to reset