Multi-scale Attention U-Net (MsAUNet): A Modified U-Net Architecture for Scene Segmentation

09/15/2020
by Soham Chattopadhyay, et al.

Despite the growing success of convolutional neural networks (CNNs) in scene segmentation in recent years, the standard models lack some important features, which can result in sub-optimal segmentation outputs. The widely used encoder-decoder architecture extracts and uses several redundant, low-level features at different steps and scales. These networks also fail to map the long-range dependencies of local features, which weakens the discriminative feature maps corresponding to each semantic class in the resulting segmented image. In this paper, we propose a novel multi-scale attention network for scene segmentation that uses the rich contextual information in an image. Unlike the original U-Net architecture, we use attention gates that take the encoder features and the output of the pyramid pool as input; the gate output is then concatenated with the up-sampled output of the previous pyramid-pool layer and mapped to the next layer. This network can map local features to their global counterparts with improved accuracy and emphasize discriminative image regions by focusing only on relevant local features. We also propose a compound loss function that optimizes the IoU loss and fuses Dice loss and weighted cross-entropy loss with it to reach an optimal solution at a faster convergence rate. We evaluated our model on two standard datasets, PascalVOC2012 and ADE20k, achieving a mean IoU of 79.88, and compare our results with widely known models to show the superiority of our model over them.
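As a rough illustration of the compound loss described in the abstract, the sketch below adds a soft IoU loss, a Dice loss, and a weighted cross-entropy loss over flattened pixels. The abstract does not give the exact formulation, so the equal-weight sum, the soft (probability-based) IoU and Dice definitions, the function name `compound_loss`, and the `class_weights` parameter are all assumptions made for illustration, not the authors' implementation.

```python
import numpy as np


def compound_loss(probs, labels, class_weights, eps=1e-7):
    """Hypothetical compound loss: soft IoU + Dice + weighted cross-entropy.

    probs:         (N, C) array of per-pixel class probabilities (e.g. softmax output)
    labels:        (N,) array of integer class ids
    class_weights: (C,) array of per-class weights for the cross-entropy term
    """
    n, c = probs.shape
    onehot = np.eye(c)[labels]                        # (N, C) one-hot targets

    # Per-class soft overlap statistics.
    inter = (probs * onehot).sum(axis=0)              # soft intersection
    total = probs.sum(axis=0) + onehot.sum(axis=0)    # |P| + |G|
    union = total - inter                             # soft union

    iou_loss = 1.0 - np.mean((inter + eps) / (union + eps))
    dice_loss = 1.0 - np.mean((2.0 * inter + eps) / (total + eps))

    # Weighted cross-entropy, normalized by the sum of the applied weights.
    p_true = probs[np.arange(n), labels]
    w = class_weights[labels]
    wce = -(w * np.log(p_true + eps)).sum() / w.sum()

    # Assumption: the three terms are simply added with equal weight.
    return iou_loss + dice_loss + wce


# Toy example: 4 pixels, 3 classes.
labels = np.array([0, 1, 1, 2])
weights = np.array([1.0, 1.0, 1.0])
perfect = np.eye(3)[labels]            # predictions that match the labels exactly
uniform = np.full((4, 3), 1.0 / 3.0)   # uninformative predictions

loss_perfect = compound_loss(perfect, labels, weights)
loss_uniform = compound_loss(uniform, labels, weights)
```

In this toy setup the perfect prediction yields a loss close to zero, while the uniform prediction is penalized by all three terms. In practice the relative weighting of the terms would be a tunable design choice.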


research
09/03/2020

Multi-Attention-Network for Semantic Segmentation of High-Resolution Remote Sensing Images

Semantic segmentation of remote sensing images plays an important role i...
research
06/07/2019

Multi-scale guided attention for medical image segmentation

Even though convolutional neural networks (CNNs) are driving progress in...
research
01/03/2018

Joint Optic Disc and Cup Segmentation Based on Multi-label Deep Network and Polar Transformation

Glaucoma is a chronic eye disease that leads to irreversible vision loss...
research
12/20/2020

MA-Unet: An improved version of Unet based on multi-scale and attention mechanism for medical image segmentation

Although convolutional neural networks (CNNs) are promoting the developm...
research
12/02/2020

CovSegNet: A Multi Encoder-Decoder Architecture for Improved Lesion Segmentation of COVID-19 Chest CT Scans

Automatic lung lesions segmentation of chest CT scans is considered a pi...
research
02/13/2019

Highly Efficient Follicular Segmentation in Thyroid Cytopathological Whole Slide Image

In this paper, we propose a novel method for highly efficient follicular...
