See More Than Once -- Kernel-Sharing Atrous Convolution for Semantic Segmentation

08/26/2019
by   Ye Huang, et al.
15

The state-of-the-art semantic segmentation solutions usually leverage different receptive fields via multiple parallel branches to handle objects with different sizes. However, employing separate kernels for individual branches degrades the generalization and representation abilities of the network, and the amount of parameters increases by the times of the number of branches. To tackle this problem, we propose a novel network structure namely Kernel-Sharing Atrous Convolution (KSAC), where branches of different receptive fields share the same kernel, i.e., let a single kernel `see' the input feature maps more than once with different receptive fields, to facilitate communication among branches and perform `feature augmentation' inside the network. Experiments conducted on the benchmark VOC 2012 dataset show that the proposed sharing strategy can not only boost network's generalization and representation abilities but also reduce the model complexity significantly. Specifically, when compared with DeepLabV3+ equipped with MobileNetv2 backbone, 33 Xception is used as the backbone, the mIOU is elevated from 83.34 with about 10M parameters saved. In addition, different from the widely used ASPP structure, our proposed KSAC is able to further improve the mIOU by taking benefit of wider context with larger atrous rates.

READ FULL TEXT

page 1

page 5

research
04/03/2018

Exploring Multi-Branch and High-Level Semantic Networks for Improving Pedestrian Detection

To better detect pedestrians of various scales, deep multi-scale methods...
research
12/21/2021

DRPN: Making CNN Dynamically Handle Scale Variation

Based on our observations of infrared targets, serious scale variation a...
research
03/08/2023

FCN+: Global Receptive Convolution Makes FCN Great Again

Fully convolutional network (FCN) is a seminal work for semantic segment...
research
03/15/2019

Selective Kernel Networks

In standard Convolutional Neural Networks (CNNs), the receptive fields o...
research
07/18/2020

Malleable 2.5D Convolution: Learning Receptive Fields along the Depth-axis for RGB-D Scene Parsing

Depth data provide geometric information that can bring progress in RGB-...
research
05/30/2022

Pooling Revisited: Your Receptive Field is Suboptimal

The size and shape of the receptive field determine how the network aggr...
research
04/02/2023

Progressive Random Convolutions for Single Domain Generalization

Single domain generalization aims to train a generalizable model with on...

Please sign up or login with your details

Forgot password? Click here to reset