A Holistically-Guided Decoder for Deep Representation Learning with Applications to Semantic Segmentation and Object Detection

12/18/2020
by Jianbo Liu, et al.

Both high-level and high-resolution feature representations are of great importance in various visual understanding tasks. To acquire high-resolution feature maps with high-level semantic information, one common strategy is to adopt dilated convolutions in the backbone network, as in dilatedFCN-based methods for semantic segmentation. However, because many convolution operations are conducted on the high-resolution feature maps, such methods have large computational complexity and memory consumption. In this paper, we propose a novel holistically-guided decoder that obtains high-resolution, semantically rich feature maps from the multi-scale features of the encoder. The decoding is achieved via novel holistic codeword generation and codeword assembly operations, which take advantage of both the high-level and low-level features from the encoder. With the proposed holistically-guided decoder, we implement the EfficientFCN architecture for semantic segmentation and HGD-FPN for object detection and instance segmentation. EfficientFCN achieves comparable or even better performance than state-of-the-art methods with only 1/3 of their computational cost for semantic segmentation on the PASCAL Context, PASCAL VOC, and ADE20K datasets. Meanwhile, the proposed HGD-FPN achieves >2% higher mean Average Precision (mAP) when integrated into several object detection frameworks with ResNet-50 encoding backbones.
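
The abstract names two decoding steps, holistic codeword generation and codeword assembly, without giving their exact form. The following is a minimal sketch of one way to read them, not the authors' implementation. It assumes that each codeword is a softmax-weighted average over all positions of a low-resolution feature map, and that each high-resolution pixel is rebuilt as a linear combination of those codewords. The function names, the softmax weighting, and the toy shapes are all hypothetical. In a real network the weight scores and assembly coefficients would be predicted from encoder features, whereas here they are passed in by hand.

```python
# Illustrative sketch only: names, shapes, and the softmax weighting are
# assumptions made for this example, not details taken from the paper.
import math


def softmax(scores):
    """Numerically stable softmax over a list of floats."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def generate_codewords(low_res_features, weight_scores):
    """Summarize a low-resolution feature map into n holistic codewords.

    low_res_features: P feature vectors (one per position), each of length C.
    weight_scores: n lists of P scores; each list is normalized over positions.
    Returns n codewords of length C, each a weighted average of all positions.
    """
    channels = len(low_res_features[0])
    codewords = []
    for scores in weight_scores:
        weights = softmax(scores)
        codewords.append([
            sum(w * feat[c] for w, feat in zip(weights, low_res_features))
            for c in range(channels)
        ])
    return codewords


def assemble(codewords, coefficients):
    """Rebuild a high-resolution map as per-pixel combinations of codewords.

    coefficients: H x W grid; each cell holds n assembly coefficients.
    Returns an H x W grid of feature vectors of length C.
    """
    channels = len(codewords[0])
    return [
        [
            [sum(a * cw[c] for a, cw in zip(cell, codewords))
             for c in range(channels)]
            for cell in row
        ]
        for row in coefficients
    ]


# Toy example: 2 low-res positions with C = 2 channels, n = 1 codeword,
# assembled onto a 2 x 2 high-resolution grid.
low_res = [[1.0, 0.0], [0.0, 1.0]]
codewords = generate_codewords(low_res, weight_scores=[[0.0, 0.0]])
high_res = assemble(codewords, [[[2.0], [0.0]], [[1.0], [-1.0]]])
```

In this reading, the expensive feature computation happens only on the low-resolution map, and the high-resolution output costs one small weighted sum per pixel. That matches the abstract's motivation of avoiding many convolutions on high-resolution feature maps.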

research
08/24/2020

EfficientFCN: Holistically-guided Decoding for Semantic Segmentation

Both performance and efficiency are important to semantic segmentation. ...
research
03/15/2023

HFGD: High-level Feature Guided Decoder for Semantic Segmentation

Commonly used backbones for semantic segmentation, such as ResNet and Sw...
research
12/26/2019

Colorectal Polyp Segmentation by U-Net with Dilation Convolution

Colorectal cancer (CRC) is one of the most commonly diagnosed cancers an...
research
11/02/2022

Semantic SuperPoint: A Deep Semantic Descriptor

Several SLAM methods benefit from the use of semantic information. Most ...
research
09/03/2019

HarDNet: A Low Memory Traffic Network

State-of-the-art neural network architectures such as ResNet, MobileNet,...
research
07/05/2022

Improving Semantic Segmentation in Transformers using Hierarchical Inter-Level Attention

Existing transformer-based image backbones typically propagate feature i...
research
11/10/2020

MP-ResNet: Multi-path Residual Network for the Semantic segmentation of High-Resolution PolSAR Images

There are limited studies on the semantic segmentation of high-resolutio...
