Rethinking BiSeNet For Real-time Semantic Segmentation

04/27/2021
by   Mingyuan Fan, et al.
0

BiSeNet has been proved to be a popular two-stream network for real-time segmentation. However, its principle of adding an extra path to encode spatial information is time-consuming, and the backbones borrowed from pretrained tasks, e.g., image classification, may be inefficient for image segmentation due to the deficiency of task-specific design. To handle these problems, we propose a novel and efficient structure named Short-Term Dense Concatenate network (STDC network) by removing structure redundancy. Specifically, we gradually reduce the dimension of feature maps and use the aggregation of them for image representation, which forms the basic module of STDC network. In the decoder, we propose a Detail Aggregation module by integrating the learning of spatial information into low-level layers in single-stream manner. Finally, the low-level features and deep features are fused to predict the final segmentation results. Extensive experiments on Cityscapes and CamVid dataset demonstrate the effectiveness of our method by achieving promising trade-off between segmentation accuracy and inference speed. On Cityscapes, we achieve 71.9 which is 45.2 FPS while inferring on higher resolution images.

READ FULL TEXT

page 5

page 7

research
06/04/2023

Cross-CBAM: A Lightweight network for Scene Segmentation

Scene parsing is a great challenge for real-time semantic segmentation. ...
research
10/18/2021

FEANet: Feature-Enhanced Attention Network for RGB-Thermal Real-time Semantic Segmentation

The RGB-Thermal (RGB-T) information for semantic segmentation has been e...
research
04/05/2020

BiSeNet V2: Bilateral Network with Guided Aggregation for Real-time Semantic Segmentation

The low-level details and high-level semantics are both essential to the...
research
12/02/2022

DWRSeg: Dilation-wise Residual Network for Real-time Semantic Segmentation

Real-time semantic segmentation has played an important role in intellig...
research
07/26/2023

Unite-Divide-Unite: Joint Boosting Trunk and Structure for High-accuracy Dichotomous Image Segmentation

High-accuracy Dichotomous Image Segmentation (DIS) aims to pinpoint cate...
research
01/30/2023

CSDN: Combing Shallow and Deep Networks for Accurate Real-time Segmentation of High-definition Intravascular Ultrasound Images

Intravascular ultrasound (IVUS) is the preferred modality for capturing ...
research
07/09/2021

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Document structure extraction has been a widely researched area for deca...

Please sign up or login with your details

Forgot password? Click here to reset