Unite-Divide-Unite: Joint Boosting Trunk and Structure for High-accuracy Dichotomous Image Segmentation

07/26/2023
by   Jialun Pei, et al.

High-accuracy Dichotomous Image Segmentation (DIS) aims to pinpoint category-agnostic foreground objects in natural scenes. The main challenge for DIS is identifying the dominant area with high accuracy while also rendering detailed object structure. However, directly using a general encoder-decoder architecture may result in an oversupply of high-level features and neglect the shallow spatial information necessary for partitioning meticulous structures. To fill this gap, we introduce a novel Unite-Divide-Unite Network (UDUN) that restructures and bipartitely arranges complementary features to simultaneously boost the effectiveness of trunk and structure identification. The proposed UDUN has several strengths. First, a dual-size input feeds into the shared backbone to produce more holistic and detailed features while keeping the model lightweight. Second, a simple Divide-and-Conquer Module (DCM) is proposed to decouple multiscale low- and high-level features into our structure decoder and trunk decoder to obtain structure and trunk information, respectively. Moreover, we design a Trunk-Structure Aggregation module (TSA) in our union decoder that performs cascade integration for uniform high-accuracy segmentation. As a result, UDUN performs favorably against state-of-the-art competitors in all six evaluation metrics on overall DIS-TE, i.e., achieving 0.772 weighted F-measure and 977 HCE. Using 1024×1024 input, our model enables real-time inference at 65.3 fps with ResNet-18.
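
The unite-then-divide data flow described in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the backbone is replaced by plain 2x2 average pooling, all function names (`avg_pool_2x`, `shared_backbone`, `divide`, `unite_divide`) are hypothetical, and the `split` index that separates low-level from high-level stages is an assumption made only for illustration. The sketch shows two input sizes passing through one shared function and the resulting features being routed to a structure branch and a trunk branch; the final aggregation step (TSA) is not modeled.

```python
from typing import List, Tuple

Image = List[List[float]]  # single-channel image stored as nested lists


def avg_pool_2x(img: Image) -> Image:
    """Halve height and width by averaging each 2x2 block."""
    h, w = len(img) // 2, len(img[0]) // 2
    return [
        [
            (img[2 * r][2 * c] + img[2 * r][2 * c + 1]
             + img[2 * r + 1][2 * c] + img[2 * r + 1][2 * c + 1]) / 4.0
            for c in range(w)
        ]
        for r in range(h)
    ]


def shared_backbone(img: Image, stages: int = 3) -> List[Image]:
    """Toy stand-in for a CNN backbone (assumption: pooling only).

    Returns one feature map per stage, each at half the previous resolution.
    Calling the same function on both inputs mimics weight sharing.
    """
    feats = []
    x = img
    for _ in range(stages):
        x = avg_pool_2x(x)
        feats.append(x)
    return feats


def divide(feats: List[Image], split: int) -> Tuple[List[Image], List[Image]]:
    """Split stages into (low-level, high-level); `split` is hypothetical."""
    return feats[:split], feats[split:]


def unite_divide(image: Image, stages: int = 3, split: int = 1):
    """Dual-size input -> shared backbone -> two groups of features."""
    small = avg_pool_2x(image)                   # second, smaller input
    feats_large = shared_backbone(image, stages)
    feats_small = shared_backbone(small, stages)
    low_l, high_l = divide(feats_large, split)
    low_s, high_s = divide(feats_small, split)
    structure_inputs = low_l + low_s             # shallow, high-resolution maps
    trunk_inputs = high_l + high_s               # deep, low-resolution maps
    return structure_inputs, trunk_inputs


if __name__ == "__main__":
    image = [[float(r + c) for c in range(16)] for r in range(16)]
    structure, trunk = unite_divide(image)
    print("structure resolutions:", [len(f) for f in structure])
    print("trunk resolutions:", [len(f) for f in trunk])
```

In this toy setting the structure branch receives the highest-resolution maps from both passes and the trunk branch receives the coarser ones, which mirrors the low-level versus high-level routing the abstract attributes to the DCM.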

research
05/10/2022

STDC-MA Network for Semantic Segmentation

Semantic segmentation is applied extensively in autonomous driving and i...
research
08/16/2021

Polyp-PVT: Polyp Segmentation with Pyramid Vision Transformers

Most polyp segmentation methods use CNNs as their backbone, leading to t...
research
01/15/2019

Cascade Decoder: A Universal Decoding Method for Biomedical Image Segmentation

The Encoder-Decoder architecture is a main stream deep learning model fo...
research
04/27/2021

Rethinking BiSeNet For Real-time Semantic Segmentation

BiSeNet has been proved to be a popular two-stream network for real-time...
research
11/19/2019

Differentiating Features for Scene Segmentation Based on Dedicated Attention Mechanisms

Semantic segmentation is a challenge in scene parsing. It requires both ...
research
07/17/2023

EGE-UNet: an Efficient Group Enhanced UNet for skin lesion segmentation

Transformer and its variants have been widely used for medical image seg...
