Simple and Efficient Architectures for Semantic Segmentation

06/16/2022
by   Dushyant Mehta, et al.

Though state-of-the-art architectures for semantic segmentation, such as HRNet, demonstrate impressive accuracy, the complexity arising from their salient design choices hinders a range of model acceleration tools, and they further rely on operations that are inefficient on current hardware. This paper demonstrates that a simple encoder-decoder architecture with a ResNet-like backbone and a small multi-scale head performs on par with or better than complex semantic segmentation architectures such as HRNet, FANet, and DDRNets. Naively applying deep backbones designed for image classification to semantic segmentation leads to sub-par results, owing to the much smaller effective receptive field of these backbones. Implicit among the various design choices put forth in works like HRNet, DDRNet, and FANet are networks with a large effective receptive field. It is natural to ask whether a simple encoder-decoder architecture would compare favorably if built from backbones that have a larger effective receptive field, though without the use of inefficient operations like dilated convolutions. We show that minor and inexpensive modifications to ResNets that enlarge the receptive field yield very simple and competitive baselines for semantic segmentation. We present a family of such simple architectures for desktop as well as mobile targets, which match or exceed the performance of complex models on the Cityscapes dataset. We hope that our work provides simple yet effective baselines for practitioners developing efficient semantic segmentation models.
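To make the receptive-field argument concrete, the sketch below computes the theoretical receptive field of a stack of convolutions using the standard recurrence (each layer adds `(kernel - 1) * dilation * jump` input pixels, and the jump is multiplied by the stride). It is only an illustration under stated assumptions: the layer stacks are hypothetical and are not the configurations used in the paper, the abstract does not say which ResNet modifications the authors apply, and the theoretical receptive field is an upper bound on the effective receptive field the abstract refers to.

```python
def receptive_field(layers):
    """Theoretical receptive field, in input pixels, of a stack of layers.

    Each layer is a (kernel_size, stride, dilation) tuple.
    """
    rf = 1    # a single input pixel sees itself
    jump = 1  # spacing, in input pixels, between adjacent outputs
    for kernel, stride, dilation in layers:
        rf += (kernel - 1) * dilation * jump
        jump *= stride
    return rf


# Hypothetical stacks, chosen only for illustration.
# ResNet-style stem: 7x7 conv with stride 2, then 3x3 pooling with stride 2.
stem = [(7, 2, 1), (3, 2, 1)]
shallow = stem + [(3, 1, 1)] * 4   # four plain 3x3 convolutions
deeper = stem + [(3, 1, 1)] * 8    # eight plain 3x3 convolutions
dilated = stem + [(3, 1, 2)] * 4   # four 3x3 convolutions with dilation 2

for name, stack in [("shallow", shallow), ("deeper", deeper), ("dilated", dilated)]:
    print(name, receptive_field(stack))
# shallow 43
# deeper 75
# dilated 75
```

In this toy setting, doubling the number of plain 3x3 convolutions reaches the same theoretical receptive field as four dilated ones, which is one way a backbone could grow its receptive field without dilation.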


research
12/31/2020

Rethinking Semantic Segmentation from a Sequence-to-Sequence Perspective with Transformers

Most recent semantic segmentation methods adopt a fully-convolutional ne...
research
07/28/2019

Dilated Point Convolutions: On the Receptive Field of Point Convolutions

In this work, we propose Dilated Point Convolutions (DPC) which drastica...
research
04/30/2018

On the iterative refinement of densely connected representation levels for semantic segmentation

State-of-the-art semantic segmentation approaches increase the receptive...
research
11/10/2020

MP-ResNet: Multi-path Residual Network for the Semantic segmentation of High-Resolution PolSAR Images

There are limited studies on the semantic segmentation of high-resolutio...
research
07/21/2022

FADE: Fusing the Assets of Decoder and Encoder for Task-Agnostic Upsampling

We consider the problem of task-agnostic feature upsampling in dense pre...
research
12/22/2021

MOSAIC: Mobile Segmentation via decoding Aggregated Information and encoded Context

We present a next-generation neural network architecture, MOSAIC, for ef...
research
08/30/2022

Probing Contextual Diversity for Dense Out-of-Distribution Detection

Detection of out-of-distribution (OoD) samples in the context of image c...
