Efficient Context Integration through Factorized Pyramidal Learning for Ultra-Lightweight Semantic Segmentation

02/23/2023
by   Nadeem Atif, et al.
0

Semantic segmentation is a pixel-level prediction task to classify each pixel of the input image. Deep learning models, such as convolutional neural networks (CNNs), have been extremely successful in achieving excellent performances in this domain. However, mobile application, such as autonomous driving, demand real-time processing of incoming stream of images. Hence, achieving efficient architectures along with enhanced accuracy is of paramount importance. Since, accuracy and model size of CNNs are intrinsically contentious in nature, the challenge is to achieve a decent trade-off between accuracy and model size. To address this, we propose a novel Factorized Pyramidal Learning (FPL) module to aggregate rich contextual information in an efficient manner. On one hand, it uses a bank of convolutional filters with multiple dilation rates which leads to multi-scale context aggregation; crucial in achieving better accuracy. On the other hand, parameters are reduced by a careful factorization of the employed filters; crucial in achieving lightweight models. Moreover, we decompose the spatial pyramid into two stages which enables a simple and efficient feature fusion within the module to solve the notorious checkerboard effect. We also design a dedicated Feature-Image Reinforcement (FIR) unit to carry out the fusion operation of shallow and deep features with the downsampled versions of the input image. This gives an accuracy enhancement without increasing model parameters. Based on the FPL module and FIR unit, we propose an ultra-lightweight real-time network, called FPLNet, which achieves state-of-the-art accuracy-efficiency trade-off. More specifically, with only less than 0.5 million parameters, the proposed network achieves 66.93% and 66.28% mIoU on Cityscapes validation and test set, respectively. Moreover, FPLNet has a processing speed of 95.5 frames per second (FPS).

READ FULL TEXT

page 1

page 2

page 6

page 10

research
09/18/2019

Feature Pyramid Encoding Network for Real-time Semantic Segmentation

Although current deep learning methods have achieved impressive results ...
research
03/09/2020

FarSee-Net: Real-Time Semantic Segmentation by Efficient Multi-scale Context Aggregation and Feature Space Super-resolution

Real-time semantic segmentation is desirable in many robotic application...
research
11/02/2019

FDDWNet: A Lightweight Convolutional Neural Network for Real-time Sementic Segmentation

This paper introduces a lightweight convolutional neural network, called...
research
03/22/2021

CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Real-time semantic segmentation is playing a more important role in comp...
research
09/02/2021

FBSNet: A Fast Bilateral Symmetrical Network for Real-Time Semantic Segmentation

Real-time semantic segmentation, which can be visually understood as the...
research
07/26/2019

DABNet: Depth-wise Asymmetric Bottleneck for Real-time Semantic Segmentation

As a pixel-level prediction task, semantic segmentation needs large comp...
research
06/04/2022

PIDNet: A Real-time Semantic Segmentation Network Inspired from PID Controller

Two-branch network architecture has shown its efficiency and effectivene...

Please sign up or login with your details

Forgot password? Click here to reset