PP-MobileSeg: Explore the Fast and Accurate Semantic Segmentation Model on Mobile Devices

04/11/2023
by Shiyu Tang, et al.

The success of transformers in computer vision has led to several attempts to adapt them for mobile devices, but their performance remains unsatisfactory in some real-world applications. To address this issue, we propose PP-MobileSeg, a semantic segmentation model that achieves state-of-the-art performance on mobile devices. PP-MobileSeg comprises three novel parts: the StrideFormer backbone, the Aggregated Attention Module (AAM), and the Valid Interpolate Module (VIM). The four-stage StrideFormer backbone is built with MobileNetV3 (MV3) blocks and strided SEA attention, and it extracts rich semantic and detailed features with minimal parameter overhead. The AAM first filters the detailed features through semantic feature ensemble voting and then combines them with semantic features to enhance the semantic information. Finally, we propose VIM to upsample the downsampled feature to the resolution of the input image. Because this upsampling step is the most significant contributor to overall model latency, VIM significantly reduces latency by interpolating only the classes present in the final prediction. Extensive experiments show that PP-MobileSeg achieves a superior tradeoff between accuracy, model size, and latency compared to other methods. On the ADE20K dataset, PP-MobileSeg achieves a 1.57% improvement in mIoU over SeaFormer-Base with 32.9% fewer parameters and 42.3% faster acceleration on Qualcomm Snapdragon 855. Source code is available at https://github.com/PaddlePaddle/PaddleSeg/tree/release/2.8.
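The idea behind VIM, as the abstract describes it, can be illustrated with a minimal pure-Python sketch. This is not the PaddleSeg implementation: the function names (`upsample_bilinear`, `argmax_map`, `valid_interpolate`), the nested-list data layout, and the choice of align-corners bilinear sampling are all assumptions made for illustration. The sketch finds the classes that win at least one pixel in the low-resolution prediction, upsamples only those channels, and maps the reduced argmax back to the original class ids.

```python
from typing import List

Grid = List[List[float]]  # one H x W channel of class scores (assumed layout)


def upsample_bilinear(channel: Grid, scale: int) -> Grid:
    """Bilinear upsampling of one channel (align-corners style sampling)."""
    h, w = len(channel), len(channel[0])
    out_h, out_w = h * scale, w * scale
    out = []
    for i in range(out_h):
        # Map the output row back to a fractional source row.
        y = i * (h - 1) / (out_h - 1) if out_h > 1 else 0.0
        y0 = int(y)
        y1 = min(y0 + 1, h - 1)
        fy = y - y0
        row = []
        for j in range(out_w):
            x = j * (w - 1) / (out_w - 1) if out_w > 1 else 0.0
            x0 = int(x)
            x1 = min(x0 + 1, w - 1)
            fx = x - x0
            top = channel[y0][x0] * (1 - fx) + channel[y0][x1] * fx
            bottom = channel[y1][x0] * (1 - fx) + channel[y1][x1] * fx
            row.append(top * (1 - fy) + bottom * fy)
        out.append(row)
    return out


def argmax_map(logits: List[Grid]) -> List[List[int]]:
    """Per-pixel index of the highest-scoring channel."""
    h, w = len(logits[0]), len(logits[0][0])
    return [[max(range(len(logits)), key=lambda c: logits[c][i][j])
             for j in range(w)] for i in range(h)]


def valid_interpolate(logits: List[Grid], scale: int) -> List[List[int]]:
    """Hypothetical VIM-style upsampling: interpolate only the present classes."""
    # 1. Find the classes that win at least one low-resolution pixel.
    low_res = argmax_map(logits)
    present = sorted({c for row in low_res for c in row})
    # 2. Upsample only those channels instead of all of them.
    upsampled = [upsample_bilinear(logits[c], scale) for c in present]
    # 3. Argmax over the reduced set, then map back to original class ids.
    reduced = argmax_map(upsampled)
    return [[present[k] for k in row] for row in reduced]


# Four classes on a 2 x 2 map; only classes 0 and 2 win any pixel.
logits = [
    [[5.0, 5.0], [0.0, 0.0]],
    [[1.0, 1.0], [1.0, 1.0]],
    [[0.0, 0.0], [5.0, 5.0]],
    [[-1.0, -1.0], [-1.0, -1.0]],
]
labels = valid_interpolate(logits, scale=2)
```

Here only two of the four channels are interpolated, which is where the saving would come from on a dataset such as ADE20K with many classes. In this sketch the result can differ from upsampling every channel when a class that wins no low-resolution pixel would have won at an interpolated position.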

research
04/12/2022

TopFormer: Token Pyramid Transformer for Mobile Semantic Segmentation

Although vision transformers (ViTs) have achieved great success in compu...
research
01/30/2023

SeaFormer: Squeeze-enhanced Axial Transformer for Mobile Semantic Segmentation

Since the introduction of Vision Transformers, the landscape of many com...
research
03/22/2021

CFPNet: Channel-wise Feature Pyramid for Real-Time Semantic Segmentation

Real-time semantic segmentation is playing a more important role in comp...
research
06/08/2022

An Improved One millisecond Mobile Backbone

Efficient neural network backbones for mobile devices are often optimize...
research
07/01/2023

MobileViG: Graph-Based Sparse Attention for Mobile Vision Applications

Traditionally, convolutional neural networks (CNN) and vision transforme...
research
08/24/2023

Boosting Semantic Segmentation from the Perspective of Explicit Class Embeddings

Semantic segmentation is a computer vision task that associates a label ...
research
11/20/2018

CGNet: A Light-weight Context Guided Network for Semantic Segmentation

The demand of applying semantic segmentation model on mobile devices has...
