Multi-layer Feature Aggregation for Deep Scene Parsing Models

11/04/2020
by Litao Yu, et al.

Scene parsing from images is a fundamental yet challenging problem in visual content understanding. In this dense prediction task, the parsing model assigns a categorical label to every pixel, which requires contextual information from adjacent image patches. The challenge is therefore to describe the geometric and semantic properties of objects or a scene simultaneously. In this paper, we explore the effective use of the multi-layer feature outputs of deep parsing networks for spatial-semantic consistency. We design a novel feature aggregation module that generates an appropriate global representation prior and thereby improves the discriminative power of the features. The proposed module automatically selects intermediate visual features to correlate spatial and semantic information. At the same time, the multiple skip connections provide strong supervision, making the deep parsing network easy to train. Extensive experiments on four public scene parsing datasets show that a deep parsing network equipped with the proposed feature aggregation module achieves very promising results.
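The abstract does not specify how the module selects among intermediate features, so the following is only a minimal sketch of one common way to aggregate multi-layer outputs: a softmax gate over per-layer scores, followed by a weighted sum. The names `softmax`, `aggregate_layers`, `layer_features`, and `gate_scores` are hypothetical and chosen for illustration; the sketch also assumes each layer's output has already been resized to a common length, and it is not the authors' implementation.

```python
import math


def softmax(scores):
    """Convert raw gate scores into weights that are positive and sum to 1."""
    peak = max(scores)  # subtract the maximum for numerical stability
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def aggregate_layers(layer_features, gate_scores):
    """Blend per-layer feature vectors using softmax gate weights.

    layer_features: list of equal-length feature vectors, one per layer
                    (assumed to be already resized to a common shape).
    gate_scores:    one raw score per layer; in a real network these
                    would be learned, here they are plain inputs.
    Returns the fused feature vector and the weights that produced it.
    """
    if not layer_features:
        raise ValueError("at least one layer is required")
    if len(layer_features) != len(gate_scores):
        raise ValueError("need exactly one gate score per layer")
    dim = len(layer_features[0])
    if any(len(f) != dim for f in layer_features):
        raise ValueError("all layer features must have the same length")

    weights = softmax(gate_scores)
    fused = [
        sum(w * f[i] for w, f in zip(weights, layer_features))
        for i in range(dim)
    ]
    return fused, weights


if __name__ == "__main__":
    shallow = [1.0, 2.0]  # stands in for a spatially detailed feature
    deep = [3.0, 4.0]     # stands in for a semantically rich feature
    fused, weights = aggregate_layers([shallow, deep], [0.0, 0.0])
    print(fused, weights)
```

With equal gate scores the fused vector is the plain average of the layers; as one score grows much larger than the others, the gate approaches a hard selection of that layer, which is one way to read "auto-select" in this simplified setting.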

research
07/10/2022

Self-attention on Multi-Shifted Windows for Scene Segmentation

Scene segmentation in images is a fundamental yet challenging problem in...
research
06/19/2018

MoE-SPNet: A Mixture-of-Experts Scene Parsing Network

Scene parsing is an indispensable component in understanding the semanti...
research
04/13/2022

Deep Learning Model with GA based Feature Selection and Context Integration

Deep learning models have been very successful in computer vision and im...
research
06/23/2020

Non-parametric spatially constrained local prior for scene parsing on real-world data

Scene parsing aims to recognize the object category of every pixel in sc...
research
04/13/2022

Context-based Deep Learning Architecture with Optimal Integration Layer for Image Parsing

Deep learning models have been efficient lately on image parsing tasks. ...
research
03/29/2023

DPF: Learning Dense Prediction Fields with Weak Supervision

Nowadays, many visual scene understanding problems are addressed by dens...
research
02/24/2018

Spatially Constrained Location Prior for Scene Parsing

Semantic context is an important and useful cue for scene parsing in com...
