CI-Net: Contextual Information for Joint Semantic Segmentation and Depth Estimation

07/29/2021
by   Tianxiao Gao, et al.

Monocular depth estimation and semantic segmentation are two fundamental goals of scene understanding. Because the two tasks benefit from interacting with each other, many works study joint task learning. However, most existing methods fail to fully leverage the semantic labels: they ignore the context structures the labels provide and use them only to supervise the prediction of the segmentation split. In this paper, we propose a network injected with contextual information (CI-Net) to solve this problem. Specifically, we introduce a self-attention block in the encoder to generate an attention map. With supervision from a ground truth created from the semantic labels, the network is embedded with contextual information, so that it can understand the scene better and use dependent features to make accurate predictions. In addition, a feature sharing module is constructed to deeply fuse the task-specific features, and a consistency loss is devised so that the features guide each other. We evaluate the proposed CI-Net on the NYU-Depth-v2 and SUN-RGBD datasets. The experimental results show that CI-Net is competitive with the state of the art.
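
The idea of supervising an attention map with a ground truth derived from semantic labels can be illustrated with a minimal NumPy sketch. Everything specific here is an assumption made for illustration, not taken from the paper: the target is taken to be a pairwise "same class" affinity matrix, the attention map is a sigmoid of scaled dot products, the loss is binary cross-entropy, and the function names (`affinity_from_labels`, `self_attention_map`, `attention_supervision_loss`) are hypothetical. The abstract does not specify any of these choices.

```python
import numpy as np


def affinity_from_labels(labels: np.ndarray) -> np.ndarray:
    """Assumed target: (N, N) matrix, 1 where two pixels share a semantic class."""
    flat = labels.reshape(-1)
    return (flat[:, None] == flat[None, :]).astype(np.float32)


def self_attention_map(features: np.ndarray) -> np.ndarray:
    """Assumed attention: sigmoid of scaled dot-product similarity, shape (N, N).

    `features` has shape (N, C): one C-dimensional vector per pixel.
    """
    scores = features @ features.T / np.sqrt(features.shape[1])
    return 1.0 / (1.0 + np.exp(-scores))


def attention_supervision_loss(attention: np.ndarray,
                               target: np.ndarray,
                               eps: float = 1e-7) -> float:
    """Assumed loss: mean binary cross-entropy between attention and target."""
    attention = np.clip(attention, eps, 1.0 - eps)
    bce = target * np.log(attention) + (1.0 - target) * np.log(1.0 - attention)
    return float(-np.mean(bce))


# Toy 2x2 label map with two classes; random stand-in encoder features.
labels = np.array([[0, 0],
                   [1, 1]])
rng = np.random.default_rng(0)
features = rng.normal(size=(labels.size, 8))

target = affinity_from_labels(labels)
attention = self_attention_map(features)
loss = attention_supervision_loss(attention, target)
```

In a trained network, minimizing such a loss would push pixels of the same class toward high mutual attention, which is one plausible way to read "embedded with contextual information".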

research
10/04/2022

FreDSNet: Joint Monocular Depth and Semantic Segmentation with Fast Fourier Convolutions

In this work we present FreDSNet, a deep learning solution which obtains...
research
07/14/2015

Lifting GIS Maps into Strong Geometric Context for Scene Understanding

Contextual information can have a substantial impact on the performance ...
research
09/15/2023

STDG: Semi-Teacher-Student Training Paradigram for Depth-guided One-stage Scene Graph Generation

Scene Graph Generation is a critical enabler of environmental comprehens...
research
01/19/2021

SOSD-Net: Joint Semantic Object Segmentation and Depth Estimation from Monocular images

Depth estimation and semantic segmentation play essential roles in scene...
research
04/17/2020

Learning to Predict Context-adaptive Convolution for Semantic Segmentation

Long-range contextual information is essential for achieving high-perfor...
research
10/04/2022

ASAP: Accurate semantic segmentation for real time performance

Feature fusion modules from encoder and self-attention module have been ...
research
06/02/2023

Towards In-context Scene Understanding

In-context learning – the ability to configure a model's behavior with...
