Towards Building the Semantic Map from a Monocular Camera with a Multi-task Network

01/17/2019
by   Lei Fan, et al.
12

In many robotic applications, especially for the autonomous driving, understanding the semantic information and the geometric structure of surroundings are both essential. Semantic 3D maps, as a carrier of the environmental knowledge, are then intensively studied for their abilities and applications. However, it is still challenging to produce a dense outdoor semantic map from a monocular image stream. Motivated by this target, in this paper, we propose a method for large-scale 3D reconstruction from consecutive monocular images. First, with the correlation of underlying information between depth and semantic prediction, a novel multi-task Convolutional Neural Network (CNN) is designed for joint prediction. Given a single image, the network learns low-level information with a shared encoder and separately predicts with decoders containing additional Atrous Spatial Pyramid Pooling (ASPP) layers and the residual connection which merits disparities and semantic mutually. To overcome the inconsistency of monocular depth prediction for reconstruction, post-processing steps with the superpixelization and the effective 3D representation approach are obtained to give the final semantic map. Experiments are compared with other methods on both semantic labeling and depth prediction. We also qualitatively demonstrate the map reconstructed from large-scale, difficult monocular image sequences to prove the effectiveness and superiority.

READ FULL TEXT

page 1

page 3

page 4

page 5

page 6

research
08/13/2018

3D Geometry-Aware Semantic Labeling of Outdoor Street Scenes

This paper is concerned with the problem of how to better exploit 3D geo...
research
06/07/2020

DeepRelativeFusion: Dense Monocular SLAM using Single-Image Relative Depth Prediction

Traditional monocular visual simultaneous localization and mapping (SLAM...
research
11/04/2019

Technical Report: Co-learning of geometry and semantics for online 3D mapping

This paper is a technical report about our submission for the ECCV 2018 ...
research
04/21/2023

FSNet: Redesign Self-Supervised MonoDepth for Full-Scale Depth Prediction for Autonomous Driving

Predicting accurate depth with monocular images is important for low-cos...
research
02/24/2022

N-QGN: Navigation Map from a Monocular Camera using Quadtree Generating Networks

Monocular depth estimation has been a popular area of research for sever...
research
10/24/2022

Depth Monocular Estimation with Attention-based Encoder-Decoder Network from Single Image

Depth information is the foundation of perception, essential for autonom...

Please sign up or login with your details

Forgot password? Click here to reset