Boosting Real-Time Driving Scene Parsing with Shared Semantics

09/16/2019
by   Zhenzhen Xiang, et al.

Real-time scene parsing is a fundamental feature for autonomous driving vehicles with multiple cameras. In contrast to traditional methods that process the frames from each camera individually, this letter demonstrates that sharing semantics between cameras with overlapping views can boost parsing performance. Our framework is based on a deep neural network for semantic segmentation, extended with two kinds of additional modules for sharing and fusing semantics. First, a semantics sharing module establishes a pixel-wise mapping between the two images of an input pair. Features as well as semantics are shared through this map to reduce duplicated workload, which leads to more efficient computation. Second, feature fusion modules combine the different modalities of semantic features and learn to leverage the information from both inputs for better results. To evaluate the effectiveness of the proposed framework, we collect a new dataset with a dual-camera vision system for driving scene parsing. Experimental results show that our network outperforms the baseline method in parsing accuracy with comparable computation.
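
As a rough illustration of the two ideas in the abstract, the toy sketch below warps features from one camera into the other's view through a pixel-wise map and then blends them in the overlapping region. It is not the authors' network: the function names (share_features, fuse_features), the dictionary-based map, and the fixed blend weight are assumptions made for illustration only, and the fixed weight stands in for the learned fusion modules the abstract describes.

```python
from typing import Dict, List, Tuple

Grid = List[List[float]]
# Hypothetical map format: target (row, col) -> source (row, col).
PixelMap = Dict[Tuple[int, int], Tuple[int, int]]


def share_features(src: Grid, pixel_map: PixelMap, height: int, width: int):
    """Warp source-camera features into the target view using a pixel-wise map.

    Returns (warped, valid): warped[r][c] holds the source feature mapped to
    target pixel (r, c); valid[r][c] is True only where the map has an entry.
    """
    warped = [[0.0] * width for _ in range(height)]
    valid = [[False] * width for _ in range(height)]
    for (tr, tc), (sr, sc) in pixel_map.items():
        in_target = 0 <= tr < height and 0 <= tc < width
        in_source = 0 <= sr < len(src) and 0 <= sc < len(src[0])
        if in_target and in_source:
            warped[tr][tc] = src[sr][sc]
            valid[tr][tc] = True
    return warped, valid


def fuse_features(own: Grid, shared: Grid, valid, weight: float = 0.5) -> Grid:
    """Blend the target's own features with shared ones where the map is valid.

    Assumption: a fixed blend weight replaces the learned fusion of the paper.
    """
    return [
        [
            (1.0 - weight) * o + weight * s if v else o
            for o, s, v in zip(own_row, shared_row, valid_row)
        ]
        for own_row, shared_row, valid_row in zip(own, shared, valid)
    ]


# Toy example: the right column of camera A overlaps the left column of camera B.
feat_a = [[1.0, 2.0], [3.0, 4.0]]
feat_b = [[10.0, 20.0], [30.0, 40.0]]
# Target pixel (r, 0) in B corresponds to source pixel (r, 1) in A.
a_to_b = {(0, 0): (0, 1), (1, 0): (1, 1)}

warped, valid = share_features(feat_a, a_to_b, height=2, width=2)
fused_b = fuse_features(feat_b, warped, valid)
```

Pixels of camera B outside the overlap keep their own features unchanged, so the shared information only affects the region both cameras observe.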


