Short-Term Prediction and Multi-Camera Fusion on Semantic Grids

03/21/2019
by Lukas Hoyer, et al.

An environment representation (ER) is a substantial part of every autonomous system. It introduces a common interface between perception and other system components, such as decision making, and allows downstream algorithms to work with abstracted data without knowledge of the sensor used. In this work, we propose and evaluate a novel architecture that generates an egocentric, grid-based, predictive, and semantically interpretable ER. In particular, we provide a proof of concept for the spatio-temporal fusion of multiple camera sequences and short-term prediction in such an ER. Our design uses a strong semantic segmentation network together with depth and egomotion estimates to first extract semantic information from multiple camera streams and then transform each stream separately into egocentric, temporally aligned bird's-eye view grids. A deep encoder-decoder network is trained to fuse a stack of these grids into a unified semantic grid representation and to predict the dynamics of its surroundings. We evaluate this representation on real-world sequences from the Cityscapes dataset and show that our architecture can make accurate predictions in complex sensor fusion scenarios and significantly outperforms a model-driven baseline in a category-based evaluation.
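The step of transforming per-camera semantic information into a bird's-eye view grid can be illustrated with a minimal sketch. It is not the authors' implementation: it assumes a simple pinhole camera looking straight ahead, a dense metric depth map, and a last-write-wins rule for cells hit by several pixels. The function name `semantic_to_bev`, the `UNKNOWN` label, and all parameters are hypothetical and chosen only for illustration. Egomotion compensation and the learned fusion network are left out.

```python
import numpy as np

UNKNOWN = 255  # hypothetical label for cells that no pixel projects into


def semantic_to_bev(labels, depth, fx, cx, cell_size=0.5, grid_shape=(8, 8)):
    """Project per-pixel class labels into an egocentric bird's-eye view grid.

    labels: (H, W) integer class ids from a segmentation network.
    depth:  (H, W) finite metric depth along the optical axis.
    fx, cx: assumed pinhole intrinsics (focal length and principal point, in pixels).
    Row 0 of the grid is the farthest range bin; the camera sits at the bottom centre.
    """
    rows, cols = grid_shape
    h, w = labels.shape
    u = np.tile(np.arange(w), (h, 1))   # pixel column index for every pixel
    z = depth                           # forward distance in metres
    x = (u - cx) * z / fx               # lateral offset in metres (pinhole model)

    # Discretise metric coordinates into grid indices; height is ignored.
    col = np.floor(x / cell_size).astype(int) + cols // 2
    row = rows - 1 - np.floor(z / cell_size).astype(int)

    valid = (z > 0) & (row >= 0) & (row < rows) & (col >= 0) & (col < cols)
    grid = np.full(grid_shape, UNKNOWN, dtype=np.uint8)
    # If several pixels fall into one cell, the last one written wins.
    grid[row[valid], col[valid]] = labels[valid]
    return grid


if __name__ == "__main__":
    labels = np.array([[1, 2], [1, 2]])
    depth = np.array([[1.0, 3.0], [1.0, 3.0]])
    print(semantic_to_bev(labels, depth, fx=1.0, cx=0.5,
                          cell_size=1.0, grid_shape=(4, 4)))
```

In a multi-camera setup, one such grid would be produced per camera and time step, and the resulting stack would then be passed to a fusion model. A per-cell majority vote or a class-probability accumulation would be a natural refinement of the last-write-wins rule used here.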

research
06/17/2020

FISHING Net: Future Inference of Semantic Heatmaps In Grids

For autonomous robots to navigate a complex environment, it is crucial t...
research
05/26/2022

BEVFusion: Multi-Task Multi-Sensor Fusion with Unified Bird's-Eye View Representation

Multi-sensor fusion is essential for an accurate and reliable autonomous...
research
06/27/2022

LaRa: Latents and Rays for Multi-Camera Bird's-Eye-View Semantic Segmentation

Recent works in autonomous driving have widely adopted the bird's-eye-vi...
research
11/14/2022

LAPTNet: LiDAR-Aided Perspective Transform Network

Semantic grids are a useful representation of the environment around a r...
research
02/09/2022

A Multi-Task Recurrent Neural Network for End-to-End Dynamic Occupancy Grid Mapping

A common approach for modeling the environment of an autonomous vehicle ...
research
06/17/2020

Evaluation of 3D CNN Semantic Mapping for Rover Navigation

Terrain assessment is a key aspect for autonomous exploration rovers, su...
research
07/04/2012

Map-aided Fusion Using Evidential Grids for Mobile Perception in Urban Environment

Evidential grids have been recently used for mobile object perception. T...
