From Pixels to Buildings: End-to-end Probabilistic Deep Networks for Large-scale Semantic Mapping

12/31/2018
by   Kaiyu Zheng, et al.
0

We introduce TopoNets, end-to-end probabilistic deep networks for modeling semantic maps with structure reflecting the topology of large-scale environments. TopoNets build unified deep networks spanning multiple levels of abstraction and spatial scales, from pixels representing geometry of local places to high-level descriptions representing semantics of buildings. To this end, TopoNets leverage complex spatial relations expressed in terms of arbitrary, dynamic graphs. We demonstrate how TopoNets can be used to perform end-to-end semantic mapping from partial sensory observations and noisy topological relations discovered by a robot exploring large-scale office spaces. We further illustrate the benefits of the probabilistic representation by generating semantic descriptions augmented with valuable uncertainty information and utilizing likelihoods of complete semantic maps to detect novel and incongruent environment configurations.

READ FULL TEXT

page 1

page 4

research
06/11/2017

Learning Large-Scale Topological Maps Using Sum-Product Networks

In order to perform complex actions in human environments, an autonomous...
research
12/22/2021

Graph augmented Deep Reinforcement Learning in the GameRLand3D environment

We address planning and navigation in challenging 3D video games featuri...
research
05/17/2021

RoSmEEry: Robotic Simulated Environment for Evaluation and Benchmarking of Semantic Mapping Algorithms

Human-robot interaction requires a common understanding of the operation...
research
10/17/2020

Lifelong update of semantic maps in dynamic environments

A robot understands its world through the raw information it senses from...
research
01/10/2023

InstaGraM: Instance-level Graph Modeling for Vectorized HD Map Learning

The construction of lightweight High-definition (HD) maps containing geo...
research
07/08/2017

Embedding Visual Hierarchy with Deep Networks for Large-Scale Visual Recognition

In this paper, a level-wise mixture model (LMM) is developed by embeddin...
research
03/15/2019

SceneCode: Monocular Dense Semantic Reconstruction using Learned Encoded Scene Representation

Systems which incrementally create 3D semantic maps from image sequences...

Please sign up or login with your details

Forgot password? Click here to reset