CodeSLAM - Learning a Compact, Optimisable Representation for Dense Visual SLAM

04/03/2018
by   Michael Bloesch, et al.

The representation of geometry in real-time 3D perception systems continues to be a critical research issue. Dense maps capture complete surface shape and can be augmented with semantic labels, but their high dimensionality makes them computationally costly to store and process, and unsuitable for rigorous probabilistic inference. Sparse feature-based representations avoid these problems, but capture only partial scene information and are mainly useful for localisation. We present a new compact but dense representation of scene geometry which is conditioned on the intensity data from a single image and generated from a code consisting of a small number of parameters. We are inspired by work on both learned depth from images and auto-encoders. Our approach is suitable for use in a keyframe-based monocular dense SLAM system: while each keyframe with a code can produce a depth map, the code can be optimised efficiently jointly with pose variables and together with the codes of overlapping keyframes to attain global consistency. Conditioning the depth map on the image allows the code to represent only those aspects of the local geometry which cannot be predicted directly from the image. We explain how to learn our code representation, and demonstrate its advantageous properties in monocular SLAM.
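To make the idea of an optimisable code concrete, the following is a toy sketch and not the authors' implementation. It assumes a linear decoder, which the abstract does not specify: a depth map is an image-conditioned prior plus a basis matrix applied to a short code. The names `decode_depth`, `optimise_code`, `prior`, and `basis` are hypothetical, the numbers are made up for illustration, and the joint optimisation with camera pose and overlapping keyframes is omitted.

```python
# Toy sketch (assumption: linear decoder; all names and values are illustrative).

def decode_depth(prior, basis, code):
    """Depth per pixel = image-conditioned prior + basis row dotted with the code."""
    return [p + sum(b * c for b, c in zip(row, code))
            for p, row in zip(prior, basis)]


def optimise_code(prior, basis, observed, steps=200, lr=0.1):
    """Fit the code by gradient descent on 0.5 * sum of squared depth residuals."""
    code = [0.0] * len(basis[0])
    for _ in range(steps):
        predicted = decode_depth(prior, basis, code)
        residuals = [d - o for d, o in zip(predicted, observed)]
        # Gradient of the cost with respect to each code entry.
        grad = [sum(r * row[k] for r, row in zip(residuals, basis))
                for k in range(len(code))]
        code = [c - lr * g for c, g in zip(code, grad)]
    return code


# Four "pixels" and a two-parameter code.
prior = [2.0, 2.5, 3.0, 3.5]            # what the image alone predicts
basis = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0], [1.0, -1.0]]
true_code = [0.4, -0.2]                 # residual geometry the image cannot explain
observed = decode_depth(prior, basis, true_code)

estimated_code = optimise_code(prior, basis, observed)
```

In this sketch only two numbers are optimised rather than one depth value per pixel, which is the property that makes a compact code attractive for joint optimisation in a SLAM back end.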

research
01/14/2020

DeepFactors: Real-Time Probabilistic Dense Monocular SLAM

The ability to estimate rich geometry and camera motion from monocular i...
research
07/19/2021

CodeMapping: Real-Time Dense Mapping for Sparse SLAM using Compact Scene Representations

We propose a novel dense mapping framework for sparse visual SLAM system...
research
03/15/2019

SceneCode: Monocular Dense Semantic Reconstruction using Learned Encoded Scene Representations

Systems which incrementally create 3D semantic maps from image sequences...
research
06/12/2023

H-SLAM: Hybrid Direct-Indirect Visual SLAM

The recent success of hybrid methods in monocular odometry has led to ma...
research
07/23/2019

Deep-SLAM++: Object-level RGBD SLAM based on class-specific deep shape priors

In an effort to increase the capabilities of SLAM systems and produce ob...
research
04/13/2019

Direct Sparse Mapping

Photometric bundle adjustment, PBA, accurately estimates geometry from v...