Fusion of stereo and still monocular depth estimates in a self-supervised learning context

03/20/2018
by Diogo Martins, et al.

We study how autonomous robots can learn by themselves to improve their depth estimation capability. In particular, we investigate a self-supervised learning setup in which stereo vision depth estimates serve as targets for a convolutional neural network (CNN) that transforms a single still image into a dense depth map. After training, the stereo and mono estimates are fused with a novel fusion method that preserves high-confidence stereo estimates while leveraging the CNN estimates in low-confidence regions. The main contribution of the article is to show that the fused estimates achieve higher performance than the stereo vision estimates alone. Experiments on the KITTI dataset and onboard a Parrot SLAMDunk show that even rather limited CNNs can help provide stereo-vision-equipped robots with more reliable depth maps for autonomous navigation.
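
The fusion idea described above (keep stereo where it is trusted, fall back on the CNN elsewhere) can be illustrated with a minimal sketch. This is not the fusion method of the article, whose details are not given in the abstract; it is a simple per-pixel blend written under stated assumptions. The function name `fuse_depth`, the per-pixel `confidence` map in [0, 1], and the `threshold` parameter are all hypothetical and introduced only for illustration.

```python
from typing import List

DepthMap = List[List[float]]


def fuse_depth(stereo: DepthMap, mono: DepthMap,
               confidence: DepthMap, threshold: float = 0.8) -> DepthMap:
    """Illustrative per-pixel fusion (an assumption, not the article's method).

    - confidence >= threshold: keep the stereo depth unchanged.
    - confidence <= 0: use the monocular (CNN) depth only, so that an
      invalid stereo value never leaks into the result.
    - otherwise: blend linearly, weighting stereo by confidence / threshold.
    """
    if threshold <= 0:
        raise ValueError("threshold must be positive")
    fused: DepthMap = []
    for s_row, m_row, c_row in zip(stereo, mono, confidence):
        row = []
        for s, m, c in zip(s_row, m_row, c_row):
            if c >= threshold:
                row.append(s)                 # high confidence: preserve stereo
            elif c <= 0:
                row.append(m)                 # no usable stereo: CNN estimate
            else:
                w = c / threshold             # stereo weight in (0, 1)
                row.append(w * s + (1.0 - w) * m)
        fused.append(row)
    return fused
```

Nested lists keep the sketch free of dependencies; an onboard implementation would more likely operate on image arrays and vectorize the same three cases.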
