ActiveStereoNet: End-to-End Self-Supervised Learning for Active Stereo Systems

07/16/2018
by   Yinda Zhang, et al.
4

In this paper we present ActiveStereoNet, the first deep learning solution for active stereo systems. Due to the lack of ground truth, our method is fully self-supervised, yet it produces precise depth with a subpixel precision of 1/30th of a pixel; it does not suffer from the common over-smoothing issues; it preserves the edges; and it explicitly handles occlusions. We introduce a novel reconstruction loss that is more robust to noise and texture-less patches, and is invariant to illumination changes. The proposed loss is optimized using a window-based cost aggregation with an adaptive support weight scheme. This cost aggregation is edge-preserving and smooths the loss function, which is key to allow the network to reach compelling results. Finally we show how the task of predicting invalid regions, such as occlusions, can be trained end-to-end without ground-truth. This component is crucial to reduce blur and particularly improves predictions along depth discontinuities. Extensive quantitatively and qualitatively evaluations on real and synthetic data demonstrate state of the art results in many challenging scenes.

READ FULL TEXT

page 6

page 11

page 13

page 14

page 21

page 22

page 25

page 26

research
12/06/2021

ActiveZero: Mixed Domain Learning for Active Stereovision with Zero Annotation

Traditional depth sensors generate accurate real world depth estimates t...
research
09/19/2019

Self-Supervised Monocular Depth Hints

Monocular depth estimators can be trained with various forms of self-sup...
research
07/09/2019

UnsuperPoint: End-to-end Unsupervised Interest Point Detector and Descriptor

It is hard to create consistent ground truth data for interest points in...
research
03/30/2023

NeRF-Supervised Deep Stereo

We introduce a novel framework for training deep stereo networks effortl...
research
08/28/2019

Self-supervised blur detection from synthetically blurred scenes

Blur detection aims at segmenting the blurred areas of a given image. Re...
research
09/03/2019

Self-Supervised Deep Depth Denoising

Depth perception is considered an invaluable source of information for v...
research
04/26/2023

MAPConNet: Self-supervised 3D Pose Transfer with Mesh and Point Contrastive Learning

3D pose transfer is a challenging generation task that aims to transfer ...

Please sign up or login with your details

Forgot password? Click here to reset