Improving Deep Stereo Network Generalization with Geometric Priors

08/25/2020
by   Jialiang Wang, et al.
8

End-to-end deep learning methods have advanced stereo vision in recent years and obtained excellent results when the training and test data are similar. However, large datasets of diverse real-world scenes with dense ground truth are difficult to obtain and currently not publicly available to the research community. As a result, many algorithms rely on small real-world datasets of similar scenes or synthetic datasets, but end-to-end algorithms trained on such datasets often generalize poorly to different images that arise in real-world applications. As a step towards addressing this problem, we propose to incorporate prior knowledge of scene geometry into an end-to-end stereo network to help networks generalize better. For a given network, we explicitly add a gradient-domain smoothness prior and occlusion reasoning into the network training, while the architecture remains unchanged during inference. Experimentally, we show consistent improvements if we train on synthetic datasets and test on the Middlebury (real images) dataset. Noticeably, we improve PSM-Net accuracy on Middlebury from 5.37 MAE to 3.21 MAE without sacrificing speed.

READ FULL TEXT

page 2

page 4

page 6

page 7

research
08/17/2019

OmniMVS: End-to-End Learning for Omnidirectional Stereo Matching

In this paper, we propose a novel end-to-end deep neural network model f...
research
12/06/2016

Deep Stereo Matching with Dense CRF Priors

Stereo reconstruction from rectified images has recently been revisited ...
research
04/12/2019

PWOC-3D: Deep Occlusion-Aware End-to-End Scene Flow Estimation

In the last few years, convolutional neural networks (CNNs) have demonst...
research
06/30/2022

Self-SuperFlow: Self-supervised Scene Flow Prediction in Stereo Sequences

In recent years, deep neural networks showed their exceeding capabilitie...
research
12/13/2018

Scene Recomposition by Learning-based ICP

By moving a depth sensor around a room, we compute a 3D CAD model of the...
research
12/14/2016

UnrealStereo: A Synthetic Dataset for Analyzing Stereo Vision

Stereo algorithm is important for robotics applications, such as quadcop...
research
07/10/2017

Automatic Construction of Real-World Datasets for 3D Object Localization using Two Cameras

Unlike classification, position labels cannot be assigned manually by hu...

Please sign up or login with your details

Forgot password? Click here to reset