A Lightweight Neural Network for Monocular View Generation with Occlusion Handling

07/24/2020
by   Simon Evain, et al.
0

In this article, we present a very lightweight neural network architecture, trained on stereo data pairs, which performs view synthesis from one single image. With the growing success of multi-view formats, this problem is indeed increasingly relevant. The network returns a prediction built from disparity estimation, which fills in wrongly predicted regions using a occlusion handling technique. To do so, during training, the network learns to estimate the left-right consistency structural constraint on the pair of stereo input images, to be able to replicate it at test time from one single image. The method is built upon the idea of blending two predictions: a prediction based on disparity estimation, and a prediction based on direct minimization in occluded regions. The network is also able to identify these occluded areas at training and at test time by checking the pixelwise left-right consistency of the produced disparity maps. At test time, the approach can thus generate a left-side and a right-side view from one input image, as well as a depth map and a pixelwise confidence measure in the prediction. The work outperforms visually and metric-wise state-of-the-art approaches on the challenging KITTI dataset, all while reducing by a very significant order of magnitude (5 or 10 times) the required number of parameters (6.5 M).

READ FULL TEXT

page 3

page 5

page 7

page 9

page 10

page 12

page 13

research
03/18/2019

Bilateral Cyclic Constraint and Adaptive Regularization for Unsupervised Monocular Depth Prediction

Supervised learning methods to infer (hypothesize) depth of a scene from...
research
05/01/2019

Learn Stereo, Infer Mono: Siamese Networks for Self-Supervised, Monocular, Depth Estimation

The field of self-supervised monocular depth estimation has seen huge ad...
research
08/24/2020

DiverseNet: When One Right Answer is not Enough

Many structured prediction tasks in machine vision have a collection of ...
research
04/03/2018

Left-Right Comparative Recurrent Model for Stereo Matching

Leveraging the disparity information from both left and right views is c...
research
09/14/2022

FCDSN-DC: An Accurate and Lightweight Convolutional Neural Network for Stereo Estimation with Depth Completion

We propose an accurate and lightweight convolutional neural network for ...
research
10/21/2020

Geometry-based Occlusion-Aware Unsupervised Stereo Matching for Autonomous Driving

Recently, there are emerging many stereo matching methods for autonomous...
research
02/02/2022

Multi-Resolution Factor Graph Based Stereo Correspondence Algorithm

A dense depth-map of a scene at an arbitrary view orientation can be est...

Please sign up or login with your details

Forgot password? Click here to reset