Self-supervised Learning for Single View Depth and Surface Normal Estimation

03/01/2019
by   Huangying Zhan, et al.
0

In this work we present a self-supervised learning framework to simultaneously train two Convolutional Neural Networks (CNNs) to predict depth and surface normals from a single image. In contrast to most existing frameworks which represent outdoor scenes as fronto-parallel planes at piece-wise smooth depth, we propose to predict depth with surface orientation while assuming that natural scenes have piece-wise smooth normals. We show that a simple depth-normal consistency as a soft-constraint on the predictions is sufficient and effective for training both these networks simultaneously. The trained normal network provides state-of-the-art predictions while the depth network, relying on much realistic smooth normal assumption, outperforms the traditional self-supervised depth prediction network by a large margin on the KITTI benchmark. Demo video: https://youtu.be/ZD-ZRsw7hdM

READ FULL TEXT

page 1

page 4

research
03/13/2023

A Surface-normal Based Neural Framework for Colonoscopy Reconstruction

Reconstructing a 3D surface from colonoscopy video is challenging due to...
research
04/07/2021

Self-supervised Learning of Depth Inference for Multi-view Stereo

Recent supervised multi-view depth estimation networks have achieved pro...
research
03/06/2023

MACARONS: Mapping And Coverage Anticipation with RGB Online Self-Supervision

We introduce a method that simultaneously learns to explore new large en...
research
10/16/2019

Animating Landscape: Self-Supervised Learning of Decoupled Motion and Appearance for Single-Image Video Synthesis

Automatic generation of a high-quality video from a single image remains...
research
06/07/2021

Self-Supervised Structure-from-Motion through Tightly-Coupled Depth and Egomotion Networks

Much recent literature has formulated structure-from-motion (SfM) as a s...
research
04/03/2022

Distortion-Aware Self-Supervised 360° Depth Estimation from A Single Equirectangular Projection Image

360 images are widely available over the last few years. This paper prop...
research
03/15/2018

LEGO: Learning Edge with Geometry all at Once by Watching Videos

Learning to estimate 3D geometry in a single image by watching unlabeled...

Please sign up or login with your details

Forgot password? Click here to reset