Self-Supervised Learning of Depth and Camera Motion from 360° Videos

11/13/2018
by Fu-En Wang, et al.

As 360° cameras become prevalent in many autonomous systems (e.g., self-driving cars and drones), efficient 360° perception becomes increasingly important. We propose a novel self-supervised learning approach for predicting omnidirectional depth and camera motion from a 360° video. In particular, starting from SfM-Learner, which is designed for cameras with a normal field of view, we introduce three key features to process 360° images efficiently. First, we convert each image from equirectangular projection to cubic projection in order to avoid image distortion. In each network layer, we use Cube Padding (CP), which pads intermediate features with those from adjacent faces, to avoid image boundaries. Second, we propose a novel "spherical" photometric consistency constraint on the whole viewing sphere. In this way, no pixel is projected outside the image boundary, as typically happens in images with a normal field of view. Finally, rather than naively estimating six independent camera motions (i.e., applying SfM-Learner separately to each face of the cube), we propose a novel camera pose consistency loss to ensure that the estimated camera motions reach consensus. To train and evaluate our approach, we collect a new PanoSUNCG dataset containing a large number of 360° videos with ground-truth depth and camera motion. Our approach achieves state-of-the-art depth prediction and camera motion estimation on PanoSUNCG, with faster inference than the equirectangular counterpart. On real-world indoor videos, our approach also achieves qualitatively reasonable depth prediction using a model pre-trained on PanoSUNCG.
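The geometric idea behind the cubic projection and the "spherical" constraint can be illustrated with a minimal sketch. It is not the authors' implementation. The axis convention (+z forward, longitude measured from +z towards +x), the face labels, and the function names `direction_to_equirect` and `direction_to_cube_face` are assumptions made for illustration only. The point it shows is that any nonzero viewing direction, including one behind the camera, maps to a valid equirectangular pixel and to exactly one cube face.

```python
import math


def direction_to_equirect(d, width, height):
    """Map a nonzero 3D direction to (u, v) pixel coordinates on an
    equirectangular image of size width x height.

    Assumed convention (not from the paper): +z is forward, and
    longitude 0 lies at the horizontal image centre.
    """
    x, y, z = d
    norm = math.sqrt(x * x + y * y + z * z)
    x, y, z = x / norm, y / norm, z / norm
    lon = math.atan2(x, z)                      # longitude in [-pi, pi]
    lat = math.asin(max(-1.0, min(1.0, y)))     # latitude in [-pi/2, pi/2]
    u = (lon / (2.0 * math.pi) + 0.5) * width   # horizontal pixel coordinate
    v = (lat / math.pi + 0.5) * height          # vertical pixel coordinate
    # Longitude wraps around, so u is taken modulo the width; v is clamped
    # just below the last row so the pole stays inside the image.
    return u % width, min(v, height - 1e-9)


def direction_to_cube_face(d):
    """Return (face, s, t) for the cube face hit by nonzero direction d,
    with face coordinates s and t in [-1, 1].

    The face labels and the orientation of (s, t) are illustrative choices.
    """
    x, y, z = d
    ax, ay, az = abs(x), abs(y), abs(z)
    if ax >= ay and ax >= az:       # dominant x axis
        face = "+x" if x > 0 else "-x"
        s, t = z / ax, y / ax
    elif ay >= az:                  # dominant y axis
        face = "+y" if y > 0 else "-y"
        s, t = x / ay, z / ay
    else:                           # dominant z axis
        face = "+z" if z > 0 else "-z"
        s, t = x / az, y / az
    return face, s, t


# A 3D point behind the camera, which a pinhole model could not project.
point_behind = (1.0, 0.5, -2.0)
pixel = direction_to_equirect(point_behind, 512, 256)
face_hit = direction_to_cube_face(point_behind)
```

Because the division is always by the largest absolute component, the face coordinates cannot leave [-1, 1], which is the cube-map analogue of never projecting outside the image.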


