MonoPLFlowNet: Permutohedral Lattice FlowNet for Real-Scale 3D Scene FlowEstimation with Monocular Images

11/24/2021
by   Runfa Li, et al.
10

Real-scale scene flow estimation has become increasingly important for 3D computer vision. Some works successfully estimate real-scale 3D scene flow with LiDAR. However, these ubiquitous and expensive sensors are still unlikely to be equipped widely for real application. Other works use monocular images to estimate scene flow, but their scene flow estimations are normalized with scale ambiguity, where additional depth or point cloud ground truth are required to recover the real scale. Even though they perform well in 2D, these works do not provide accurate and reliable 3D estimates. We present a deep learning architecture on permutohedral lattice - MonoPLFlowNet. Different from all previous works, our MonoPLFlowNet is the first work where only two consecutive monocular images are used as input, while both depth and 3D scene flow are estimated in real scale. Our real-scale scene flow estimation outperforms all state-of-the-art monocular-image based works recovered to real scale by ground truth, and is comparable to LiDAR approaches. As a by-product, our real-scale depth estimation also outperforms other state-of-the-art works.

READ FULL TEXT

page 1

page 3

page 8

research
08/24/2021

Bridging Unsupervised and Supervised Depth from Focus via All-in-Focus Supervision

Depth estimation is a long-lasting yet important task in computer vision...
research
06/08/2022

Unsupervised Learning of 3D Scene Flow from Monocular Camera

Scene flow represents the motion of points in the 3D space, which is the...
research
07/10/2018

SceneEDNet: A Deep Learning Approach for Scene Flow Estimation

Estimating scene flow in RGB-D videos is attracting much interest of the...
research
03/31/2020

Distilled Semantics for Comprehensive Scene Understanding from Videos

Whole understanding of the surroundings is paramount to autonomous syste...
research
10/09/2018

Understanding and Predicting the Memorability of Natural Scene Images

Memorability measures how easily an image is to be memorized after glanc...
research
08/18/2020

DeepLiDARFlow: A Deep Learning Architecture For Scene Flow Estimation Using Monocular Camera and Sparse LiDAR

Scene flow is the dense 3D reconstruction of motion and geometry of a sc...
research
10/07/2021

Estimating Image Depth in the Comics Domain

Estimating the depth of comics images is challenging as such images a) a...

Please sign up or login with your details

Forgot password? Click here to reset