Learning to Recover 3D Scene Shape from a Single Image

12/17/2020
by   Wei Yin, et al.
0

Despite significant progress in monocular depth estimation in the wild, recent state-of-the-art methods cannot be used to recover accurate 3D scene shape due to an unknown depth shift induced by shift-invariant reconstruction losses used in mixed-data depth prediction training, and possible unknown camera focal length. We investigate this problem in detail, and propose a two-stage framework that first predicts depth up to an unknown scale and shift from a single monocular image, and then use 3D point cloud encoders to predict the missing depth shift and focal length that allow us to recover a realistic 3D scene shape. In addition, we propose an image-level normalized regression loss and a normal-based geometry loss to enhance depth prediction models trained on mixed datasets. We test our depth model on nine unseen datasets and achieve state-of-the-art performance on zero-shot dataset generalization. Code is available at: https://git.io/Depth

READ FULL TEXT

page 6

page 7

page 11

page 12

page 13

page 14

page 15

page 16

research
07/20/2023

Metric3D: Towards Zero-shot Metric 3D Prediction from A Single Image

Reconstructing accurate 3D scenes from images is a long-standing vision ...
research
09/18/2023

Robust Geometry-Preserving Depth Estimation Using Differentiable Rendering

In this study, we address the challenge of 3D scene structure recovery f...
research
07/30/2018

Geo-Supervised Visual Depth Prediction

We propose using global orientation from inertial measurements, and the ...
research
09/11/2023

SIM-Sync: From Certifiably Optimal Synchronization over the 3D Similarity Group to Scene Reconstruction with Learned Depth

This paper presents SIM-Sync, a certifiably optimal algorithm that estim...
research
07/29/2019

Enforcing geometric constraints of virtual normal for depth prediction

Monocular depth prediction plays a crucial role in understanding 3D scen...
research
02/26/2021

Boundary-induced and scene-aggregated network for monocular depth prediction

Monocular depth prediction is an important task in scene understanding. ...
research
06/05/2023

Single-Stage 3D Geometry-Preserving Depth Estimation Model Training on Dataset Mixtures with Uncalibrated Stereo Data

Nowadays, robotics, AR, and 3D modeling applications attract considerabl...

Please sign up or login with your details

Forgot password? Click here to reset