Background Prompting for Improved Object Depth

06/08/2023
by   Manel Baradad, et al.
0

Estimating the depth of objects from a single image is a valuable task for many vision, robotics, and graphics applications. However, current methods often fail to produce accurate depth for objects in diverse scenes. In this work, we propose a simple yet effective Background Prompting strategy that adapts the input object image with a learned background. We learn the background prompts only using small-scale synthetic object datasets. To infer object depth on a real image, we place the segmented object into the learned background prompt and run off-the-shelf depth networks. Background Prompting helps the depth networks focus on the foreground object, as they are made invariant to background variations. Moreover, Background Prompting minimizes the domain gap between synthetic and real object images, leading to better sim2real generalization than simple finetuning. Results on multiple synthetic and real datasets demonstrate consistent improvements in real object depths for a variety of existing depth networks. Code and optimized background prompts can be found at: https://mbaradad.github.io/depth_prompt.

READ FULL TEXT

page 1

page 2

page 4

page 7

page 8

page 13

page 14

page 15

research
09/17/2019

Task-Aware Monocular Depth Estimation for 3D Object Detection

Monocular depth estimation enables 3D perception from a single 2D image,...
research
05/19/2020

Focus on defocus: bridging the synthetic to real domain gap for depth estimation

Data-driven depth estimation methods struggle with the generalization ou...
research
03/15/2023

RICO: Regularizing the Unobservable for Indoor Compositional Reconstruction

Recently, neural implicit surfaces have become popular for multi-view re...
research
10/04/2015

Background Image Generation Using Boolean Operations

Tracking moving objects from a video sequence requires segmentation of t...
research
07/27/2021

DCL: Differential Contrastive Learning for Geometry-Aware Depth Synthesis

We describe a method for realistic depth synthesis that learns diverse v...
research
06/14/2015

Resolving Scale Ambiguity Via XSlit Aspect Ratio Analysis

In perspective cameras, images of a frontal-parallel 3D object preserve ...
research
01/17/2019

Background subtraction on depth videos with convolutional neural networks

Background subtraction is a significant component of computer vision sys...

Please sign up or login with your details

Forgot password? Click here to reset