Impact of Pseudo Depth on Open World Object Segmentation with Minimal User Guidance

by Robin Schön, et al.

Pseudo depth maps are depth map predictions that are used as ground truth during training. In this paper we leverage pseudo depth maps to segment objects of classes that have never been seen during training, which renders our object segmentation task an open world task. The pseudo depth maps are generated using pretrained networks, which have either been trained with the explicit intention of generalizing to downstream tasks (LeRes and MiDaS), or which have been trained in an unsupervised fashion on video sequences (MonodepthV2). To tell our network which object to segment, we provide it with a single click on the object's surface, placed on the pseudo depth map of the image, as input. We test our approach in two different scenarios: one without the RGB image and one where the RGB image is part of the input. Our results demonstrate considerably better generalization from seen to unseen object types when depth is used. On the Semantic Boundaries Dataset we achieve an improvement from 61.57 to 69.79 IoU score on unseen classes when using only half of the classes during training and performing the segmentation on depth maps only.
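The IoU (Intersection over Union) score reported above compares a predicted mask with a ground-truth mask. The following is a minimal sketch of that metric for binary masks, not the authors' evaluation code. The function name `iou`, the nested-list mask representation, and the handling of two empty masks are assumptions made for illustration.

```python
def iou(pred, target):
    """Intersection over Union of two binary masks (nested lists of 0/1)."""
    if len(pred) != len(target) or any(
        len(p_row) != len(t_row) for p_row, t_row in zip(pred, target)
    ):
        raise ValueError("masks must have the same shape")

    intersection = 0
    union = 0
    for pred_row, target_row in zip(pred, target):
        for p, t in zip(pred_row, target_row):
            # A pixel is in the intersection if both masks mark it,
            # and in the union if at least one mask marks it.
            intersection += 1 if (p and t) else 0
            union += 1 if (p or t) else 0

    # Assumption: two empty masks are treated as a perfect match.
    return 1.0 if union == 0 else intersection / union


# Hypothetical 2x3 masks, for illustration only.
pred = [[0, 1, 1],
        [0, 1, 0]]
target = [[0, 1, 0],
          [0, 1, 1]]
score = iou(pred, target)  # 2 shared pixels out of 4 marked pixels
print(f"IoU: {score:.2f} ({100 * score:.2f} on a 0-100 scale)")
```

The scores quoted in the abstract (61.57 and 69.79) appear to be on a 0-100 scale, which corresponds to multiplying the ratio computed here by 100.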



Design Pseudo Ground Truth with Motion Cue for Unsupervised Video Object Segmentation

One major technique debt in video object segmentation is to label the ob...

Matte Anything: Interactive Natural Image Matting with Segment Anything Models

Natural image matting algorithms aim to predict the transparency map (al...

DenseLiDAR: A Real-Time Pseudo Dense Depth Guided Depth Completion Network

Depth Completion can produce a dense depth map from a sparse input and p...

Depth-SIMS: Semi-Parametric Image and Depth Synthesis

In this paper we present a compositing image synthesis method that gener...

Depth-wise layering of 3d images using dense depth maps: a threshold based approach

Image segmentation has long been a basic problem in computer vision. Dep...

Learning to Better Segment Objects from Unseen Classes with Unlabeled Videos

The ability to localize and segment objects from unseen classes would op...

DenseAttentionSeg: Segment Hands from Interacted Objects Using Depth Input

We propose a real-time DNN-based technique to segment hand and object of...
