Learning Object Depth from Camera Motion and Video Object Segmentation

07/11/2020
by   Brent A. Griffin, et al.
8

Video object segmentation, i.e., the separation of a target object from background in video, has made significant progress on real and challenging videos in recent years. To leverage this progress in 3D applications, this paper addresses the problem of learning to estimate the depth of segmented objects given some measurement of camera motion (e.g., from robot kinematics or vehicle odometry). We achieve this by, first, introducing a diverse, extensible dataset and, second, designing a novel deep network that estimates the depth of objects using only segmentation masks and uncalibrated camera movement. Our data-generation framework creates artificial object segmentations that are scaled for changes in distance between the camera and object, and our network learns to estimate object depth even with segmentation errors. We demonstrate our approach across domains using a robot camera to locate objects from the YCB dataset and a vehicle camera to locate obstacles while driving.

READ FULL TEXT

page 2

page 9

page 10

page 11

page 16

page 18

research
03/02/2021

Depth from Camera Motion and Object Detection

This paper addresses the problem of learning to estimate the depth of de...
research
03/20/2019

Video Object Segmentation-based Visual Servo Control and Object Depth Estimation on a Mobile Robot Platform

To be useful in everyday environments, robots must be able to identify a...
research
03/28/2019

BubbleNets: Learning to Select the Guidance Frame in Video Object Segmentation by Deep Sorting Frames

Semi-supervised video object segmentation has made significant progress ...
research
03/18/2019

EV-IMO: Motion Segmentation Dataset and Learning Pipeline for Event Cameras

We present the first event-based learning approach for motion segmentati...
research
07/31/2022

One Object at a Time: Accurate and Robust Structure From Motion for Robots

A gaze-fixating robot perceives distance to the fixated object and relat...
research
05/05/2021

Moving SLAM: Fully Unsupervised Deep Learning in Non-Rigid Scenes

We propose a method to train deep networks to decompose videos into 3D g...
research
06/26/2020

An Advert Creation System for 3D Product Placements

Over the past decade, the evolution of video-sharing platforms has attra...

Please sign up or login with your details

Forgot password? Click here to reset