Learning Rich Features from RGB-D Images for Object Detection and Segmentation

07/22/2014
by   Saurabh Gupta, et al.
0

In this paper we study the problem of object detection for RGB-D images using semantically rich image and depth features. We propose a new geocentric embedding for depth images that encodes height above ground and angle with gravity for each pixel in addition to the horizontal disparity. We demonstrate that this geocentric embedding works better than using raw depth images for learning feature representations with convolutional neural networks. Our final object detection system achieves an average precision of 37.3 relative improvement over existing methods. We then focus on the task of instance segmentation where we label pixels belonging to object instances found by our detector. For this task, we propose a decision forest approach that classifies pixels in the detection window as foreground or background using a family of unary and binary tests that query shape and geocentric pose features. Finally, we use the output from our object detectors in an existing superpixel classification framework for semantic scene segmentation and achieve a 24 relative improvement over current state-of-the-art for the object categories that we study. We believe advances such as those represented in this paper will facilitate the use of perception in fields like robotics.

READ FULL TEXT

page 2

page 11

research
12/22/2014

Convolutional Neural Networks for joint object detection and pose estimation: A comparative study

In this paper we study the application of convolutional neural networks ...
research
11/28/2016

Object Detection Free Instance Segmentation With Labeling Transformations

Instance segmentation has attracted recent attention in computer vision ...
research
07/25/2019

Object as Distribution

Object detection is a critical part of visual scene understanding. The r...
research
07/30/2020

Learning RGB-D Feature Embeddings for Unseen Object Instance Segmentation

Segmenting unseen objects in cluttered scenes is an important skill that...
research
08/07/2020

Cascade Graph Neural Networks for RGB-D Salient Object Detection

In this paper, we study the problem of salient object detection (SOD) fo...
research
06/07/2020

CubifAE-3D: Monocular Camera Space Cubification on Autonomous Vehicles for Auto-Encoder based 3D Object Detection

We introduce a method for 3D object detection using a single monocular i...
research
08/02/2017

Semantic Instance Labeling Leveraging Hierarchical Segmentation

Most of the approaches for indoor RGBD semantic la- beling focus on usin...

Please sign up or login with your details

Forgot password? Click here to reset