Single Multi-feature detector for Amodal 3D Object Detection in RGB-D Images

11/01/2017
by   Qianhui Luo, et al.
0

This paper aims at fast and high-accuracy amodal 3D object detections in RGB-D images, which requires a compact 3D bounding box around the whole object even under partial observations. To avoid the time-consuming proposals preextraction, we propose a single end-to-end framework based on the deep neural networks which hierarchically incorporates appearance and geometric features from 2.5D representation to 3D objects. The depth information has helped on reducing the output space of 3D bounding boxes into a manageable set of 3D anchor boxes with different sizes on multiple feature layers. At prediction time, in a convolutional fashion, the network predicts scores for categories and adjustments for locations, sizes and orientations of each 3D anchor box, which has considered multi-scale 2D features. Experiments on the challenging SUN RGB-D datasets show that our algorithm outperforms the state-of-the-art by 10.2 in mAP and is 88x faster than the Deep Sliding Shape. In addition, experiments suggest our algorithm even with a smaller input image size performs comparably but is 454x faster than the state-of-art on NYUV2 datasets.

READ FULL TEXT

page 2

page 8

research
04/15/2019

Universal Bounding Box Regression and Its Applications

Bounding-box regression is a popular technique to refine or predict loca...
research
12/08/2015

SSD: Single Shot MultiBox Detector

We present a method for detecting objects in images using a single deep ...
research
11/30/2016

Deep Cuboid Detection: Beyond 2D Bounding Boxes

We present a Deep Cuboid Detector which takes a consumer-quality RGB ima...
research
06/26/2020

Expandable YOLO: 3D Object Detection from RGB-D Images

This paper aims at constructing a light-weight object detector that inpu...
research
11/12/2015

ProNet: Learning to Propose Object-specific Boxes for Cascaded Neural Networks

This paper aims to classify and locate objects accurately and efficientl...
research
12/16/2019

PerspectiveNet: 3D Object Detection from a Single RGB Image via Perspective Points

Detecting 3D objects from a single RGB image is intrinsically ambiguous,...
research
04/12/2016

Fast Object Localization Using a CNN Feature Map Based Multi-Scale Search

Object localization is an important task in computer vision but requires...

Please sign up or login with your details

Forgot password? Click here to reset