Straight to Shapes: Real-time Detection of Encoded Shapes

11/23/2016
by Saumya Jetley, et al.

Current object detection approaches predict bounding boxes, but these provide little instance-specific information beyond location, scale and aspect ratio. In this work, we propose to regress directly to objects' shapes in addition to their bounding boxes and categories. It is crucial to find a shape representation that is compact and decodable, and in which objects can be compared for higher-order concepts such as view similarity, pose variation and occlusion. To achieve this, we use a denoising convolutional auto-encoder to establish an embedding space, and place its decoder after a fast end-to-end network trained to regress directly to the encoded shape vectors. This yields what is, to the best of our knowledge, the first real-time shape prediction network, running at 35 FPS on a high-end desktop. With higher-order shape reasoning well integrated into the network pipeline, the network shows the useful practical quality of generalising to unseen categories that are similar to those in the training set, something that most existing approaches fail to handle.
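The data flow described in the abstract (encode a shape mask into a compact vector, have the detector regress that vector alongside the box and category, then decode the vector back into a mask) can be illustrated with a toy sketch. Everything below is an assumption made for illustration: the hand-written block-averaging codec stands in for the learned denoising convolutional auto-encoder, and the names `encode`, `decode`, `corrupt`, `GRID` and the `detection` dictionary are hypothetical, not taken from the paper.

```python
GRID = 4  # toy code has GRID * GRID values (assumption, not the paper's embedding size)


def encode(mask):
    """Toy encoder: block-average a square binary mask down to GRID x GRID values."""
    n = len(mask)
    b = n // GRID  # block side length; assumes n is divisible by GRID
    code = []
    for gy in range(GRID):
        for gx in range(GRID):
            total = sum(mask[gy * b + y][gx * b + x]
                        for y in range(b) for x in range(b))
            code.append(total / (b * b))
    return code


def decode(code, n):
    """Toy decoder: upsample the code to an n x n grid and threshold at 0.5."""
    b = n // GRID
    return [[1 if code[(y // b) * GRID + (x // b)] > 0.5 else 0
             for x in range(n)] for y in range(n)]


def corrupt(mask):
    """Flip a fixed sparse pattern of pixels, standing in for input noise."""
    return [[1 - p if (3 * x + 7 * y) % 11 == 0 else p
             for x, p in enumerate(row)] for y, row in enumerate(mask)]


def iou(a, b):
    """Intersection over union of two binary masks of equal size."""
    inter = sum(pa & pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    union = sum(pa | pb for ra, rb in zip(a, b) for pa, pb in zip(ra, rb))
    return inter / union if union else 1.0


# A 16 x 16 mask holding a filled 8 x 8 square aligned to the block grid.
N = 16
clean = [[1 if 4 <= y < 12 and 4 <= x < 12 else 0 for x in range(N)]
         for y in range(N)]

# Hypothetical stand-in for one detector output: box, category and shape code.
detection = {
    "box": (0.25, 0.25, 0.5, 0.5),  # x, y, width, height (normalised)
    "label": "toy-square",
    "shape_code": encode(clean),
}

# Decoding the regressed code recovers the mask for this detection.
decoded = decode(detection["shape_code"], N)

# Encoding a corrupted mask and decoding it still yields the clean shape,
# which is the behaviour a denoising codec is trained to provide.
noisy = corrupt(clean)
recovered = decode(encode(noisy), N)
```

In this toy setting the 256-pixel mask is summarised by 16 numbers, and `iou(recovered, clean)` measures how well the decoded shape matches the original. A learned auto-encoder would replace `encode` and `decode`, and a trained detection network would predict `shape_code` from the image rather than receive it from the encoder.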

research
06/08/2015

You Only Look Once: Unified, Real-Time Object Detection

We present YOLO, a new approach to object detection. Prior work on objec...
research
11/21/2022

NeRF-RPN: A general framework for object detection in NeRFs

This paper presents the first significant object detection framework, Ne...
research
12/21/2020

From Points to Multi-Object 3D Reconstruction

We propose a method to detect and reconstruct multiple 3D objects from a...
research
03/18/2020

Rethinking Object Detection in Retail Stores

The conventional standard for object detection uses a bounding box to repr...
research
08/31/2020

Analysis and Prediction of Deforming 3D Shapes using Oriented Bounding Boxes and LSTM Autoencoders

For sequences of complex 3D shapes in time we present a general approach...
research
04/02/2020

DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes

We propose DOPS, a fast single-stage 3D object detection method for LIDA...
research
06/15/2019

Deep Set Prediction Networks

We study the problem of predicting a set from a feature vector with a de...
