Joint 3D Proposal Generation and Object Detection from View Aggregation

12/06/2017
by Jason Ku, et al.

We present AVOD, an Aggregate View Object Detection network for autonomous driving scenarios. The proposed neural network architecture uses LIDAR point clouds and RGB images to generate features that are shared by two subnetworks: a region proposal network (RPN) and a second-stage detector network. The proposed RPN uses a novel architecture capable of performing multimodal feature fusion to generate reliable 3D object proposals for multiple object classes in road scenes. Using these proposals, the second-stage detection network performs accurate oriented 3D bounding box regression and category classification to predict the extents, orientation, and classification of objects in 3D space. Our proposed architecture is shown to produce state-of-the-art results on the KITTI 3D object detection benchmark while running in real time with a low memory footprint, making it a suitable candidate for deployment on autonomous vehicles.
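To make the phrase "oriented 3D bounding box" concrete, the sketch below turns extents and an orientation into the eight corners of a box. It is only an illustration under stated assumptions: the parameterization (base center, length, width, height, yaw about the vertical axis), the axis convention, and the function names `box_corners_bev` and `box_corners_3d` are hypothetical and are not taken from the paper, whose actual box encoding is not described in this abstract.

```python
import math


def box_corners_bev(cx, cz, length, width, yaw):
    """Four ground-plane (x, z) corners of a box rotated by yaw radians.

    Assumed convention (hypothetical): x and z span the ground plane,
    and yaw is a rotation about the vertical y axis.
    """
    half_l, half_w = length / 2.0, width / 2.0
    # Corners in the box's own frame, before rotation and translation.
    local = [(half_l, half_w), (half_l, -half_w),
             (-half_l, -half_w), (-half_l, half_w)]
    c, s = math.cos(yaw), math.sin(yaw)
    # Standard 2D rotation followed by a shift to the box center.
    return [(cx + c * x - s * z, cz + s * x + c * z) for x, z in local]


def box_corners_3d(base_center, dims, yaw):
    """Eight (x, y, z) corners: the bottom face first, then the top face.

    base_center is the center of the bottom face; dims is
    (length, width, height). Both are illustrative choices.
    """
    cx, cy, cz = base_center
    length, width, height = dims
    base = box_corners_bev(cx, cz, length, width, yaw)
    bottom = [(x, cy, z) for x, z in base]
    top = [(x, cy + height, z) for x, z in base]
    return bottom + top


if __name__ == "__main__":
    for corner in box_corners_3d((10.0, 0.0, 5.0), (4.0, 2.0, 1.5), math.pi / 6):
        print(tuple(round(v, 3) for v in corner))
```

A detector that regresses boxes in this form would output the seven numbers (three for the base center, three for the dimensions, one for yaw) plus a class score; other encodings are equally possible.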

research
03/26/2019

FVNet: 3D Front-View Proposal Generation for Real-Time Object Detection from Point Clouds

3D object detection from raw and sparse point clouds has been far less t...
research
09/17/2020

Radar-Camera Sensor Fusion for Joint Object Detection and Distance Estimation in Autonomous Vehicles

In this paper we present a novel radar-camera sensor fusion framework fo...
research
02/17/2019

PIXOR: Real-time 3D Object Detection from Point Clouds

We address the problem of real-time 3D object detection from point cloud...
research
11/27/2019

PointRGCN: Graph Convolution Networks for 3D Vehicles Detection Refinement

In autonomous driving pipelines, perception modules provide a visual und...
research
08/27/2016

3D Object Proposals using Stereo Imagery for Accurate Object Class Detection

The goal of this paper is to perform 3D object detection in the context ...
research
01/07/2022

Extending One-Stage Detection with Open-World Proposals

In many applications, such as autonomous driving, hand manipulation, or ...
research
04/02/2020

DOPS: Learning to Detect 3D Objects and Predict their 3D Shapes

We propose DOPS, a fast single-stage 3D object detection method for LIDA...
