Pillar R-CNN for Point Cloud 3D Object Detection

02/26/2023
by   Guangsheng Shi, et al.
0

The performance of point cloud 3D object detection hinges on effectively representing raw points, grid-based voxels or pillars. Recent two-stage 3D detectors typically take the point-voxel-based R-CNN paradigm, i.e., the first stage resorts to the 3D voxel-based backbone for 3D proposal generation on bird-eye-view (BEV) representation and the second stage refines them via the intermediate point representation. Their primary mechanisms involve the utilization of intermediary keypoints to restore the substantial 3D structure context from the converted BEV representation. The skilled point-voxel feature interaction, however, makes the entire detection pipeline more complex and compute-intensive. In this paper, we take a different viewpoint – the pillar-based BEV representation owns sufficient capacity to preserve the 3D structure. In light of the latest advances in BEV-based perception, we devise a conceptually simple yet effective two-stage 3D detection architecture, named Pillar R-CNN. On top of densified BEV feature maps, Pillar R-CNN can easily introduce the feature pyramid architecture to generate 3D proposals at various scales and take the simple 2D R-CNN style detect head for box refinement. Our Pillar R-CNN performs favorably against state-of-the-art 3D detectors on the large-scale Waymo Open Dataset but at a small extra cost. It should be highlighted that further exploration into BEV perception for applications involving autonomous driving is now possible thanks to the effective and elegant Pillar R-CNN architecture.

READ FULL TEXT
research
12/31/2020

Voxel R-CNN: Towards High Performance Voxel-based 3D Object Detection

Recent advances on 3D object detection heavily rely on how the 3D data a...
research
08/08/2019

Fast Point R-CNN

We present a unified, efficient and effective framework for point-cloud ...
research
11/01/2021

Structure Information is the Key: Self-Attention RoI Feature Extractor in 3D Object Detection

Unlike 2D object detection where all RoI features come from grid pixels,...
research
05/16/2022

PillarNet: Real-Time and High-Performance Pillar-based 3D Object Detection

Real-time and high-performance 3D object detection is of critical import...
research
03/02/2022

A Unified Query-based Paradigm for Point Cloud Understanding

3D point cloud understanding is an important component in autonomous dri...
research
08/15/2023

Exploiting Sparsity in Automotive Radar Object Detection Networks

Having precise perception of the environment is crucial for ensuring the...
research
09/20/2022

Rethinking Dimensionality Reduction in Grid-based 3D Object Detection

Bird's eye view (BEV) is widely adopted by most of the current point clo...

Please sign up or login with your details

Forgot password? Click here to reset