ConQueR: Query Contrast Voxel-DETR for 3D Object Detection

12/14/2022
by Benjin Zhu, et al.

Although DETR-based 3D detectors simplify the detection pipeline and produce direct sparse predictions, their performance still lags behind dense detectors with post-processing for 3D object detection from point clouds. DETRs usually adopt far more queries than there are ground-truth objects (GTs) in a scene (e.g., 300 queries vs. 40 objects in Waymo), which inevitably incurs many false positives during inference. In this paper, we propose a simple yet effective sparse 3D detector, named Query Contrast Voxel-DETR (ConQueR), to eliminate the challenging false positives and achieve more accurate and sparser predictions. We observe that most false positives are highly overlapping in local regions, caused by the lack of explicit supervision to discriminate locally similar queries. We thus propose a Query Contrast mechanism to explicitly enhance queries towards their best-matched GTs over all unmatched query predictions. This is achieved by constructing positive and negative GT-query pairs for each GT, and applying a contrastive loss that enhances positive GT-query pairs against negative ones based on feature similarities. ConQueR closes the gap between sparse and dense 3D detectors, and reduces false positives by up to 60%. Single-frame ConQueR achieves a new state-of-the-art (sota) 71.6 mAPH/L2 on the challenging Waymo Open Dataset validation set, outperforming previous sota methods (e.g., PV-RCNN++) by over 2.0 mAPH/L2.
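To make the Query Contrast idea concrete, the following is a minimal sketch of a contrastive loss over GT-query pairs. It is not the authors' implementation: the abstract only says that positive pairs are enhanced against negative ones based on feature similarities, so the InfoNCE-style form, the cosine similarity, the function name `query_contrast_loss`, the `temperature` value, and the assumption that `matched_idx` comes from an earlier matching step are all illustrative choices.

```python
import math


def _normalize(vec):
    """Scale a non-zero vector to unit length so dot products are cosine similarities."""
    norm = math.sqrt(sum(x * x for x in vec))
    return [x / norm for x in vec]


def query_contrast_loss(gt_embs, query_embs, matched_idx, temperature=0.7):
    """InfoNCE-style loss (an assumed form, not taken from the paper).

    gt_embs:     list of GT embedding vectors, one per ground-truth object
    query_embs:  list of query embedding vectors, one per detector query
    matched_idx: matched_idx[i] is the index of the query matched to GT i
                 (the positive); every other query is a negative for GT i
    temperature: hypothetical softmax temperature
    """
    gts = [_normalize(g) for g in gt_embs]
    queries = [_normalize(q) for q in query_embs]

    total = 0.0
    for gt, pos in zip(gts, matched_idx):
        # Cosine similarity between this GT and every query, scaled by temperature.
        logits = [sum(a * b for a, b in zip(gt, q)) / temperature for q in queries]
        # Numerically stable log-sum-exp over all queries (positive and negatives).
        peak = max(logits)
        log_denominator = peak + math.log(sum(math.exp(l - peak) for l in logits))
        # Negative log-probability of picking the positive query for this GT.
        total += log_denominator - logits[pos]
    return total / len(gts)


# Toy scene: 2 GTs and 3 queries; query 2 is a near-duplicate of query 0.
demo_gts = [[1.0, 0.0], [0.0, 1.0]]
demo_queries = [[1.0, 0.0], [0.0, 1.0], [0.9, 0.1]]
loss_correct_match = query_contrast_loss(demo_gts, demo_queries, [0, 1])
loss_swapped_match = query_contrast_loss(demo_gts, demo_queries, [1, 0])
```

In this toy scene the near-duplicate query acts as a hard negative for the first GT, which is the locally similar case the abstract describes. Minimizing such a loss pulls each GT's matched query closer in feature space while pushing the other queries away.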


research
08/16/2020

False Detection (Positives and Negatives) in Object Detection

Object detection is a very important function of visual perception syste...
research
03/22/2023

Dense Distinct Query for End-to-End Object Detection

One-to-one label assignment in object detection has successfully obviate...
research
08/18/2023

SparseBEV: High-Performance Sparse 3D Object Detection from Multi-Camera Videos

Camera-based 3D object detection in BEV (Bird's Eye View) space has draw...
research
03/20/2023

VoxelNeXt: Fully Sparse VoxelNet for 3D Object Detection and Tracking

3D object detectors usually rely on hand-crafted proxies, e.g., anchors ...
research
03/18/2021

SparsePoint: Fully End-to-End Sparse 3D Object Detector

Object detectors based on sparse object proposals have recently been pro...
research
03/19/2018

Revisiting RCNN: On Awakening the Classification Power of Faster RCNN

Recent region-based object detectors are usually built with separate cla...
research
02/19/2015

Visualizing Object Detection Features

We introduce algorithms to visualize feature spaces used by object detec...
