FocalFormer3D : Focusing on Hard Instance for 3D Object Detection

08/08/2023
by   Yilun Chen, et al.
0

False negatives (FN) in 3D object detection, e.g., missing predictions of pedestrians, vehicles, or other obstacles, can lead to potentially dangerous situations in autonomous driving. While being fatal, this issue is understudied in many current 3D detection methods. In this work, we propose Hard Instance Probing (HIP), a general pipeline that identifies FN in a multi-stage manner and guides the models to focus on excavating difficult instances. For 3D object detection, we instantiate this method as FocalFormer3D, a simple yet effective detector that excels at excavating difficult objects and improving prediction recall. FocalFormer3D features a multi-stage query generation to discover hard objects and a box-level transformer decoder to efficiently distinguish objects from massive object candidates. Experimental results on the nuScenes and Waymo datasets validate the superior performance of FocalFormer3D. The advantage leads to strong performance on both detection and tracking, in both LiDAR and multi-modal settings. Notably, FocalFormer3D achieves a 70.5 mAP and 73.9 NDS on nuScenes detection benchmark, while the nuScenes tracking benchmark shows 72.1 AMOTA, both ranking 1st place on the nuScenes LiDAR leaderboard. Our code is available at <https://github.com/NVlabs/FocalFormer3D>.

READ FULL TEXT
research
12/31/2020

TransTrack: Multiple-Object Tracking with Transformer

Multiple-object tracking(MOT) is mostly dominated by complex and multi-s...
research
07/05/2023

Focusing on what to decode and what to train: Efficient Training with HOI Split Decoders and Specific Target Guided DeNoising

Recent one-stage transformer-based methods achieve notable gains in the ...
research
06/11/2020

Quasi-Dense Instance Similarity Learning

Similarity metrics for instances have drawn much attention, due to their...
research
04/09/2023

Curricular Object Manipulation in LiDAR-based Object Detection

This paper explores the potential of curriculum learning in LiDAR-based ...
research
10/13/2022

H2RBox: Horizonal Box Annotation is All You Need for Oriented Object Detection

Oriented object detection emerges in many applications from aerial image...
research
03/30/2020

RetinaTrack: Online Single Stage Joint Detection and Tracking

Traditionally multi-object tracking and object detection are performed u...
research
03/29/2022

Interactive Multi-Class Tiny-Object Detection

Annotating tens or hundreds of tiny objects in a given image is laboriou...

Please sign up or login with your details

Forgot password? Click here to reset