Predict to Detect: Prediction-guided 3D Object Detection using Sequential Images

06/14/2023
by   Sanmin Kim, et al.
0

Recent camera-based 3D object detection methods have introduced sequential frames to improve the detection performance hoping that multiple frames would mitigate the large depth estimation error. Despite improved detection performance, prior works rely on naive fusion methods (e.g., concatenation) or are limited to static scenes (e.g., temporal stereo), neglecting the importance of the motion cue of objects. These approaches do not fully exploit the potential of sequential images and show limited performance improvements. To address this limitation, we propose a novel 3D object detection model, P2D (Predict to Detect), that integrates a prediction scheme into a detection framework to explicitly extract and leverage motion features. P2D predicts object information in the current frame using solely past frames to learn temporal motion features. We then introduce a novel temporal feature aggregation method that attentively exploits Bird's-Eye-View (BEV) features based on predicted object information, resulting in accurate 3D object detection. Experimental results demonstrate that P2D improves mAP and NDS by 3.0 that incorporating a prediction scheme can significantly improve detection accuracy.

READ FULL TEXT

page 3

page 5

page 8

research
09/14/2020

3D Object Detection and Tracking Based on Streaming Data

Recent approaches for 3D object detection have made tremendous progresse...
research
08/08/2023

SODFormer: Streaming Object Detection with Transformer Using Events and Frames

DAVIS camera, streaming two complementary sensing modalities of asynchro...
research
04/26/2023

Group Equivariant BEV for 3D Object Detection

Recently, 3D object detection has attracted significant attention and ac...
research
12/07/2019

Long Term Temporal Context for Per-Camera Object Detection

In static monitoring cameras, useful contextual information can stretch ...
research
03/29/2023

DORT: Modeling Dynamic Objects in Recurrent for Multi-Camera 3D Object Detection and Tracking

Recent multi-camera 3D object detectors usually leverage temporal inform...
research
10/17/2021

Revealing Disocclusions in Temporal View Synthesis through Infilling Vector Prediction

We consider the problem of temporal view synthesis, where the goal is to...
research
01/25/2023

On the Adversarial Robustness of Camera-based 3D Object Detection

In recent years, camera-based 3D object detection has gained widespread ...

Please sign up or login with your details

Forgot password? Click here to reset