OA-BEV: Bringing Object Awareness to Bird's-Eye-View Representation for Multi-Camera 3D Object Detection

01/13/2023
by   Xiaomeng Chu, et al.

The recent trend in multi-camera 3D object detection is to use a unified bird's-eye view (BEV) representation. However, directly transforming features extracted from the image-plane view to BEV inevitably results in feature distortion, especially around the objects of interest, causing the objects to blur into the background. To address this, we propose OA-BEV, a network that can be plugged into BEV-based 3D object detection frameworks to bring out the objects by incorporating object-aware pseudo-3D features and depth features. These features carry information about each object's position and 3D structure. First, we explicitly guide the network to learn the depth distribution through object-level supervision from each 3D object's center. Then, we select the foreground pixels with a 2D object detector and project them into 3D space for pseudo-voxel feature encoding. Finally, the object-aware depth features and pseudo-voxel features are incorporated into the BEV representation with a deformable attention mechanism. We conduct extensive experiments on the nuScenes dataset to validate the merits of the proposed OA-BEV. Our method achieves consistent improvements over the BEV-based baselines in terms of both average precision and nuScenes detection score. Our code will be released.
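The second step of the abstract, projecting foreground pixels into 3D space, can be illustrated with standard pinhole back-projection. The sketch below is not the authors' implementation. It assumes a per-pixel depth map is already available, that the 2D detector returns an axis-aligned box `(x1, y1, x2, y2)` with exclusive upper bounds, and that the camera intrinsics `fx`, `fy`, `cx`, `cy` are known. The function name `unproject_box_pixels` and all parameter names are hypothetical.

```python
from typing import List, Sequence, Tuple

Point3D = Tuple[float, float, float]


def unproject_box_pixels(
    depth: Sequence[Sequence[float]],
    box: Tuple[int, int, int, int],
    fx: float,
    fy: float,
    cx: float,
    cy: float,
) -> List[Point3D]:
    """Back-project the pixels inside a 2D box into camera-frame 3D points.

    Assumes a pinhole camera: X = (u - cx) * Z / fx, Y = (v - cy) * Z / fy.
    `box` is (x1, y1, x2, y2) in pixels, with x2 and y2 exclusive.
    """
    height, width = len(depth), len(depth[0])
    x1, y1, x2, y2 = box
    # Clamp the box to the image so out-of-range detections do not index past the map.
    x1, x2 = max(0, x1), min(width, x2)
    y1, y2 = max(0, y1), min(height, y2)

    points: List[Point3D] = []
    for v in range(y1, y2):
        for u in range(x1, x2):
            z = depth[v][u]
            if z <= 0:
                # Skip pixels without a valid depth estimate.
                continue
            points.append(((u - cx) * z / fx, (v - cy) * z / fy, z))
    return points


# Toy example: a 4x4 depth map at a constant 10 m and one 2x2 foreground box.
toy_depth = [[10.0] * 4 for _ in range(4)]
toy_points = unproject_box_pixels(toy_depth, (1, 1, 3, 3), fx=2.0, fy=2.0, cx=2.0, cy=2.0)
print(len(toy_points))  # 4 pixels fall inside the box
```

The resulting points live in the camera frame. Moving them into a shared ego or BEV frame would additionally require the camera extrinsics, and the pseudo-voxel encoding described in the abstract would operate on such points; neither is covered by this sketch.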


