Multi-Camera Calibration Free BEV Representation for 3D Object Detection

10/31/2022
by   Hongxiang Jiang, et al.
0

In advanced paradigms of autonomous driving, learning Bird's Eye View (BEV) representation from surrounding views is crucial for multi-task framework. However, existing methods based on depth estimation or camera-driven attention are not stable to obtain transformation under noisy camera parameters, mainly with two challenges, accurate depth prediction and calibration. In this work, we present a completely Multi-Camera Calibration Free Transformer (CFT) for robust BEV representation, which focuses on exploring implicit mapping, not relied on camera intrinsics and extrinsics. To guide better feature learning from image views to BEV, CFT mines potential 3D information in BEV via our designed position-aware enhancement (PA). Instead of camera-driven point-wise or global transformation, for interaction within more effective region and lower computation cost, we propose a view-aware attention which also reduces redundant computation and promotes converge. CFT achieves 49.7 nuScenes detection task leaderboard, which is the first work removing camera parameters, comparable to other geometry-guided methods. Without temporal input and other modal information, CFT achieves second highest performance with a smaller image input 1600 * 640. Thanks to view-attention variant, CFT reduces memory and transformer FLOPs for vanilla attention by about 12 respectively, with improved NDS by 1.0 noisy camera parameters makes CFT more competitive.

READ FULL TEXT

page 11

page 15

research
04/07/2022

SurroundDepth: Entangling Surrounding Views for Self-Supervised Multi-Camera Depth Estimation

Depth estimation from images serves as the fundamental step of 3D percep...
research
03/07/2023

Calibration-free BEV Representation for Infrastructure Perception

Effective BEV object detection on infrastructure can greatly improve tra...
research
04/06/2023

EGA-Depth: Efficient Guided Attention for Self-Supervised Multi-Camera Depth Estimation

The ubiquitous multi-camera setup on modern autonomous vehicles provides...
research
03/03/2023

Towards Domain Generalization for Multi-view 3D Object Detection in Bird-Eye-View

Multi-view 3D object detection (MV3D-Det) in Bird-Eye-View (BEV) has dra...
research
04/19/2023

UniCal: a Single-Branch Transformer-Based Model for Camera-to-LiDAR Calibration and Validation

We introduce a novel architecture, UniCal, for Camera-to-LiDAR (C2L) ext...
research
10/07/2021

Camera Calibration through Camera Projection Loss

Camera calibration is a necessity in various tasks including 3D reconstr...
research
12/22/2021

BEVDet: High-performance Multi-camera 3D Object Detection in Bird-Eye-View

Autonomous driving perceives the surrounding environment for decision ma...

Please sign up or login with your details

Forgot password? Click here to reset