Fast-BEV: A Fast and Strong Bird's-Eye View Perception Baseline

01/29/2023
by   Yangguang Li, et al.
0

Recently, perception task based on Bird's-Eye View (BEV) representation has drawn more and more attention, and BEV representation is promising as the foundation for next-generation Autonomous Vehicle (AV) perception. However, most existing BEV solutions either require considerable resources to execute on-vehicle inference or suffer from modest performance. This paper proposes a simple yet effective framework, termed Fast-BEV , which is capable of performing faster BEV perception on the on-vehicle chips. Towards this goal, we first empirically find that the BEV representation can be sufficiently powerful without expensive transformer based transformation nor depth representation. Our Fast-BEV consists of five parts, We novelly propose (1) a lightweight deployment-friendly view transformation which fast transfers 2D image feature to 3D voxel space, (2) an multi-scale image encoder which leverages multi-scale information for better performance, (3) an efficient BEV encoder which is particularly designed to speed up on-vehicle inference. We further introduce (4) a strong data augmentation strategy for both image and BEV space to avoid over-fitting, (5) a multi-frame feature fusion mechanism to leverage the temporal information. Through experiments, on 2080Ti platform, our R50 model can run 52.6 FPS with 47.3 41.3 FPS and 47.5 the BEVDet4D-R50 model. Our largest model (R101@900x1600) establishes a competitive 53.5 benchmark with considerable accuracy and efficiency on current popular on-vehicle chips. The code is released at: https://github.com/Sense-GVT/Fast-BEV.

READ FULL TEXT

page 2

page 4

page 6

page 11

research
01/19/2023

Fast-BEV: Towards Real-time On-vehicle Bird's-Eye View Perception

Recently, the pure camera-based Bird's-Eye-View (BEV) perception removes...
research
02/15/2023

Tri-Perspective View for Vision-Based 3D Semantic Occupancy Prediction

Modern methods for vision-centric autonomous driving perception widely a...
research
09/23/2021

Keypoints-Based Deep Feature Fusion for Cooperative Vehicle Detection of Autonomous Driving

Sharing collective perception messages (CPM) between vehicles is investi...
research
11/19/2022

MatrixVT: Efficient Multi-Camera to BEV Transformation for 3D Perception

This paper proposes an efficient multi-camera to Bird's-Eye-View (BEV) v...
research
04/13/2023

RoboBEV: Towards Robust Bird's Eye View Perception under Corruptions

The recent advances in camera-based bird's eye view (BEV) representation...
research
06/16/2022

A Simple Baseline for BEV Perception Without LiDAR

Building 3D perception systems for autonomous vehicles that do not rely ...
research
06/09/2022

Efficient and Robust 2D-to-BEV Representation Learning via Geometry-guided Kernel Transformer

Learning Bird's Eye View (BEV) representation from surrounding-view came...

Please sign up or login with your details

Forgot password? Click here to reset