Center Feature Fusion: Selective Multi-Sensor Fusion of Center-based Objects

09/26/2022
by   Philip Jacobson, et al.
0

Leveraging multi-modal fusion, especially between camera and LiDAR, has become essential for building accurate and robust 3D object detection systems for autonomous vehicles. Until recently, point decorating approaches, in which point clouds are augmented with camera features, have been the dominant approach in the field. However, these approaches fail to utilize the higher resolution images from cameras. Recent works projecting camera features to the bird's-eye-view (BEV) space for fusion have also been proposed, however they require projecting millions of pixels, most of which only contain background information. In this work, we propose a novel approach Center Feature Fusion (CFF), in which we leverage center-based detection networks in both the camera and LiDAR streams to identify relevant object locations. We then use the center-based detection to identify the locations of pixel features relevant to object locations, a small fraction of the total number in the image. These are then projected and fused in the BEV frame. On the nuScenes dataset, we outperform the LiDAR-only baseline by 4.9 features than other fusion methods.

READ FULL TEXT

page 2

page 4

page 5

research
09/09/2020

RoIFusion: 3D Object Detection from LiDAR and Vision

When localizing and detecting 3D objects for autonomous driving scenes, ...
research
06/16/2022

A Simple Baseline for BEV Perception Without LiDAR

Building 3D perception systems for autonomous vehicles that do not rely ...
research
09/12/2022

Multi-modal Streaming 3D Object Detection

Modern autonomous vehicles rely heavily on mechanical LiDARs for percept...
research
03/15/2021

3D-FFS: Faster 3D object detection with Focused Frustum Search in sensor fusion based networks

In this work we propose 3D-FFS, a novel approach to make sensor fusion b...
research
11/14/2019

PI-RCNN: An Efficient Multi-sensor 3D Object Detector with Point-based Attentive Cont-conv Fusion Module

LIDAR point clouds and RGB-images are both extremely essential for 3D ob...
research
11/01/2021

VPFNet: Voxel-Pixel Fusion Network for Multi-class 3D Object Detection

Many LiDAR-based methods for detecting large objects, single-class objec...
research
09/06/2023

3D Object Positioning Using Differentiable Multimodal Learning

This article describes a multi-modal method using simulated Lidar data v...

Please sign up or login with your details

Forgot password? Click here to reset