DeepFusion: A Robust and Modular 3D Object Detector for Lidars, Cameras and Radars

09/26/2022
by   Florian Drews, et al.
0

We propose DeepFusion, a modular multi-modal architecture to fuse lidars, cameras and radars in different combinations for 3D object detection. Specialized feature extractors take advantage of each modality and can be exchanged easily, making the approach simple and flexible. Extracted features are transformed into bird's-eye-view as a common representation for fusion. Spatial and semantic alignment is performed prior to fusing modalities in the feature space. Finally, a detection head exploits rich multi-modal features for improved 3D detection performance. Experimental results for lidar-camera, lidar-camera-radar and camera-radar fusion show the flexibility and effectiveness of our fusion approach. In the process, we study the largely unexplored task of faraway car detection up to 225 meters, showing the benefits of our lidar-camera fusion. Furthermore, we investigate the required density of lidar points for 3D object detection and illustrate implications at the example of robustness against adverse weather conditions. Moreover, ablation studies on our camera-radar fusion highlight the importance of accurate depth estimation.

READ FULL TEXT

page 3

page 5

research
08/25/2022

Bridging the View Disparity of Radar and Camera Features for Multi-modal Fusion 3D Object Detection

Environmental perception with multi-modal fusion of radar and camera is ...
research
08/20/2023

ThermRad: A Multi-modal Dataset for Robust 3D Object Detection under Challenging Conditions

Robust 3D object detection in extreme weather and illumination condition...
research
04/19/2023

CrossFusion: Interleaving Cross-modal Complementation for Noise-resistant 3D Object Detection

The combination of LiDAR and camera modalities is proven to be necessary...
research
06/30/2022

HRFuser: A Multi-resolution Sensor Fusion Architecture for 2D Object Detection

Besides standard cameras, autonomous vehicles typically include multiple...
research
09/28/2021

Fail-Safe Human Detection for Drones Using a Multi-Modal Curriculum Learning Approach

Drones are currently being explored for safety-critical applications whe...
research
04/03/2023

CRN: Camera Radar Net for Accurate, Robust, Efficient 3D Perception

Autonomous driving requires an accurate and fast 3D perception system th...
research
12/15/2022

Multi-level and multi-modal feature fusion for accurate 3D object detection in Connected and Automated Vehicles

Aiming at highly accurate object detection for connected and automated v...

Please sign up or login with your details

Forgot password? Click here to reset