Uncertainty-Encoded Multi-Modal Fusion for Robust Object Detection in Autonomous Driving

07/30/2023
by   Yang Lou, et al.
0

Multi-modal fusion has shown initial promising results for object detection of autonomous driving perception. However, many existing fusion schemes do not consider the quality of each fusion input and may suffer from adverse conditions on one or more sensors. While predictive uncertainty has been applied to characterize single-modal object detection performance at run time, incorporating uncertainties into the multi-modal fusion still lacks effective solutions due primarily to the uncertainty's cross-modal incomparability and distinct sensitivities to various adverse conditions. To fill this gap, this paper proposes Uncertainty-Encoded Mixture-of-Experts (UMoE) that explicitly incorporates single-modal uncertainties into LiDAR-camera fusion. UMoE uses individual expert network to process each sensor's detection result together with encoded uncertainty. Then, the expert networks' outputs are analyzed by a gating network to determine the fusion weights. The proposed UMoE module can be integrated into any proposal fusion pipeline. Evaluation shows that UMoE achieves a maximum of 10.67 the state-of-the-art proposal-level multi-modal object detectors under extreme weather, adversarial, and blinding attack scenarios.

READ FULL TEXT

page 2

page 4

page 10

research
02/01/2020

Leveraging Uncertainties for Deep Multi-modal Object Detection in Autonomous Driving

This work presents a probabilistic deep neural network that combines LiD...
research
08/20/2023

ThermRad: A Multi-modal Dataset for Robust 3D Object Detection under Challenging Conditions

Robust 3D object detection in extreme weather and illumination condition...
research
03/17/2023

GOOD: General Optimization-based Fusion for 3D Object Detection via LiDAR-Camera Object Candidates

3D object detection serves as the core basis of the perception tasks in ...
research
05/22/2018

A scene perception system for visually impaired based on object detection and classification using multi-modal DCNN

This paper represents a cost-effective scene perception system aimed tow...
research
09/18/2020

Multi-modal Experts Network for Autonomous Driving

End-to-end learning from sensory data has shown promising results in aut...
research
08/23/2022

DeepInteraction: 3D Object Detection via Modality Interaction

Existing top-performance 3D object detectors typically rely on the multi...
research
09/17/2022

RGB-Event Fusion for Moving Object Detection in Autonomous Driving

Moving Object Detection (MOD) is a critical vision task for successfully...

Please sign up or login with your details

Forgot password? Click here to reset