MMFN: Multi-Modal-Fusion-Net for End-to-End Driving

07/01/2022
by Qingwen Zhang, et al.

Inspired by the fact that humans use diverse sensory organs to perceive the world, end-to-end driving systems deploy sensors of different modalities to obtain the global context of the 3D scene. In previous work, camera and LiDAR inputs are fused through transformers for better driving performance. These inputs are normally further interpreted as high-level map information to assist navigation tasks. Nevertheless, extracting useful information from the complex map input is challenging, because redundant information may mislead the agent and degrade driving performance. We propose a novel approach that efficiently extracts features from vectorized High-Definition (HD) maps and uses them in end-to-end driving tasks. In addition, we design a new expert that considers multiple road rules to further enhance model performance. Experimental results show that both proposed improvements enable our agent to outperform other methods.

research
08/27/2020

Multi-View Fusion of Sensor Data for Improved Perception and Prediction in Autonomous Driving

We present an end-to-end method for object detection and trajectory pred...
research
08/02/2023

Interpretable End-to-End Driving Model for Implicit Scene Understanding

Driving scene understanding is to obtain comprehensive scene information...
research
09/11/2023

FusionFormer: A Multi-sensory Fusion in Bird's-Eye-View and Temporal Consistent Transformer for 3D Objection

Multi-sensor modal fusion has demonstrated strong advantages in 3D objec...
research
08/18/2021

End-to-End Urban Driving by Imitating a Reinforcement Learning Coach

End-to-end approaches to autonomous driving commonly rely on expert demo...
research
06/30/2022

LiDAR-as-Camera for End-to-End Driving

The core task of any autonomous driving system is to transform sensory i...
research
11/25/2018

Variational End-to-End Navigation and Localization

Deep learning has revolutionized the ability to learn "end-to-end" auton...
research
01/18/2021

MP3: A Unified Model to Map, Perceive, Predict and Plan

High-definition maps (HD maps) are a key component of most modern self-d...
