A novel multimodal fusion network based on a joint coding model for lane line segmentation

03/20/2021
by Zhenhong Zou, et al.

There has recently been growing interest in utilizing multimodal sensors to achieve robust lane line segmentation. In this paper, we introduce a novel multimodal fusion architecture from an information theory perspective and demonstrate its practical utility using Light Detection and Ranging (LiDAR) and camera fusion networks. In particular, we formulate, for the first time, a multimodal fusion network as a joint coding model, in which each node, layer, and pipeline is represented as a channel. Forward propagation is thus equivalent to information transmission through these channels, which lets us analyze the effect of different fusion approaches both qualitatively and quantitatively. We argue that the optimal fusion architecture is related to the essential capacity and its allocation based on the source and channel. To test this multimodal fusion hypothesis, we progressively derive a series of multimodal models based on the proposed fusion methods and evaluate them on the KITTI and A2D2 datasets. Our optimal fusion network achieves 85% accuracy and 98.7% [...] continuing future research into the development of optimal fusion algorithms for the deep multimodal learning community.
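The channel view in the abstract can be illustrated with a toy example. The sketch below is not the paper's model: it assumes two discrete binary symmetric channels standing in for two layers of a pipeline, and the helper names (`mutual_information`, `cascade`, `bsc`) are hypothetical. It only shows the general information-theoretic fact that passing a signal through a second noisy channel cannot increase the information it carries about the source.

```python
import math

def mutual_information(p_x, channel):
    """I(X;Y) in bits, given an input distribution p_x and a row-stochastic
    transition matrix with channel[x][y] = P(Y=y | X=x)."""
    n_out = len(channel[0])
    # Marginal output distribution P(Y=y).
    p_y = [sum(p_x[x] * channel[x][y] for x in range(len(p_x)))
           for y in range(n_out)]
    info = 0.0
    for x, px in enumerate(p_x):
        for y in range(n_out):
            joint = px * channel[x][y]
            if joint > 0.0:  # skip zero-probability terms (0 * log 0 = 0)
                info += joint * math.log2(channel[x][y] / p_y[y])
    return info

def cascade(first, second):
    """Transition matrix of two channels applied in sequence (matrix product)."""
    return [[sum(first[x][k] * second[k][y] for k in range(len(second)))
             for y in range(len(second[0]))]
            for x in range(len(first))]

def bsc(flip_prob):
    """Binary symmetric channel that flips its input bit with flip_prob."""
    return [[1.0 - flip_prob, flip_prob],
            [flip_prob, 1.0 - flip_prob]]

# Assumed toy setup: a uniform binary source and two noisy "layers".
p_x = [0.5, 0.5]
layer1 = bsc(0.1)
layer2 = bsc(0.2)
pipeline = cascade(layer1, layer2)

i_one = mutual_information(p_x, layer1)    # information after the first layer
i_two = mutual_information(p_x, pipeline)  # information after both layers

print(f"I(X; layer1 output)   = {i_one:.4f} bits")
print(f"I(X; pipeline output) = {i_two:.4f} bits")
```

Under these assumptions the second value is never larger than the first, which is the data processing inequality. In this framing, a fusion design decides how much of each sensor's information reaches the output through the available channels.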

research
08/11/2021

Learning Deep Multimodal Feature Representation with Asymmetric Multi-layer Fusion

We propose a compact and effective framework to fuse multimodal features...
research
02/17/2023

Tensorized Optical Multimodal Fusion Network

We propose the first tensorized optical multimodal fusion network archit...
research
06/03/2023

Provable Dynamic Fusion for Low-Quality Multimodal Data

The inherent challenge of multimodal fusion is to precisely capture the ...
research
09/06/2022

Finger Multimodal Feature Fusion and Recognition Based on Channel Spatial Attention

Due to the instability and limitations of unimodal biometric systems, mu...
research
04/17/2018

Deep Multimodal Subspace Clustering Networks

We present convolutional neural network (CNN) based approaches for unsup...
research
01/03/2019

A Network-based Multimodal Data Fusion Approach for Characterizing Dynamic Multimodal Physiological Patterns

Characterizing the dynamic interactive patterns of complex systems helps...
research
12/18/2021

Multiple Time Series Fusion Based on LSTM: An Application to CAP A Phase Classification Using EEG

Biomedical decision making involves multiple signal processing, either f...
