CentralNet: a Multilayer Approach for Multimodal Fusion

08/22/2018
by   Valentin Vielzeuf, et al.
4

This paper proposes a novel multimodal fusion approach, aiming to produce best possible decisions by integrating information coming from multiple media. While most of the past multimodal approaches either work by projecting the features of different modalities into the same space, or by coordinating the representations of each modality through the use of constraints, our approach borrows from both visions. More specifically, assuming each modality can be processed by a separated deep convolutional network, allowing to take decisions independently from each modality, we introduce a central network linking the modality specific networks. This central network not only provides a common feature embedding but also regularizes the modality specific networks through the use of multi-task learning. The proposed approach is validated on 4 different computer vision tasks on which it consistently improves the accuracy of existing multimodal fusion approaches.

READ FULL TEXT

page 8

page 11

page 12

research
07/03/2018

Multi-Level Feature Abstraction from Convolutional Neural Networks for Multimodal Biometric Identification

In this paper, we propose a deep multimodal fusion network to fuse multi...
research
11/20/2019

MMTM: Multimodal Transfer Module for CNN Fusion

In late fusion, each modality is processed in a separate unimodal Convol...
research
07/07/2020

3D Shape Reconstruction from Vision and Touch

When a toddler is presented a new toy, their instinctual behaviour is to...
research
06/25/2022

Defending Multimodal Fusion Models against Single-Source Adversaries

Beyond achieving high performance across many vision tasks, multimodal m...
research
09/15/2023

One-stage Modality Distillation for Incomplete Multimodal Learning

Learning based on multimodal data has attracted increasing interest rece...
research
01/25/2022

Multi-channel Attentive Graph Convolutional Network With Sentiment Fusion For Multimodal Sentiment Analysis

Nowadays, with the explosive growth of multimodal reviews on social medi...
research
07/03/2018

Generalized Bilinear Deep Convolutional Neural Networks for Multimodal Biometric Identification

In this paper, we propose to employ a bank of modality-dedicated Convolu...

Please sign up or login with your details

Forgot password? Click here to reset