Survey on Deep Multi-modal Data Analytics: Collaboration, Rivalry and Fusion

06/15/2020
by   Yang Wang, et al.
0

With the development of web technology, multi-modal or multi-view data has surged as a major stream for big data, where each modal/view encodes individual property of data objects. Often, different modalities are complementary to each other. Such fact motivated a lot of research attention on fusing the multi-modal feature spaces to comprehensively characterize the data objects. Most of the existing state-of-the-art focused on how to fuse the energy or information from multi-modal spaces to deliver a superior performance over their counterparts with single modal. Recently, deep neural networks have exhibited as a powerful architecture to well capture the nonlinear distribution of high-dimensional multimedia data, so naturally does for multi-modal data. Substantial empirical studies are carried out to demonstrate its advantages that are benefited from deep multi-modal methods, which can essentially deepen the fusion from multi-modal deep feature spaces. In this paper, we provide a substantial overview of the existing state-of-the-arts on the filed of multi-modal data analytics from shallow to deep spaces. Throughout this survey, we further indicate that the critical components for this field go to collaboration, adversarial competition and fusion over multi-modal spaces. Finally, we share our viewpoints regarding some future directions on this field.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/11/2019

Multi-modal Deep Analysis for Multimedia

With the rapid development of Internet and multimedia services in the pa...
research
10/03/2016

Multi-View Representation Learning: A Survey from Shallow Methods to Deep Methods

Recently, multi-view representation learning has become a rapidly growin...
research
08/19/2023

Interpretation on Multi-modal Visual Fusion

In this paper, we present an analytical framework and a novel metric to ...
research
03/23/2022

Modality Competition: What Makes Joint Training of Multi-modal Network Fail in Deep Learning? (Provably)

Despite the remarkable success of deep multi-modal learning in practice,...
research
03/03/2020

Deep Multi-Modal Sets

Many vision-related tasks benefit from reasoning over multiple modalitie...
research
12/02/2022

MMBench: Benchmarking End-to-End Multi-modal DNNs and Understanding Their Hardware-Software Implications

The explosive growth of various types of big data and advances in AI tec...
research
10/26/2018

Investigating non-classical correlations between decision fused multi-modal documents

Correlation has been widely used to facilitate various information retri...

Please sign up or login with your details

Forgot password? Click here to reset