Progressive Fusion for Multimodal Integration

09/01/2022
by   Shiv Shankar, et al.
2

Integration of multimodal information from various sources has been shown to boost the performance of machine learning models and thus has received increased attention in recent years. Often such models use deep modality-specific networks to obtain unimodal features which are combined to obtain "late-fusion" representations. However, these designs run the risk of information loss in the respective unimodal pipelines. On the other hand, "early-fusion" methodologies, which combine features early, suffer from the problems associated with feature heterogeneity and high sample complexity. In this work, we present an iterative representation refinement approach, called Progressive Fusion, which mitigates the issues with late fusion representations. Our model-agnostic technique introduces backward connections that make late stage fused representations available to early layers, improving the expressiveness of the representations at those stages, while retaining the advantages of late fusion designs. We test Progressive Fusion on tasks including affective sentiment detection, multimedia analysis, and time series fusion with different models, demonstrating its versatility. We show that our approach consistently improves performance, for instance attaining a 5 reduction in MSE and 40 prediction.

READ FULL TEXT

page 4

page 21

research
08/13/2019

Variational Fusion for Multimodal Sentiment Analysis

Multimodal fusion is considered a key step in multimodal tasks such as s...
research
04/19/2023

MMDR: A Result Feature Fusion Object Detection Approach for Autonomous System

Object detection has been extensively utilized in autonomous systems in ...
research
04/08/2021

Multimodal Fusion Refiner Networks

Tasks that rely on multi-modal information typically include a fusion mo...
research
01/24/2022

MMLatch: Bottom-up Top-down Fusion for Multimodal Sentiment Analysis

Current deep learning approaches for multimodal fusion rely on bottom-up...
research
04/22/2021

Neuro-inspired edge feature fusion using Choquet integrals

It is known that the human visual system performs a hierarchical informa...
research
04/30/2014

Majority Vote of Diverse Classifiers for Late Fusion

In the past few years, a lot of attention has been devoted to multimedia...
research
04/27/2020

A general approach to progressive learning

In biological learning, data is used to improve performance on the task ...

Please sign up or login with your details

Forgot password? Click here to reset