Multi-modal Sentiment Analysis using Deep Canonical Correlation Analysis

07/15/2019
by Zhongkai Sun, et al.

This paper learns multi-modal embeddings from text, audio, and video views (modes) of data in order to improve downstream sentiment classification. The experimental framework also allows investigation of the relative contribution of each individual view to the final multi-modal embedding. Features derived from the three views are combined into a multi-modal embedding using Deep Canonical Correlation Analysis (DCCA) in two ways: (i) One-Step DCCA and (ii) Two-Step DCCA. Text embeddings are learned using BERT, the current state of the art in text encoders. We posit that this highly optimized algorithm dominates the contributions of the other views, though each view does contribute to the final result. Classification tasks are carried out on two benchmark datasets and on a new Debate Emotion dataset, and together these demonstrate that One-Step DCCA outperforms the current state of the art in learning multi-modal embeddings.
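To make the idea of correlation-based fusion concrete, here is a minimal sketch of plain linear CCA on two views in NumPy. It is not the paper's method: DCCA first passes each view through a neural network, and the abstract does not specify how the One-Step and Two-Step variants are wired together. The function name `linear_cca`, the regularizer `reg`, the synthetic "text" and "audio" features, and the concatenation of the projected views are all assumptions made for illustration.

```python
import numpy as np

def linear_cca(X, Y, k, reg=1e-4):
    """Linear CCA sketch: return projections A, B and the top-k canonical correlations."""
    Xc = X - X.mean(axis=0)
    Yc = Y - Y.mean(axis=0)
    n = X.shape[0]
    # Regularized within-view covariances and the cross-view covariance.
    Sxx = Xc.T @ Xc / (n - 1) + reg * np.eye(X.shape[1])
    Syy = Yc.T @ Yc / (n - 1) + reg * np.eye(Y.shape[1])
    Sxy = Xc.T @ Yc / (n - 1)

    def inv_sqrt(S):
        # Inverse matrix square root of a symmetric positive-definite matrix.
        w, V = np.linalg.eigh(S)
        return V @ np.diag(w ** -0.5) @ V.T

    Sxx_is, Syy_is = inv_sqrt(Sxx), inv_sqrt(Syy)
    # Singular values of the whitened cross-covariance are the canonical correlations.
    U, s, Vt = np.linalg.svd(Sxx_is @ Sxy @ Syy_is)
    A = Sxx_is @ U[:, :k]
    B = Syy_is @ Vt.T[:, :k]
    return A, B, s[:k]

# Synthetic stand-ins for two views that share a 2-dimensional latent signal (assumed data).
rng = np.random.default_rng(0)
z = rng.normal(size=(500, 2))
text = z @ rng.normal(size=(2, 5)) + 0.1 * rng.normal(size=(500, 5))
audio = z @ rng.normal(size=(2, 4)) + 0.1 * rng.normal(size=(500, 4))

A, B, corrs = linear_cca(text, audio, k=2)
# One plausible fused embedding: concatenate the two projected views.
embedding = np.hstack([(text - text.mean(axis=0)) @ A,
                       (audio - audio.mean(axis=0)) @ B])
```

The fused `embedding` could then be fed to any downstream classifier. A deep variant would replace the raw features `X` and `Y` with the outputs of trainable per-view networks and optimize the same correlation objective.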


