Meme Sentiment Analysis Enhanced with Multimodal Spatial Encoding and Facial Embedding

03/03/2023
by   Muzhaffar Hazman, et al.
0

Internet memes are characterised by the interspersing of text amongst visual elements. State-of-the-art multimodal meme classifiers do not account for the relative positions of these elements across the two modalities, despite the latent meaning associated with where text and visual elements are placed. Against two meme sentiment classification datasets, we systematically show performance gains from incorporating the spatial position of visual objects, faces, and text clusters extracted from memes. In addition, we also present facial embedding as an impactful enhancement to image representation in a multimodal meme classifier. Finally, we show that incorporating this spatial information allows our fully automated approaches to outperform their corresponding baselines that rely on additional human validation of OCR-extracted text.

READ FULL TEXT
research
07/29/2017

Benchmarking Multimodal Sentiment Analysis

We propose a framework for multimodal sentiment analysis and emotion rec...
research
02/03/2018

Multimodal Sentiment Analysis with Word-Level Fusion and Reinforcement Learning

With the increasing popularity of video sharing websites such as YouTube...
research
08/01/2023

Unimodal Intermediate Training for Multimodal Meme Sentiment Classification

Internet Memes remain a challenging form of user-generated content for a...
research
07/21/2020

IITK at SemEval-2020 Task 8: Unimodal and Bimodal Sentiment Analysis of Internet Memes

Social media is abundant in visual and textual information presented tog...
research
07/17/2021

M2Lens: Visualizing and Explaining Multimodal Models for Sentiment Analysis

Multimodal sentiment analysis aims to recognize people's attitudes from ...
research
12/15/2021

Quantitative analysis of visual representation of sign elements in COVID-19 context

Representation is the way in which human beings re-present the reality o...
research
03/26/2021

DBATES: DataBase of Audio features, Text, and visual Expressions in competitive debate Speeches

In this work, we present a database of multimodal communication features...

Please sign up or login with your details

Forgot password? Click here to reset