Borrowing Human Senses: Comment-Aware Self-Training for Social Media Multimodal Classification

03/27/2023
by Chunpu Xu, et al.

Social media produces massive amounts of multimedia content with paired images and text every day, creating a pressing need to automate vision and language understanding for various multimodal classification tasks. Compared with commonly studied vision-language data, social media posts tend to exhibit more implicit image-text relations. To better bridge the cross-modal semantics, we capture hinting features from user comments, which are retrieved by jointly leveraging visual and lingual similarity. The classification tasks are then explored via self-training in a teacher-student framework, motivated by the limited scale of labeled data in existing benchmarks. Extensive experiments are conducted on four multimodal social media benchmarks for image-text relation classification, sarcasm detection, sentiment classification, and hate speech detection. The results show that our method improves on previous state-of-the-art models, which do not employ comment modeling or self-training.
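
The abstract does not specify how visual and lingual similarity are combined for comment retrieval. As a minimal sketch of one plausible reading, the snippet below ranks candidate posts by a weighted sum of image and text cosine similarity and collects the comments of the top matches. The function names, the `alpha` weight, the `top_k` cutoff, and the toy two-dimensional vectors are all illustrative assumptions, not the authors' implementation.

```python
import math


def cosine(u, v):
    """Cosine similarity between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm if norm else 0.0


def retrieve_comments(query, pool, alpha=0.5, top_k=2):
    """Rank pool posts by a weighted image + text similarity to the query post
    and return the comments attached to the top_k most similar posts.

    alpha is a hypothetical mixing weight: 1.0 uses only image similarity,
    0.0 uses only text similarity.
    """
    scored = []
    for post in pool:
        score = (alpha * cosine(query["image_vec"], post["image_vec"])
                 + (1 - alpha) * cosine(query["text_vec"], post["text_vec"]))
        scored.append((score, post))
    # Sort on the score only, so the post dictionaries are never compared.
    scored.sort(key=lambda pair: pair[0], reverse=True)
    comments = []
    for _, post in scored[:top_k]:
        comments.extend(post["comments"])
    return comments


# Toy embeddings standing in for real image and text encoder outputs (assumed).
query = {"image_vec": [1.0, 0.0], "text_vec": [0.0, 1.0]}
pool = [
    {"image_vec": [1.0, 0.0], "text_vec": [0.0, 1.0], "comments": ["so true"]},
    {"image_vec": [0.0, 1.0], "text_vec": [1.0, 0.0], "comments": ["unrelated"]},
    {"image_vec": [1.0, 0.0], "text_vec": [1.0, 0.0], "comments": ["nice photo"]},
]
retrieved = retrieve_comments(query, pool)
```

In this toy setup the first pool post matches the query in both modalities and the third only in the image, so their comments are the ones retrieved; the retrieved comments would then serve as extra input features for the classifier.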

research
09/14/2023

Improving Multimodal Classification of Social Media Posts by Leveraging Image-Text Auxiliary tasks

Effectively leveraging multimodal information from social media posts is...
research
04/28/2023

The Emotions of the Crowd: Learning Image Sentiment from Tweets via Cross-modal Distillation

Trends and opinion mining in social media increasingly focus on novel in...
research
05/17/2019

Deep Unified Multimodal Embeddings for Understanding both Content and Users in Social Media Networks

There has been an explosion of multimodal content generated on social me...
research
10/06/2022

Time Will Change Things: An Empirical Study on Dynamic Language Understanding in Social Media Classification

Language features are ever-evolving in the real-world social media envir...
research
01/11/2023

Few-shot Learning for Cross-Target Stance Detection by Aggregating Multimodal Embeddings

Despite the increasing popularity of the stance detection task, existing...
research
05/26/2022

MemeTector: Enforcing deep focus for meme detection

Image memes and specifically their widely-known variation image macros, ...
research
06/05/2020

A Dataset and Benchmarks for Multimedia Social Analysis

We present a new publicly available dataset with the goal of advancing m...
