Few-shot Learning for Multi-modal Social Media Event Filtering

11/16/2022
by   José Nascimento, et al.
0

Social media has become an important data source for event analysis. When collecting this type of data, most contain no useful information to a target event. Thus, it is essential to filter out those noisy data at the earliest opportunity for a human expert to perform further inspection. Most existing solutions for event filtering rely on fully supervised methods for training. However, in many real-world scenarios, having access to large number of labeled samples is not possible. To deal with a few labeled sample training problem for event filtering, we propose a graph-based few-shot learning pipeline. We also release the Brazilian Protest Dataset to test our method. To the best of our knowledge, this dataset is the first of its kind in event filtering that focuses on protests in multi-modal social media data, with most of the text in Portuguese. Our experimental results show that our proposed pipeline has comparable performance with only a few labeled samples (60) compared with a fully labeled dataset (3100). To facilitate the research community, we make our dataset and code available at https://github.com/jdnascim/7Set-AL.

READ FULL TEXT

page 1

page 3

research
06/04/2021

A General Method for Event Detection on Social Media

Event detection on social media has attracted a number of researches, gi...
research
09/07/2023

Text-to-feature diffusion for audio-visual few-shot learning

Training deep learning models for video classification from audio-visual...
research
07/02/2023

Fraunhofer SIT at CheckThat! 2023: Mixing Single-Modal Classifiers to Estimate the Check-Worthiness of Multi-Modal Tweets

The option of sharing images, videos and audio files on social media ope...
research
08/11/2022

H4M: Heterogeneous, Multi-source, Multi-modal, Multi-view and Multi-distributional Dataset for Socioeconomic Analytics in the Case of Beijing

The study of socioeconomic status has been reformed by the availability ...
research
08/20/2020

VisualSem: a high-quality knowledge graph for vision and language

We argue that the next frontier in natural language understanding (NLU) ...
research
03/09/2022

A Weibo Dataset for the 2022 Russo-Ukrainian Crisis

Online social networks such as Twitter and Weibo play an important role ...
research
05/29/2023

TotalDefMeme: A Multi-Attribute Meme dataset on Total Defence in Singapore

Total Defence is a defence policy combining and extending the concept of...

Please sign up or login with your details

Forgot password? Click here to reset