Fishing for Clickbaits in Social Images and Texts with Linguistically-Infused Neural Network Models

10/17/2017
by   Maria Glenski, et al.
0

This paper presents the results and conclusions of our participation in the Clickbait Challenge 2017 on automatic clickbait detection in social media. We first describe linguistically-infused neural network models and identify informative representations to predict the level of clickbaiting present in Twitter posts. Our models allow to answer the question not only whether a post is a clickbait or not, but to what extent it is a clickbait post e.g., not at all, slightly, considerably, or heavily clickbaity using a score ranging from 0 to 1. We evaluate the predictive power of models trained on varied text and image representations extracted from tweets. Our best performing model that relies on the tweet text and linguistic markers of biased language extracted from the tweet and the corresponding page yields mean squared error (MSE) of 0.04, mean absolute error (MAE) of 0.16 and R2 of 0.43 on the held-out test data. For the binary classification setup (clickbait vs. non-clickbait), our model achieved F1 score of 0.69. We have not found that image representations combined with text yield significant performance improvement yet. Nevertheless, this work is the first to present preliminary analysis of objects extracted using Google Tensorflow object detection API from images in clickbait vs. non-clickbait Twitter posts. Finally, we outline several steps to improve model performance as a part of the future work.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/01/2023

Floods Relevancy and Identification of Location from Twitter Posts using NLP Techniques

This paper presents our solutions for the MediaEval 2022 task on Disaste...
research
07/20/2021

Checkovid: A COVID-19 misinformation detection system on Twitter using network and content mining perspectives

During the COVID-19 pandemic, social media platforms were ideal for comm...
research
06/16/2022

JU_NLP at HinglishEval: Quality Evaluation of the Low-Resource Code-Mixed Hinglish Text

In this paper we describe a system submitted to the INLG 2022 Generation...
research
04/05/2019

CLEARumor at SemEval-2019 Task 7: ConvoLving ELMo Against Rumors

This paper describes our submission to SemEval-2019 Task 7: RumourEval: ...
research
10/24/2017

Clickbait Identification using Neural Networks

This paper presents the results of our participation in the Clickbait De...
research
08/01/2021

You too Brutus! Trapping Hateful Users in Social Media: Challenges, Solutions Insights

Hate speech is regarded as one of the crucial issues plaguing the online...

Please sign up or login with your details

Forgot password? Click here to reset