Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets

09/20/2023
by   Yida Mu, et al.
0

A crucial aspect of a rumor detection model is its ability to generalize, particularly its ability to detect emerging, previously unknown rumors. Past research has indicated that content-based (i.e., using solely source posts as input) rumor detection models tend to perform less effectively on unseen rumors. At the same time, the potential of context-based models remains largely untapped. The main contribution of this paper is in the in-depth evaluation of the performance gap between content and context-based models specifically on detecting new, unseen rumors. Our empirical findings demonstrate that context-based models are still overly dependent on the information derived from the rumors' source post and tend to overlook the significant role that contextual information can play. We also study the effect of data split strategies on classifier performance. Based on our experimental results, the paper also offers practical suggestions on how to minimize the effects of temporal concept drift in static datasets during the training of rumor detection methods.

READ FULL TEXT

page 6

page 8

page 12

research
02/06/2023

It's about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits

New events emerge over time influencing the topics of rumors in social m...
research
11/27/2021

Abusive and Threatening Language Detection in Urdu using Boosting based and BERT based models: A Comparative Approach

Online hatred is a growing concern on many social media platforms. To ad...
research
11/19/2021

Toxicity Detection can be Sensitive to the Conversational Context

User posts whose perceived toxicity depends on the conversational contex...
research
12/19/2018

Cyberbullying Detection in Social Networks Using Deep Learning Based Models; A Reproducibility Study

Cyberbullying is a disturbing online misbehaviour with troubling consequ...
research
11/30/2022

RAFT: Rationale adaptor for few-shot abusive language detection

Abusive language is a concerning problem in online social media. Past re...
research
08/02/2020

Trawling for Trolling: A Dataset

The ability to accurately detect and filter offensive content automatica...
research
09/25/2018

LOBO -- Evaluation of Generalization Deficiencies in Twitter Bot Classifiers

Botnets in online social networks are increasingly often affecting the r...

Please sign up or login with your details

Forgot password? Click here to reset