It's about Time: Rethinking Evaluation on Rumor Detection Benchmarks using Chronological Splits

02/06/2023
by   Yida Mu, et al.
0

New events emerge over time influencing the topics of rumors in social media. Current rumor detection benchmarks use random splits as training, development and test sets which typically results in topical overlaps. Consequently, models trained on random splits may not perform well on rumor classification on previously unseen topics due to the temporal concept drift. In this paper, we provide a re-evaluation of classification models on four popular rumor detection benchmarks considering chronological instead of random splits. Our experimental results show that the use of random splits can significantly overestimate predictive performance across all datasets and models. Therefore, we suggest that rumor detection models should always be evaluated using chronological splits for minimizing topical overlaps.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/10/2023

Examining Temporalities on Stance Detection Towards COVID-19 Vaccination

Previous studies have highlighted the importance of vaccination as an ef...
research
09/20/2023

Examining the Limitations of Computational Rumor Detection Models Trained on Static Datasets

A crucial aspect of a rumor detection model is its ability to generalize...
research
09/20/2022

Twitter Topic Classification

Social media platforms host discussions about a wide variety of topics t...
research
09/17/2019

Concept Drift Adaptive Physical Event Detection for Social Media Streams

Event detection has long been the domain of physical sensors operating i...
research
07/12/2020

Stance Detection in Web and Social Media: A Comparative Study

Online forums and social media platforms are increasingly being used to ...
research
10/06/2022

Time Will Change Things: An Empirical Study on Dynamic Language Understanding in Social Media Classification

Language features are ever-evolving in the real-world social media envir...
research
05/01/2020

We Need to Talk About Random Splits

Gorman and Bedrick (2019) recently argued for using random splits rather...

Please sign up or login with your details

Forgot password? Click here to reset