A Semi-Supervised Framework for Misinformation Detection

04/22/2023
by   Yueyang Liu, et al.

The spread of misinformation on social media has become a prevalent societal problem and a cause of many kinds of social unrest. Curtailing it is of great importance, and machine learning has shown significant promise. However, applying machine learning to this problem raises two main challenges. First, although misinformation is far too prevalent in absolute terms, it represents only a small proportion of all postings on social media. Second, labeling the massive amount of data needed to train a useful classifier is impractical. To address these challenges, we propose a simple semi-supervised learning framework for dealing with extreme class imbalance. Its advantage over other approaches is that it uses actual rather than simulated data to inflate the minority class. We tested our framework on two sets of Covid-related Twitter data and obtained a significant improvement in F1-measure in extremely imbalanced scenarios, compared with simple classical and deep-learning data generation methods such as SMOTE, ADASYN, and GAN-based data generation.
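
The abstract does not spell out how real data is selected to inflate the minority class, so the following is only a minimal sketch of the general idea, not the authors' method. It assumes a pool of unlabeled posts already encoded as numeric vectors and uses a nearest-centroid rule as a stand-in for whatever classifier the framework actually trains. The function name `inflate_minority` and the `margin` parameter are hypothetical.

```python
import math


def centroid(points):
    """Mean vector of a non-empty list of equal-length vectors."""
    n = len(points)
    return [sum(col) / n for col in zip(*points)]


def inflate_minority(labeled, unlabeled, margin=0.5):
    """Add real unlabeled points that look like the minority class.

    labeled:   list of (vector, label) pairs, label 1 = minority, 0 = majority
    unlabeled: list of vectors with no label
    margin:    hypothetical confidence knob; a point is accepted only if its
               distance to the minority centroid is below margin times its
               distance to the majority centroid
    """
    minority = [x for x, y in labeled if y == 1]
    majority = [x for x, y in labeled if y == 0]
    c_min, c_maj = centroid(minority), centroid(majority)

    augmented = list(labeled)
    for x in unlabeled:
        d_min = math.dist(x, c_min)
        d_maj = math.dist(x, c_maj)
        if d_min < margin * d_maj:
            # Pseudo-label assigned by the rule above, not a ground-truth label.
            augmented.append((x, 1))
    return augmented


# Toy data (assumed): four majority points, one minority point.
labeled = [
    ([0.0, 0.0], 0),
    ([0.2, 0.1], 0),
    ([0.1, 0.3], 0),
    ([0.3, 0.2], 0),
    ([5.0, 5.0], 1),
]
unlabeled = [[4.8, 5.1], [0.1, 0.1], [5.2, 4.9], [2.5, 2.5]]

augmented = inflate_minority(labeled, unlabeled)
minority_count = sum(1 for _, y in augmented if y == 1)
```

In contrast to SMOTE or ADASYN, which interpolate synthetic points between existing minority samples, every vector added here is an observed sample from the unlabeled pool. The ambiguous point `[2.5, 2.5]` is left out because it does not clear the margin, which illustrates the trade-off between growing the minority class and admitting noisy pseudo-labels.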

research
06/01/2020

Independent Component Analysis for Trustworthy Cyberspace during High Impact Events: An Application to Covid-19

Social media has become an important communication channel during high i...
research
12/02/2022

Fake detection in imbalance dataset by Semi-supervised learning with GAN

As social media grows faster, harassment becomes more prevalent which le...
research
02/23/2017

A Probabilistic Framework for Location Inference from Social Media

We study the extent to which we can infer users' geographical locations ...
research
01/29/2019

A semi-supervised approach to message stance classification

Social media communications are becoming increasingly prevalent; some us...
research
06/22/2021

Simulation-Driven COVID-19 Epidemiological Modeling with Social Media

Modern Bayesian approaches and workflows emphasize in how simulation is ...
research
11/04/2018

Semi-Supervised Confidence Network aided Gated Attention based Recurrent Neural Network for Clickbait Detection

Clickbaits are catchy headlines that are frequently used by social media...
