Identifying Misinformation on YouTube through Transcript Contextual Analysis with Transformer Models

07/22/2023
by   Christos Christodoulou, et al.
0

Misinformation on YouTube is a significant concern, necessitating robust detection strategies. In this paper, we introduce a novel methodology for video classification, focusing on the veracity of the content. We convert the conventional video classification task into a text classification task by leveraging the textual content derived from the video transcripts. We employ advanced machine learning techniques like transfer learning to solve the classification challenge. Our approach incorporates two forms of transfer learning: (a) fine-tuning base transformer models such as BERT, RoBERTa, and ELECTRA, and (b) few-shot learning using sentence-transformers MPNet and RoBERTa-large. We apply the trained models to three datasets: (a) YouTube Vaccine-misinformation related videos, (b) YouTube Pseudoscience videos, and (c) Fake-News dataset (a collection of articles). Including the Fake-News dataset extended the evaluation of our approach beyond YouTube videos. Using these datasets, we evaluated the models distinguishing valid information from misinformation. The fine-tuned models yielded Matthews Correlation Coefficient>0.81, accuracy>0.90, and F1 score>0.90 in two of three datasets. Interestingly, the few-shot models outperformed the fine-tuned ones by 20 both Accuracy and F1 score for the YouTube Pseudoscience dataset, highlighting the potential utility of this approach – especially in the context of limited training data.

READ FULL TEXT
research
10/31/2019

Transfer Learning from Transformers to Fake News Challenge Stance Detection (FNC-1) Task

In this paper, we report improved results of the Fake News Challenge Sta...
research
01/28/2021

A transformer based approach for fighting COVID-19 fake news

The rapid outbreak of COVID-19 has caused humanity to come to a stand-st...
research
09/09/2023

Analysis of Disinformation and Fake News Detection Using Fine-Tuned Large Language Model

The paper considers the possibility of fine-tuning Llama 2 large languag...
research
06/13/2016

MITRE at SemEval-2016 Task 6: Transfer Learning for Stance Detection

We describe MITRE's submission to the SemEval-2016 Task 6, Detecting Sta...
research
05/15/2022

Evaluating Generalizability of Fine-Tuned Models for Fake News Detection

The Covid-19 pandemic has caused a dramatic and parallel rise in dangero...
research
02/15/2021

Identifying Misinformation from Website Screenshots

Can the look and the feel of a website give information about the trustw...
research
04/16/2023

MisRoBÆRTa: Transformers versus Misinformation

Misinformation is considered a threat to our democratic values and princ...

Please sign up or login with your details

Forgot password? Click here to reset