AraCOVID19-MFH: Arabic COVID-19 Multi-label Fake News and Hate Speech Detection Dataset

Along with the COVID-19 pandemic, an "infodemic" of false and misleading information has emerged and has complicated the COVID-19 response efforts. Social networking sites such as Facebook and Twitter have contributed largely to the spread of rumors, conspiracy theories, hate, xenophobia, racism, and prejudice. To combat the spread of fake news, researchers around the world have and are still making considerable efforts to build and share COVID-19 related research articles, models, and datasets. This paper releases "AraCOVID19-MFH" a manually annotated multi-label Arabic COVID-19 fake news and hate speech detection dataset. Our dataset contains 10,828 Arabic tweets annotated with 10 different labels. The labels have been designed to consider some aspects relevant to the fact-checking task, such as the tweet's check worthiness, positivity/negativity, and factuality. To confirm our annotated dataset's practical utility, we used it to train and evaluate several classification models and reported the obtained results. Though the dataset is mainly designed for fake news detection, it can also be used for hate speech detection, opinion/news classification, dialect identification, and many other tasks.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/06/2022

Arabic Fake News Detection Based on Deep Contextualized Embedding Models

Social media is becoming a source of news for many people due to its eas...
research
10/05/2021

AraCOVID19-SSD: Arabic COVID-19 Sentiment and Sarcasm Detection Dataset

Coronavirus disease (COVID-19) is an infectious respiratory disease that...
research
06/19/2020

FakeCovid – A Multilingual Cross-domain Fact Check News Dataset for COVID-19

In this paper, we present a first multilingual cross-domain dataset of 5...
research
11/05/2020

Machine Generation and Detection of Arabic Manipulated and Fake News

Fake news and deceptive machine-generated text are serious problems thre...
research
10/17/2020

Drink bleach or do what now? Covid-HeRA: A dataset for risk-informed health decision making in the presence of COVID19 misinformation

Given the wide spread of inaccurate medical advice related to the 2019 c...
research
12/20/2020

Fake news agenda in the era of COVID-19: Identifying trends through fact-checking content

The rise of social media has ignited an unprecedented circulation of fal...
research
04/26/2020

Detecting fake news for the new coronavirus by reasoning on the Covid-19 ontology

In the context of the Covid-19 pandemic, many were quick to spread decep...

Please sign up or login with your details

Forgot password? Click here to reset