ArCOV-19: The First Arabic COVID-19 Twitter Dataset with Propagation Networks

04/13/2020
by   Fatima Haouari, et al.
0

In this paper, we present ArCOV-19, an Arabic COVID-19 Twitter dataset that covers the period from 27^th of January till 31^st of March 2020. ArCOV-19 is the first publicly-available Arabic Twitter dataset covering COVID-19 pandemic that includes around 748k popular tweets (according to Twitter search criterion) alongside the propagation networks of the most-popular subset of them. The propagation networks include both retweets and conversational threads (i.e., threads of replies). ArCOV-19 is designed to enable research under several domains including natural language processing, information retrieval, and social computing, among others. Preliminary analysis shows that ArCOV-19 captures rising discussions associated with the first reported cases of the disease as they appeared in the Arab world. In addition to the source tweets and the propagation networks, we also release the search queries and the language-independent crawler used to collect the tweets to encourage the curation of similar datasets.

READ FULL TEXT
research
10/17/2020

ArCOV19-Rumors: Arabic COVID-19 Twitter Dataset for Misinformation Detection

In this paper we introduce ArCOV19-Rumors, an Arabic COVID-19 Twitter da...
research
01/14/2023

Detecting Stance of Authorities towards Rumors in Arabic Tweets: A Preliminary Study

A myriad of studies addressed the problem of rumor verification in Twitt...
research
05/02/2020

Mega-COV: A Billion-Scale Dataset of 65 Languages For COVID-19

We describe Mega-COV, a billion-scale dataset from Twitter for studying ...
research
08/18/2017

EveTAR: Building a Large-Scale Multi-Task Test Collection over Arabic Tweets

This article introduces a new language-independent approach for creating...
research
10/05/2021

AraCOVID19-SSD: Arabic COVID-19 Sentiment and Sarcasm Detection Dataset

Coronavirus disease (COVID-19) is an infectious respiratory disease that...
research
07/23/2022

An NLP-Assisted Bayesian Time Series Analysis for Prevalence of Twitter Cyberbullying During the COVID-19 Pandemic

COVID-19 has brought about many changes in social dynamics. Stay-at-home...
research
06/21/2020

Automatic Query Optimization for Retrieving Traffic Tweets

Twitter, like many social media and data brokering companies, makes thei...

Please sign up or login with your details

Forgot password? Click here to reset