NAIST COVID: Multilingual COVID-19 Twitter and Weibo Dataset

04/17/2020
by   Zhiwei Gao, et al.
0

Since the outbreak of coronavirus disease 2019 (COVID-19) in the late 2019, it has affected over 200 countries and billions of people worldwide. This has affected the social life of people owing to enforcements, such as "social distancing" and "stay at home." This has resulted in an increasing interaction through social media. Given that social media can bring us valuable information about COVID-19 at a global scale, it is important to share the data and encourage social media studies against COVID-19 or other infectious diseases. Therefore, we have released a multilingual dataset of social media posts related to COVID-19, consisting of microblogs in English and Japanese from Twitter and those in Chinese from Weibo. The data cover microblogs from January 20, 2020, to March 24, 2020. This paper also provides a quantitative as well as qualitative analysis of these datasets by creating daily word clouds as an example of text-mining analysis. The dataset is now available on Github. This dataset can be analyzed in a multitude of ways and is expected to help in efficient communication of precautions related to COVID-19.

READ FULL TEXT
research
04/09/2020

Large Arabic Twitter Dataset on COVID-19

The 2019 coronavirus disease (COVID-19), emerged late December 2019 in C...
research
04/09/2021

The Burden of Being a Bridge: Understanding the Role of Multilingual Users during the COVID-19 Pandemic

The outbreak of the COVID-19 pandemic triggers infodemic over online soc...
research
10/06/2020

Image-based Social Sensing: Combining AI and the Crowd to Mine Policy-Adherence Indicators from Twitter

Social Media provides a trove of information that, if aggregated and ana...
research
06/22/2021

Simulation-Driven COVID-19 Epidemiological Modeling with Social Media

Modern Bayesian approaches and workflows emphasize in how simulation is ...
research
07/26/2021

IRLCov19: A Large COVID-19 Multilingual Twitter Dataset of Indian Regional Languages

Emerged in Wuhan city of China in December 2019, COVID-19 continues to s...
research
06/27/2022

"Double vaccinated, 5G boosted!": Learning Attitudes towards COVID-19 Vaccination from Social Media

To address the vaccine hesitancy which impairs the efforts of the COVID-...
research
11/10/2021

Understanding COVID-19 Vaccine Reaction through Comparative Analysis on Twitter

Although multiple COVID-19 vaccines have been available for several mont...

Please sign up or login with your details

Forgot password? Click here to reset