TBCOV: Two Billion Multilingual COVID-19 Tweets with Sentiment, Entity, Geo, and Gender Labels

10/04/2021
by   Muhammad Imran, et al.

The widespread use of social networks during mass convergence events, such as health emergencies and disease outbreaks, provides instant access to citizen-generated data that carry rich information about public opinions, sentiments, urgent needs, and situational reports. Such information can help authorities understand the emerging situation and react accordingly. Moreover, social media plays a vital role in tackling misinformation and disinformation. This work presents TBCOV, a large-scale Twitter dataset comprising more than two billion multilingual tweets related to the COVID-19 pandemic, collected worldwide over a continuous period of more than one year. Several state-of-the-art deep learning models are used to enrich the data with important attributes, including sentiment labels, named entities (e.g., mentions of persons, organizations, locations), user types, and gender information. In addition, a geotagging method is proposed to assign country, state, county, and city information to tweets, enabling a wide range of data analysis tasks for understanding real-world issues. Our sentiment and trend analyses reveal interesting insights and confirm TBCOV's broad coverage of important topics.

research
06/13/2021

Sentiment Analysis of Covid-19 Tweets using Evolutionary Classification-Based LSTM Model

As the Covid-19 outbreaks rapidly all over the world day by day and also...
research
08/27/2020

Cross-language sentiment analysis of European Twitter messages during the COVID-19 pandemic

Social media data can be a very salient source of information during cri...
research
05/22/2020

GeoCoV19: A Dataset of Hundreds of Millions of Multilingual COVID-19 Tweets with Location Information

The past several years have witnessed a huge surge in the use of social ...
research
08/27/2020

Twitter Interaction to Analyze Covid-19 Impact in Ghana, Africa from March to July

The novel coronavirus, COVID-19, has impacted various aspects of the wor...
research
03/06/2022

Twitter Dataset for 2022 Russo-Ukrainian Crisis

Online Social Networks (OSNs) play a significant role in information sha...
research
07/05/2022

Deep Learning Reveals Patterns of Diverse and Changing Sentiments Towards COVID-19 Vaccines Based on 11 Million Tweets

Over 12 billion doses of COVID-19 vaccines have been administered at the...
research
05/21/2020

COVID-19 Public Sentiment Insights and Machine Learning for Tweets Classification

Along with the Coronavirus pandemic, another crisis has manifested itsel...
