COVID-19 Twitter Dataset with Latent Topics, Sentiments and Emotions Attributes

by Raj Kumar Gupta, et al.

This paper presents a large annotated dataset of public expressions related to the COVID-19 pandemic. Through Twitter's standard search application programming interface, we retrieved over 63 million coronavirus-related public posts from more than 13 million unique users between 28 January and 1 July 2020. Using natural language processing techniques and machine-learning-based algorithms, we annotated each public tweet with seventeen latent semantic attributes: 1) ten binary attributes indicating the tweet's relevance or irrelevance to ten detected topics, 2) five quantitative attributes indicating the intensity of the valence or sentiment (from extremely negative to extremely positive) and the intensity of the fear, anger, sadness and joy emotions (from extremely low to extremely high), and 3) two qualitative attributes indicating the sentiment category and the dominant emotion category, respectively. We report basic descriptive statistics for the topic, sentiment and emotion attributes and their temporal distributions, and discuss the dataset's possible uses in communication, psychology, public health, economics and epidemiology research.
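
The seventeen attributes described above can be pictured as one record per tweet. The following is a minimal sketch only, not the dataset's actual schema: the class name, the field names, the ordering of the topic flags, the [0, 1] score range and the example label strings are all assumptions made for illustration.

```python
from dataclasses import dataclass
from typing import Tuple

# The four emotions named in the abstract.
EMOTIONS = ("fear", "anger", "sadness", "joy")


@dataclass(frozen=True)
class TweetAnnotation:
    """Hypothetical per-tweet record holding the seventeen attributes."""

    # 1) Ten binary topic-relevance flags (flag order is an assumption).
    topics: Tuple[bool, ...]
    # 2) Five quantitative intensity scores (the [0, 1] range is an assumption).
    valence: float
    fear: float
    anger: float
    sadness: float
    joy: float
    # 3) Two qualitative labels (label vocabulary is an assumption).
    sentiment_category: str
    emotion_category: str

    def __post_init__(self) -> None:
        # Reject records that do not carry exactly ten topic flags.
        if len(self.topics) != 10:
            raise ValueError("expected exactly ten topic flags")
        # Reject intensity scores outside the assumed [0, 1] range.
        for name in ("valence",) + EMOTIONS:
            score = getattr(self, name)
            if not 0.0 <= score <= 1.0:
                raise ValueError(f"{name} must lie in [0, 1], got {score}")

    def attribute_count(self) -> int:
        # 10 binary + 5 quantitative + 2 qualitative attributes.
        return len(self.topics) + 5 + 2
```

Grouping the ten topic flags in one tuple keeps the record compact while still letting a consumer check that the three attribute groups add up to seventeen.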




Leveraging Natural Language Processing to Mine Issues on Twitter During the COVID-19 Pandemic

The recent global outbreak of the coronavirus disease (COVID-19) has spr...

How People Respond to the COVID-19 Pandemic on Twitter: A Comparative Analysis of Emotional Expressions from US and India

The COVID-19 pandemic has claimed millions of lives worldwide and elicit...

COVID-19 Vaccine and Social Media: Exploring Emotions and Discussions on Twitter

Public response to COVID-19 vaccines is the key success factor to contro...

COVID-19 Public Opinion and Emotion Monitoring System Based on Time Series Thermal New Word Mining

With the spread and development of new epidemics, it is of great referen...

Contrastive Clustering: Toward Unsupervised Bias Reduction for Emotion and Sentiment Classification

Background: When neural network emotion and sentiment classifiers are us...
