
The emojification of sentiment on social media: Collection and analysis of a longitudinal Twitter sentiment dataset

by Wenjie Yin, et al.

Social media, as a means for computer-mediated communication, has been extensively used to study the sentiment expressed by users around events or topics. There is, however, a gap in the longitudinal study of how sentiment has evolved on social media over the years. To fill this gap, we develop TM-Senti, a new large-scale, distantly supervised Twitter sentiment dataset with over 184 million tweets, covering a period of over seven years. We describe and assess our methodology for assembling a large-scale sentiment analysis dataset labelled by emoticons and emojis, and we analyse the resulting dataset. Our analysis highlights interesting temporal changes, among them the increasing use of emojis over emoticons. We publicly release the dataset for further research in tasks including sentiment analysis and text classification of tweets. The dataset can be fully rehydrated, including tweet metadata and without missing tweets, thanks to the archive of tweets publicly available on the Internet Archive, on which the dataset is based.
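
As a rough illustration of what emoticon- and emoji-based distant supervision can look like, the sketch below labels a tweet by the sentiment marker it contains and then strips the marker from the text. The marker sets, the function name `distant_label`, and the rule of discarding tweets with conflicting markers are assumptions made for this example; they are not taken from the TM-Senti methodology.

```python
from typing import Optional, Tuple

# Hypothetical marker sets, for illustration only.
# These are NOT the emoticon/emoji lists used to build TM-Senti.
POSITIVE = {":)", ":-)", ":D", "😊", "😀"}
NEGATIVE = {":(", ":-(", "😢", "😞"}


def distant_label(text: str) -> Optional[Tuple[str, str]]:
    """Return (cleaned_text, label), or None when the tweet is unusable.

    A tweet is unusable here if it contains no marker at all, or if it
    contains both positive and negative markers (an assumed filtering rule).
    """
    has_pos = any(marker in text for marker in POSITIVE)
    has_neg = any(marker in text for marker in NEGATIVE)
    if has_pos == has_neg:  # neither present, or both present
        return None

    label = "positive" if has_pos else "negative"

    # Remove the markers so a classifier cannot simply memorise them.
    cleaned = text
    for marker in POSITIVE | NEGATIVE:
        cleaned = cleaned.replace(marker, "")
    cleaned = " ".join(cleaned.split())  # collapse leftover whitespace
    return cleaned, label
```

Calling `distant_label("great day :)")` would yield the cleaned text together with a positive label, while a tweet containing both `:)` and `:(` is dropped. Real pipelines typically use much larger marker lists and additional filters such as language and retweet handling.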




An LSTM model for Twitter Sentiment Analysis

Sentiment analysis on social media such as Twitter provides organization...

A Dataset of State-Censored Tweets

Many governments impose traditional censorship methods on social media p...

A Novel Sentiment Analysis Engine for Preliminary Depression Status Estimation on Social Media

Text sentiment analysis for preliminary depression status estimation of ...

Sentiment of Emojis

There is a new generation of emoticons, called emojis, that is increasin...

Modeling Rich Contexts for Sentiment Classification with LSTM

Sentiment analysis on social media data such as tweets and weibo has bec...

How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies

Twitter sentiment analysis, which often focuses on predicting the polari...