Using millions of emoji occurrences to learn any-domain representations for detecting sentiment, emotion and sarcasm

08/01/2017
by Bjarke Felbo, et al.

NLP tasks are often limited by the scarcity of manually annotated data. In social media sentiment analysis and related tasks, researchers have therefore used binarized emoticons and specific hashtags as forms of distant supervision. Our paper shows that by extending the distant supervision to a more diverse set of noisy labels, models can learn richer representations. Through emoji prediction on a dataset of 1246 million tweets containing one of 64 common emojis, we obtain state-of-the-art performance on 8 benchmark datasets within sentiment, emotion and sarcasm detection using a single pretrained model. Our analyses confirm that the diversity of our emotional labels yields a performance improvement over previous distant supervision approaches.
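As a rough illustration of what distant supervision with emoji labels can look like, the sketch below turns raw tweets into (text, label) pairs by treating the emojis a tweet contains as noisy labels and removing them from the input text. It is a minimal sketch under stated assumptions, not the authors' preprocessing pipeline: the four-emoji label set, the function name `make_distant_examples`, and the choice to emit one example per distinct emoji are all hypothetical.

```python
from typing import List, Tuple

# Hypothetical label set: four emojis standing in for the 64 mentioned in the abstract.
EMOJI_LABELS = ["😂", "😍", "😭", "😡"]
EMOJI_TO_ID = {emoji: idx for idx, emoji in enumerate(EMOJI_LABELS)}


def make_distant_examples(tweets: List[str]) -> List[Tuple[str, int]]:
    """Build noisy (text, emoji_id) training pairs from raw tweets.

    Assumption: each distinct label emoji found in a tweet yields one example,
    and all label emojis are stripped from the text so a model cannot simply
    read the label off the input.
    """
    examples = []
    for tweet in tweets:
        # Label emojis present in this tweet, in label-set order.
        present = [e for e in EMOJI_LABELS if e in tweet]
        if not present:
            continue  # no distant label available for this tweet

        # Remove every label emoji, then normalize whitespace.
        text = tweet
        for e in EMOJI_LABELS:
            text = text.replace(e, " ")
        text = " ".join(text.split())
        if not text:
            continue  # the tweet consisted only of emojis

        for e in present:
            examples.append((text, EMOJI_TO_ID[e]))
    return examples


demo = ["so funny 😂😂", "love this 😍 so much 😂", "no emoji here", "😡"]
print(make_distant_examples(demo))
# [('so funny', 0), ('love this so much', 0), ('love this so much', 1)]
```

The resulting pairs could then be fed to any text classifier trained to predict the emoji id; the representations learned on that task are what would be reused for downstream sentiment, emotion, or sarcasm benchmarks.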

research
07/09/2017

PELESent: Cross-domain polarity classification using distant supervision

The enormous amount of texts published daily by Internet users has foste...
research
08/03/2018

A Multi-task Ensemble Framework for Emotion, Sentiment and Intensity Prediction

In this paper, through multi-task ensemble framework we address three pr...
research
01/11/2017

Efficient Twitter Sentiment Classification using Subjective Distant Supervision

As microblogging services like Twitter are becoming more and more influe...
research
07/04/2019

SEntiMoji: An Emoji-Powered Learning Approach for Sentiment Analysis in Software Engineering

Sentiment analysis has various application scenarios in software enginee...
research
05/20/2021

Happy Dance, Slow Clap: Using Reaction GIFs to Predict Induced Affect on Twitter

Datasets with induced emotion labels are scarce but of utmost importance...
research
11/09/2016

Distant supervision for emotion detection using Facebook reactions

We exploit the Facebook reaction feature in a distant supervised fashion...
research
09/21/2017

Inducing Distant Supervision in Suggestion Mining through Part-of-Speech Embeddings

Mining suggestion expressing sentences from a given text is a less inves...
