How Does Tweet Difficulty Affect Labeling Performance of Annotators?

08/01/2018
by   Stefan Räbiger, et al.
0

Crowdsourcing is a popular means to obtain labeled data at moderate costs, for example for tweets, which can then be used in text mining tasks. To alleviate the problem of low-quality labels in this context, multiple human factors have been analyzed to identify and deal with workers who provide such labels. However, one aspect that has been rarely considered is the inherent difficulty of tweets to be labeled and how this affects the reliability of the labels that annotators assign to such tweets. Therefore, we investigate in this preliminary study this connection using a hierarchical sentiment labeling task on Twitter. We find that there is indeed a relationship between both factors, assuming that annotators have labeled some tweets before: labels assigned to easy tweets are more reliable than those assigned to difficult tweets. Therefore, training predictors on easy tweets enhances the performance by up to 6 techniques and crowdsourcing.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/31/2016

Dynamic Allocation of Crowd Contributions for Sentiment Analysis during the 2016 U.S. Presidential Election

Opinions about the 2016 U.S. Presidential Candidates have been expressed...
research
01/11/2019

BUOCA: Budget-Optimized Crowd Worker Allocation

Due to concerns about human error in crowdsourcing, it is standard pract...
research
04/21/2021

How Will Your Tweet Be Received? Predicting the Sentiment Polarity of Tweet Replies

Twitter sentiment analysis, which often focuses on predicting the polari...
research
04/26/2022

Treating Crowdsourcing as Examination: How to Score Tasks and Online Workers?

Crowdsourcing is an online outsourcing mode which can solve the current ...
research
03/25/2015

Regularized Minimax Conditional Entropy for Crowdsourcing

There is a rapidly increasing interest in crowdsourcing for data labelin...
research
11/26/2015

Hierarchical classification of e-commerce related social media

In this paper, we attempt to classify tweets into root categories of the...
research
10/04/2016

A Computational Approach to Automatic Prediction of Drunk Texting

Alcohol abuse may lead to unsociable behavior such as crime, drunk drivi...

Please sign up or login with your details

Forgot password? Click here to reset