Performance Comparison of Crowdworkers and NLP Tools onNamed-Entity Recognition and Sentiment Analysis of Political Tweets

02/11/2020
by   Mona Jalal, et al.
0

We report results of a comparison of the accuracy of crowdworkers and seven NaturalLanguage Processing (NLP) toolkits in solving two important NLP tasks, named-entity recognition (NER) and entity-level sentiment(ELS) analysis. We here focus on a challenging dataset, 1,000 political tweets that were collected during the U.S. presidential primary election in February 2016. Each tweet refers to at least one of four presidential candidates,i.e., four named entities. The groundtruth, established by experts in political communication, has entity-level sentiment information for each candidate mentioned in the tweet. We tested several commercial and open-source tools. Our experiments show that, for our dataset of political tweets, the most accurate NER system, Google Cloud NL, performed almost on par with crowdworkers, but the most accurate ELS analysis system, TensiStrength, did not match the accuracy of crowdworkers by a large margin of more than 30 percent points.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/30/2017

Joint Named Entity Recognition and Stance Detection in Tweets

Named entity recognition (NER) is a well-established task of information...
research
01/15/2019

A Tweet Dataset Annotated for Named Entity Recognition and Stance Detection

Annotated datasets in different domains are critical for many supervised...
research
10/14/2022

TweetNERD – End to End Entity Linking Benchmark for Tweets

Named Entity Recognition and Disambiguation (NERD) systems are foundatio...
research
07/24/2017

CAp 2017 challenge: Twitter Named Entity Recognition

The paper describes the CAp 2017 challenge. The challenge concerns the p...
research
04/29/2023

Examining European Press Coverage of the Covid-19 No-Vax Movement: An NLP Framework

This paper examines how the European press dealt with the no-vax reactio...
research
04/04/2022

Product Market Demand Analysis Using NLP in Banglish Text with Sentiment Analysis and Named Entity Recognition

Product market demand analysis plays a significant role for originating ...
research
05/18/2020

A Semantically Enriched Dataset based on Biomedical NER for the COVID19 Open Research Dataset Challenge

Research into COVID-19 is a big challenge and highly relevant at the mom...

Please sign up or login with your details

Forgot password? Click here to reset