TweetEval: Unified Benchmark and Comparative Evaluation for Tweet Classification

10/23/2020
by   Francesco Barbieri, et al.
0

The experimental landscape in natural language processing for social media is too fragmented. Each year, new shared tasks and datasets are proposed, ranging from classics like sentiment analysis to irony detection or emoji prediction. Therefore, it is unclear what the current state of the art is, as there is no standardized evaluation protocol, neither a strong set of baselines trained on such domain-specific data. In this paper, we propose a new evaluation framework (TweetEval) consisting of seven heterogeneous Twitter-specific classification tasks. We also provide a strong set of baselines as starting point, and compare different language modeling pre-training strategies. Our initial experiments show the effectiveness of starting off with existing pre-trained generic language models, and continue training them on Twitter corpora.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/17/2020

TweetBERT: A Pretrained Language Representation Model for Twitter Text Analysis

Twitter is a well-known microblogging social site where users express th...
research
04/25/2021

XLM-T: A Multilingual Language Model Toolkit for Twitter

Language models are ubiquitous in current NLP, and their multilingual ca...
research
10/11/2022

Relational Embeddings for Language Independent Stance Detection

The large majority of the research performed on stance detection has bee...
research
10/07/2021

UoB at SemEval-2021 Task 5: Extending Pre-Trained Language Models to Include Task and Domain-Specific Information for Toxic Span Prediction

Toxicity is pervasive in social media and poses a major threat to the he...
research
07/20/2023

A Dataset and Strong Baselines for Classification of Czech News Texts

Pre-trained models for Czech Natural Language Processing are often evalu...
research
07/18/2019

SentiMATE: Learning to play Chess through Natural Language Processing

We present SentiMATE, a novel end-to-end Deep Learning model for Chess, ...
research
11/10/2020

Towards Preemptive Detection of Depression and Anxiety in Twitter

Depression and anxiety are psychiatric disorders that are observed in ma...

Please sign up or login with your details

Forgot password? Click here to reset