Will-They-Won't-They: A Very Large Dataset for Stance Detection on Twitter

05/01/2020
by Costanza Conforti, et al.

We present a new, challenging stance detection dataset, called Will-They-Won't-They (WT-WT), which contains 51,284 tweets in English, making it by far the largest available dataset of its type. All annotations were carried out by experts; the dataset therefore constitutes a high-quality and reliable benchmark for future research in stance detection. Our experiments with a wide range of recent state-of-the-art stance detection systems show that the dataset poses a strong challenge to existing models in this domain.

research
04/28/2023

Antisemitic Messages? A Guide to High-Quality Annotation and a Labeled Dataset of Tweets

One of the major challenges in automatic hate speech detection is the la...
research
02/20/2021

Concealed Object Detection

We present the first systematic study on concealed object detection (COD...
research
08/08/2022

A High-Quality and Large-Scale Dataset for English-Vietnamese Speech Translation

In this paper, we introduce a high-quality and large-scale benchmark dat...
research
09/11/2023

Personality Detection and Analysis using Twitter Data

Personality types are important in various fields as they hold relevant ...
research
05/22/2022

TWEET-FID: An Annotated Dataset for Multiple Foodborne Illness Detection Tasks

Foodborne illness is a serious but preventable public health problem – w...
research
09/09/2016

Harassment detection: a benchmark on the #HackHarassment dataset

Online harassment has been a problem to a greater or lesser extent since...
research
06/12/2019

The Herbarium Challenge 2019 Dataset

Herbarium sheets are invaluable for botanical research, and considerable...
