NEU at WNUT-2020 Task 2: Data Augmentation To Tell BERT That Death Is Not Necessarily Informative

09/18/2020
by   Kumud Chauhan, et al.
0

Millions of people around the world are sharing COVID-19 related information on social media platforms. Since not all the information shared on the social media is useful, a machine learning system to identify informative posts can help users in finding relevant information. In this paper, we present a BERT classifier system for W-NUT2020 Shared Task 2: Identification of Informative COVID-19 English Tweets. Further, we show that BERT exploits some easy signals to identify informative tweets, and adding simple patterns to uninformative tweets drastically degrades BERT performance. In particular, simply adding 10 deaths to tweets in dev set, reduces BERT F1- score from 92.63 to 7.28. We also propose a simple data augmentation technique that helps in improving the robustness and generalization ability of the BERT classifier.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/21/2020

Detection of COVID-19 informative tweets using RoBERTa

Social media such as Twitter is a hotspot of user-generated information....
research
03/07/2023

Classifying Text-Based Conspiracy Tweets related to COVID-19 using Contextualized Word Embeddings

The FakeNews task in MediaEval 2022 investigates the challenge of findin...
research
08/02/2021

Changes in European Solidarity Before and During COVID-19: Evidence from a Large Crowd- and Expert-Annotated Twitter Dataset

We introduce the well-established social scientific concept of social so...
research
11/05/2021

Sexism Identification in Tweets and Gabs using Deep Neural Networks

Through anonymisation and accessibility, social media platforms have fac...
research
10/11/2020

InfoMiner at WNUT-2020 Task 2: Transformer-based Covid-19 Informative Tweet Extraction

Identifying informative tweets is an important step when building inform...
research
09/14/2020

Not-NUTs at W-NUT 2020 Task 2: A BERT-based System in Identifying Informative COVID-19 English Tweets

As of 2020 when the COVID-19 pandemic is full-blown on a global scale, p...
research
08/30/2020

QMUL-SDS at CheckThat! 2020: Determining COVID-19 Tweet Check-Worthiness Using an Enhanced CT-BERT with Numeric Expressions

This paper describes the participation of the QMUL-SDS team for Task 1 o...

Please sign up or login with your details

Forgot password? Click here to reset