Duluth at SemEval-2020 Task 12: Offensive Tweet Identification in English with Logistic Regression

07/25/2020
by Ted Pedersen, et al.

This paper describes the Duluth systems that participated in SemEval–2020 Task 12, Multilingual Offensive Language Identification in Social Media (OffensEval–2020). We participated in the three English language tasks. Our systems provide a simple machine learning baseline using logistic regression. We trained our models on the distantly supervised training data made available by the task organizers and used no other resources. As might be expected, we did not rank highly in the comparative evaluation: 79th of 85 in Task A, 34th of 43 in Task B, and 24th of 39 in Task C. We carried out a qualitative analysis of our results and found that the class labels in the gold standard data are somewhat noisy. We hypothesize that the extremely high accuracy (> 90%) of the top ranked systems may reflect methods that learn the training data very well but may not generalize to the task of identifying offensive language in English. This analysis includes examples of tweets that, despite being mildly redacted, are still offensive.
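To make the idea of a logistic regression baseline for tweet classification concrete, here is a minimal pure-Python sketch. It is not the Duluth system: the abstract does not say which features or toolkit were used, so the bag-of-words features, the plain stochastic gradient descent training loop, the hyperparameters, and the four toy training sentences (with the placeholder token `BADWORD`) are all assumptions made for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase whitespace tokenization (not taken from the paper).
    return text.lower().split()


def build_vocab(texts):
    # Map each distinct training token to a feature index.
    vocab = {}
    for text in texts:
        for tok in tokenize(text):
            vocab.setdefault(tok, len(vocab))
    return vocab


def featurize(text, vocab):
    # Sparse bag-of-words counts; tokens outside the vocabulary are ignored.
    counts = Counter(tok for tok in tokenize(text) if tok in vocab)
    return {vocab[tok]: float(c) for tok, c in counts.items()}


def sigmoid(z):
    # Numerically stable logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


def predict_proba(x, weights, bias):
    # Probability that the input belongs to the positive (offensive) class.
    z = bias + sum(weights[i] * v for i, v in x.items())
    return sigmoid(z)


def train(texts, labels, epochs=200, lr=0.5):
    # Plain stochastic gradient descent on the log loss, no regularization.
    vocab = build_vocab(texts)
    weights = [0.0] * len(vocab)
    bias = 0.0
    data = [(featurize(t, vocab), y) for t, y in zip(texts, labels)]
    for _ in range(epochs):
        for x, y in data:
            err = predict_proba(x, weights, bias) - y
            for i, v in x.items():
                weights[i] -= lr * err * v
            bias -= lr * err
    return vocab, weights, bias


# Hypothetical toy data: 1 = offensive, 0 = not offensive.
TRAIN_TEXTS = [
    "you are a BADWORD",
    "what a BADWORD idea",
    "you are a friend",
    "what a lovely idea",
]
TRAIN_LABELS = [1, 1, 0, 0]

vocab, weights, bias = train(TRAIN_TEXTS, TRAIN_LABELS)
p_off = predict_proba(featurize("such a BADWORD", vocab), weights, bias)
p_not = predict_proba(featurize("a lovely friend", vocab), weights, bias)
print(f"P(offensive | 'such a BADWORD')  = {p_off:.3f}")
print(f"P(offensive | 'a lovely friend') = {p_not:.3f}")
```

A model like this learns one weight per token, which is part of why such a baseline is easy to inspect: the tokens with the largest positive weights show directly what the classifier has associated with the offensive class in the training data.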

research
05/25/2018

Duluth UROP at SemEval-2018 Task 2: Multilingual Emoji Prediction with Ensemble Learning and Oversampling

This paper describes the Duluth UROP systems that participated in SemEva...
research
07/25/2020

Duluth at SemEval-2019 Task 6: Lexical Approaches to Identify and Categorize Offensive Tweets

This paper describes the Duluth systems that participated in SemEval–201...
research
05/14/2020

NIT-Agartala-NLP-Team at SemEval-2020 Task 8: Building Multimodal Classifiers to tackle Internet Humor

The paper describes the systems submitted to SemEval-2020 Task 8: Memoti...
research
10/02/2017

Identifying Nominals with No Head Match Co-references Using Deep Learning

Identifying nominals with no head match is a long-standing challenge in ...
research
04/10/2021

Identifying and Categorizing Offensive Language in Social Media

Offensive language is pervasive in social media. Individuals frequently ...
research
03/04/2023

RweetMiner: Automatic identification and categorization of help requests on twitter during disasters

Catastrophic events create uncertain situations for humanitarian organiz...
research
07/23/2023

Comparative analysis using classification methods versus early stage diabetes

In this research work, a comparative analysis was carried out using clas...
