Hitachi at SemEval-2020 Task 12: Offensive Language Identification with Noisy Labels using Statistical Sampling and Post-Processing

by Manikandan Ravikiran, et al.

In this paper, we present our participation in SemEval-2020 Task 12 Subtask A (English), which focuses on offensive language identification from noisy labels. To this end, we developed a hybrid system: a BERT classifier trained on tweets selected using a Statistical Sampling Algorithm (SA) and Post-Processed (PP) using an offensive wordlist. Our system achieved 34th position with a Macro-averaged F1-score (Macro-F1) of 0.90913 over both the offensive and non-offensive classes. We further present comprehensive results and an error analysis to assist future research in offensive language identification with noisy labels.
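
To make the evaluation metric concrete, the following is a minimal sketch of how Macro-F1 over the two classes can be computed, together with one possible form of wordlist-based post-processing. The abstract does not specify how the post-processing works, so the rule shown here (overriding a prediction to offensive when a tweet contains a wordlist term) is purely an assumption for illustration, as are the label names `OFF` and `NOT`, the function names, and the toy data.

```python
from typing import List, Sequence, Set

# Label names are assumptions made for this sketch.
OFF, NOT = "OFF", "NOT"


def f1_for_class(gold: Sequence[str], pred: Sequence[str], label: str) -> float:
    """F1-score for a single class, treating `label` as the positive class."""
    tp = sum(1 for g, p in zip(gold, pred) if g == label and p == label)
    fp = sum(1 for g, p in zip(gold, pred) if g != label and p == label)
    fn = sum(1 for g, p in zip(gold, pred) if g == label and p != label)
    if tp == 0:
        return 0.0  # avoids division by zero when nothing is correctly predicted
    precision = tp / (tp + fp)
    recall = tp / (tp + fn)
    return 2 * precision * recall / (precision + recall)


def macro_f1(gold: Sequence[str], pred: Sequence[str],
             labels: Sequence[str] = (OFF, NOT)) -> float:
    """Unweighted mean of the per-class F1-scores."""
    return sum(f1_for_class(gold, pred, label) for label in labels) / len(labels)


def post_process(tweets: Sequence[str], pred: Sequence[str],
                 wordlist: Set[str]) -> List[str]:
    """Hypothetical rule (an assumption, not the paper's confirmed method):
    mark a tweet as offensive if any whitespace token is in the wordlist."""
    out = []
    for text, label in zip(tweets, pred):
        tokens = set(text.lower().split())
        out.append(OFF if tokens & wordlist else label)
    return out


if __name__ == "__main__":
    # Toy data, invented for illustration only.
    gold = [OFF, OFF, NOT, NOT]
    pred = [OFF, NOT, NOT, NOT]
    print(round(macro_f1(gold, pred), 4))
```

Because Macro-F1 averages the per-class scores without weighting by class frequency, errors on the smaller class affect the score as much as errors on the larger one.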




