DeepAI AI Chat
Log In Sign Up

Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

by   Manikandan Ravikiran, et al.
Georgia Institute of Technology
Insight Centre for Data Analytics

Offensive content moderation is vital in social media platforms to support healthy online discussions. However, their prevalence in codemixed Dravidian languages is limited to classifying whole comments without identifying part of it contributing to offensiveness. Such limitation is primarily due to the lack of annotated data for offensive spans. Accordingly, in this shared task, we provide Tamil-English code-mixed social comments with offensive spans. This paper outlines the dataset so released, methods, and results of the submitted systems


page 1

page 2

page 3

page 4


Pegasus@Dravidian-CodeMix-HASOC2021: Analyzing Social Media Content for Detection of Offensive Text

To tackle the conundrum of detecting offensive comments/posts which are ...

CUSATNLP@HASOC-Dravidian-CodeMix-FIRE2020:Identifying Offensive Language from ManglishTweets

With the popularity of social media, communications through blogs, Faceb...

CoRAL: a Context-aware Croatian Abusive Language Dataset

In light of unprecedented increases in the popularity of the internet an...

TEET! Tunisian Dataset for Toxic Speech Detection

The complete freedom of expression in social media has its costs especia...

WLV-RIT at GermEval 2021: Multitask Learning with Transformers to Detect Toxic, Engaging, and Fact-Claiming Comments

This paper addresses the identification of toxic, engaging, and fact-cla...

Measuring the Prevalence of Anti-Social Behavior in Online Communities

With increasing attention to online anti-social behaviors such as person...