DeepAI AI Chat
Log In Sign Up

Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

05/12/2022
by   Manikandan Ravikiran, et al.
Georgia Institute of Technology
Insight Centre for Data Analytics
0

Offensive content moderation is vital in social media platforms to support healthy online discussions. However, their prevalence in codemixed Dravidian languages is limited to classifying whole comments without identifying part of it contributing to offensiveness. Such limitation is primarily due to the lack of annotated data for offensive spans. Accordingly, in this shared task, we provide Tamil-English code-mixed social comments with offensive spans. This paper outlines the dataset so released, methods, and results of the submitted systems

READ FULL TEXT

page 1

page 2

page 3

page 4

11/18/2021

Pegasus@Dravidian-CodeMix-HASOC2021: Analyzing Social Media Content for Detection of Offensive Text

To tackle the conundrum of detecting offensive comments/posts which are ...
10/17/2020

CUSATNLP@HASOC-Dravidian-CodeMix-FIRE2020:Identifying Offensive Language from ManglishTweets

With the popularity of social media, communications through blogs, Faceb...
11/11/2022

CoRAL: a Context-aware Croatian Abusive Language Dataset

In light of unprecedented increases in the popularity of the internet an...
10/11/2021

TEET! Tunisian Dataset for Toxic Speech Detection

The complete freedom of expression in social media has its costs especia...
07/30/2021

WLV-RIT at GermEval 2021: Multitask Learning with Transformers to Detect Toxic, Engaging, and Fact-Claiming Comments

This paper addresses the identification of toxic, engaging, and fact-cla...
08/27/2022

Measuring the Prevalence of Anti-Social Behavior in Online Communities

With increasing attention to online anti-social behaviors such as person...