Developing Successful Shared Tasks on Offensive Language Identification for Dravidian Languages

With the rapid growth of mobile computing and Web technologies, offensive language has become more prevalent on social networking platforms. Because identifying offensive language in local languages is essential for moderating social media content, this paper works with three under-resourced Dravidian languages: Malayalam, Tamil, and Kannada. We present an evaluation task held at FIRE 2020 (HASOC-DravidianCodeMix) and DravidianLangTech at EACL 2021, designed to provide a framework for comparing different approaches to this problem. This paper describes the data creation, defines the task, lists the participating systems, and discusses the various methods.
