CUSATNLP@HASOC-Dravidian-CodeMix-FIRE2020:Identifying Offensive Language from ManglishTweets

10/17/2020
by   Sara Renjit, et al.
0

With the popularity of social media, communications through blogs, Facebook, Twitter, and other plat-forms have increased. Initially, English was the only medium of communication. Fortunately, now we can communicate in any language. It has led to people using English and their own native or mother tongue language in a mixed form. Sometimes, comments in other languages have English transliterated format or other cases; people use the intended language scripts. Identifying sentiments and offensive content from such code mixed tweets is a necessary task in these times. We present a working model submitted for Task2 of the sub-track HASOC Offensive Language Identification- DravidianCodeMix in Forum for Information Retrieval Evaluation, 2020. It is a message level classification task. An embedding model-based classifier identifies offensive and not offensive comments in our approach. We applied this method in the Manglish dataset provided along with the sub-track.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/12/2022

Findings of the Shared Task on Offensive Span Identification from Code-Mixed Tamil-English Comments

Offensive content moderation is vital in social media platforms to suppo...
research
10/05/2020

Gauravarora@HASOC-Dravidian-CodeMix-FIRE2020: Pre-training ULMFiT on Synthetically Generated Code-Mixed Data for Hate Speech Detection

This paper describes the system submitted to Dravidian-Codemix-HASOC2020...
research
03/26/2018

Aggression-annotated Corpus of Hindi-English Code-mixed Data

As the interaction over the web has increased, incidents of aggression a...
research
03/31/2021

Misinformation detection in Luganda-English code-mixed social media text

The increasing occurrence, forms, and negative effects of misinformation...
research
01/15/2020

AggressionNet: Generalised Multi-Modal Deep Temporal and Sequential Learning for Aggression Identification

Wide usage of social media platforms has increased the risk of aggressio...
research
01/15/2020

A Unified System for Aggression Identification in English Code-Mixed and Uni-Lingual Texts

Wide usage of social media platforms has increased the risk of aggressio...
research
07/29/2016

Labeling of Query Words using Conditional Random Field

This paper describes our approach on Query Word Labeling as an attempt i...

Please sign up or login with your details

Forgot password? Click here to reset