bitsa_nlp@LT-EDI-ACL2022: Leveraging Pretrained Language Models for Detecting Homophobia and Transphobia in Social Media Comments

03/27/2022
by   Vitthal Bhandari, et al.

Online social networks are ubiquitous and user-friendly, but offensive content on them must be detected and moderated to maintain decency and empathy. Mining social media text is difficult because users do not adhere to any fixed patterns: comments can be written in any combination of languages, many of which may be low-resource. In this paper, we present our system for the LT-EDI shared task on detecting homophobia and transphobia in social media comments. We experiment with a number of monolingual and multilingual transformer-based models, such as mBERT, along with a data augmentation technique for tackling class imbalance. Such large pretrained models have recently shown tremendous success on a variety of benchmark tasks in natural language processing. We observe their performance on a carefully annotated, real-life dataset of YouTube comments in English as well as Tamil. Our submission achieved ranks 9, 6 and 3 with macro-averaged F1-scores of 0.42, 0.64 and 0.58 in the English, Tamil and Tamil-English subtasks, respectively. The code for the system has been open-sourced.
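The results above are reported as macro-averaged F1, which weights every class equally regardless of how many comments belong to it. The following is a minimal sketch of that metric in plain Python, not the authors' evaluation code. The function name `macro_f1` and the labels `"none"` and `"homophobic"` are illustrative assumptions and are not taken from the shared task data.

```python
from collections import Counter


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over all labels seen in either list."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # A class with no true positives contributes an F1 of 0.
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical imbalanced sample: 8 harmless comments, 2 harmful ones.
y_true = ["none"] * 8 + ["homophobic"] * 2
# A degenerate classifier that always predicts the majority class.
y_pred = ["none"] * 10

accuracy = sum(t == p for t, p in zip(y_true, y_pred)) / len(y_true)
print(f"accuracy = {accuracy:.2f}, macro F1 = {macro_f1(y_true, y_pred):.2f}")
```

In this toy case the majority-class predictor is 80% accurate, but its macro F1 is much lower because the minority class scores zero. This is why macro-averaging suits a dataset with class imbalance.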

research
08/20/2023

cantnlp@LT-EDI-2023: Homophobia/Transphobia Detection in Social Media Comments using Spatio-Temporally Retrained Language Models

This paper describes our multiclass classification system developed as p...
research
04/20/2023

"HOT" ChatGPT: The promise of ChatGPT in detecting and discriminating hateful, offensive, and toxic comments on social media

Harmful content is pervasive on social media, poisoning online communiti...
research
04/19/2022

Optimize_Prime@DravidianLangTech-ACL2022: Abusive Comment Detection in Tamil

This paper tries to address the problem of abusive comment detection in ...
research
04/03/2023

Detection of Homophobia Transphobia in Dravidian Languages: Exploring Deep Learning Methods

The increase in abusive content on online social media platforms is impa...
research
03/03/2021

Hate, Obscenity, and Insults: Measuring the Exposure of Children to Inappropriate Comments in YouTube

Social media has become an essential part of the daily routines of child...
research
01/09/2020

Offensive Language Detection: A Comparative Analysis

Offensive behaviour has become pervasive in the Internet community. Indi...
research
09/11/2019

Kashmir: A Computational Analysis of the Voice of Peace

The recent Pulwama terror attack (February 14, 2019, Pulwama, Kashmir) t...
