CL-UZH at SemEval-2023 Task 10: Sexism Detection through Incremental Fine-Tuning and Multi-Task Learning with Label Descriptions

06/06/2023
by   Janis Goldzycher, et al.
0

The widespread popularity of social media has led to an increase in hateful, abusive, and sexist language, motivating methods for the automatic detection of such phenomena. The goal of the SemEval shared task Towards Explainable Detection of Online Sexism (EDOS 2023) is to detect sexism in English social media posts (subtask A), and to categorize such posts into four coarse-grained sexism categories (subtask B), and eleven fine-grained subcategories (subtask C). In this paper, we present our submitted systems for all three subtasks, based on a multi-task model that has been fine-tuned on a range of related tasks and datasets before being fine-tuned on the specific EDOS subtasks. We implement multi-task learning by formulating each task as binary pairwise text classification, where the dataset and label descriptions are given along with the input text. The results show clear improvements over a fine-tuned DeBERTa-V3 serving as a baseline leading to F_1-scores of 85.9% in subtask A (rank 13/84), 64.8% in subtask B (rank 19/69), and 44.9% in subtask C (26/63).

READ FULL TEXT
research
06/08/2023

LCT-1 at SemEval-2023 Task 10: Pre-training and Multi-task Learning for Sexism Detection and Classification

Misogyny and sexism are growing problems in social media. Advances have ...
research
07/21/2020

problemConquero at SemEval-2020 Task 12: Transformer and Soft label-based approaches

In this paper, we present various systems submitted by our team problemC...
research
01/17/2018

Automatic Detection of Cyberbullying in Social Media Text

While social media offer great communication opportunities, they also in...
research
03/07/2023

SemEval-2023 Task 10: Explainable Detection of Online Sexism

Online sexism is a widespread and harmful phenomenon. Automated tools ca...
research
06/14/2021

Dataset of Propaganda Techniques of the State-Sponsored Information Operation of the People's Republic of China

The digital media, identified as computational propaganda provides a pat...
research
06/15/2023

ChatGPT for Suicide Risk Assessment on Social Media: Quantitative Evaluation of Model Performance, Potentials and Limitations

This paper presents a novel framework for quantitatively evaluating the ...
research
01/31/2023

TopoBERT: Plug and Play Toponym Recognition Module Harnessing Fine-tuned BERT

Extracting precise geographical information from textual contents is cru...

Please sign up or login with your details

Forgot password? Click here to reset