XD at SemEval-2020 Task 12: Ensemble Approach to Offensive Language Identification in Social Media Using Transformer Encoders

07/21/2020
by   Xiangjue Dong, et al.
0

This paper presents six document classification models using the latest transformer encoders and a high-performing ensemble model for a task of offensive language identification in social media. For the individual models, deep transformer layers are applied to perform multi-head attentions. For the ensemble model, the utterance representations taken from those individual models are concatenated and fed into a linear decoder to make the final decisions. Our ensemble model outperforms the individual models and shows up to 8.6 set, it achieves macro-F1 of 90.9 systems among 85 participants in the sub-task A of this shared task. Our analysis shows that although the ensemble model significantly improves the accuracy on the development set, the improvement is not as evident on the test set.

READ FULL TEXT
research
05/22/2020

Transformer-based Context-aware Sarcasm Detection in Conversation Threads from Social Media

We present a transformer-based sarcasm detection model that accounts for...
research
05/14/2023

Unraveling Cold Start Enigmas in Predictive Analytics for OTT Media: Synergistic Meta-Insights and Multimodal Ensemble Mastery

The cold start problem is a common challenge in various domains, includi...
research
02/18/2021

UnibucKernel: Geolocating Swiss German Jodels Using Ensemble Learning

In this work, we describe our approach addressing the Social Media Varie...
research
08/20/2023

cantnlp@LT-EDI-2023: Homophobia/Transphobia Detection in Social Media Comments using Spatio-Temporally Retrained Language Models

This paper describes our multiclass classification system developed as p...
research
03/14/2019

OffensEval at SemEval-2019 Task 6: Okham's Razor on Identifying and Categorizing Offensive Language in Social Media

This document describes our approach to building an Offensive Language C...
research
01/15/2022

Ensemble Transformer for Efficient and Accurate Ranking Tasks: an Application to Question Answering Systems

Large transformer models can highly improve Answer Sentence Selection (A...
research
07/30/2020

The Unreasonable Effectiveness of Machine Learning in Moldavian versus Romanian Dialect Identification

In this work, we provide a follow-up on the Moldavian versus Romanian Cr...

Please sign up or login with your details

Forgot password? Click here to reset