Ghmerti at SemEval-2019 Task 6: A Deep Word- and Character-based Approach to Offensive Language Identification

09/22/2020
by Ehsan Doostmohammadi, et al.

This paper presents the models submitted by the Ghmerti team for subtasks A and B of the OffensEval shared task at SemEval 2019. OffensEval addresses the problem of identifying and categorizing offensive language in social media through three subtasks: whether a piece of content is offensive (subtask A), whether the offense is targeted (subtask B), and whether the target is an individual, a group, or another entity (subtask C). The proposed approach combines a character-level Convolutional Neural Network, a word-level Recurrent Neural Network, and some preprocessing. The proposed model achieves a macro-averaged F1-score of 77.93 on subtask A.
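
For readers unfamiliar with the metric, a macro-averaged F1-score is the unweighted mean of the per-class F1-scores, so a minority class counts as much as the majority class. The sketch below is only an illustration of that definition, not the shared task's official scorer; the function name `macro_f1` and the toy labels are assumptions made up for this example.

```python
from collections import Counter

def macro_f1(gold, pred):
    """Unweighted mean of per-class F1 over every label seen in gold or pred."""
    labels = sorted(set(gold) | set(pred))
    if not labels:
        return 0.0
    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, pred):
        if g == p:
            tp[g] += 1
        else:
            fp[p] += 1  # predicted p, but the gold label was something else
            fn[g] += 1  # gold label g was missed
    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the denominator is 0
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)

# Toy labels invented for illustration only (not the task data).
gold = ["OFF", "NOT", "NOT", "OFF", "NOT", "NOT"]
pred = ["OFF", "NOT", "OFF", "NOT", "NOT", "NOT"]
score = macro_f1(gold, pred)
print(f"macro-F1 = {score:.3f}")
```

On this toy input the per-class F1-scores are 0.5 and 0.75, so the macro average is 0.625; a reported figure such as 77.93 is the same quantity expressed on a 0 to 100 scale.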


