Voice@SRIB at SemEval-2020 Task [9,12]: Sentiment and Offensiveness detection in Social Media

07/20/2020
by   Abhishek Singh, et al.

On social-media platforms such as Twitter, Facebook, and Reddit, people often use code-mixed language, such as Spanish-English or Hindi-English, to express their opinions. In this paper, we describe the models we used, the external dataset used to train embeddings, and the ensembling methods applied to the SentiMix and OffensEval tasks. Pre-trained embeddings usually help in tasks such as sentence classification and machine translation. In our experiments, we applied our own trained code-mixed embeddings and Twitter pre-trained embeddings to the SemEval tasks. We evaluate our models on macro F1-score, precision, accuracy, and recall on the datasets. We intend to show that hyper-parameter tuning and data pre-processing steps substantially improve the scores. In our experiments, we achieve a macro F1 of 0.886 on the OffensEval Greek-language subtask post-evaluation, whereas the highest score during the evaluation period was 0.852. We placed third in the Spanglish competition with a best F1-score of 0.756. Our Codalab username is asking28.
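For readers unfamiliar with the metrics named in the abstract, the following is a minimal sketch of how accuracy and macro-averaged precision, recall, and F1 can be computed for a multi-class task. It is not the authors' evaluation code: the function name `macro_scores` and the toy labels are assumptions made for illustration, and macro F1 is taken here as the unweighted mean of per-class F1 scores.

```python
from collections import Counter


def macro_scores(y_true, y_pred):
    """Return (accuracy, macro precision, macro recall, macro F1).

    Hypothetical helper for illustration; macro F1 is the unweighted
    mean of the per-class F1 scores.
    """
    if not y_true or len(y_true) != len(y_pred):
        raise ValueError("y_true and y_pred must be non-empty and equal in length")

    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for truth, pred in zip(y_true, y_pred):
        if truth == pred:
            tp[truth] += 1
        else:
            fp[pred] += 1   # predicted class gains a false positive
            fn[truth] += 1  # true class gains a false negative

    precisions, recalls, f1s = [], [], []
    for label in labels:
        p_den = tp[label] + fp[label]
        r_den = tp[label] + fn[label]
        precision = tp[label] / p_den if p_den else 0.0
        recall = tp[label] / r_den if r_den else 0.0
        f_den = precision + recall
        f1 = 2 * precision * recall / f_den if f_den else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(labels)
    accuracy = sum(tp.values()) / len(y_true)
    return accuracy, sum(precisions) / n, sum(recalls) / n, sum(f1s) / n


# Toy three-class sentiment labels (made up for this example).
gold = ["pos", "neg", "neu", "pos", "neg", "neu"]
pred = ["pos", "neg", "pos", "pos", "neu", "neu"]
accuracy, macro_p, macro_r, macro_f1 = macro_scores(gold, pred)
print(f"acc={accuracy:.3f} P={macro_p:.3f} R={macro_r:.3f} F1={macro_f1:.3f}")
```

Because every class contributes equally to the average, macro F1 penalizes a model that performs well only on the majority class, which is one reason it is commonly reported for imbalanced classification tasks.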


