A Precisely Xtreme-Multi Channel Hybrid Approach For Roman Urdu Sentiment Analysis

03/11/2020
by Faiza Memood, et al.

To accelerate progress on Natural Language Processing tasks for Roman Urdu, this paper provides, for the first time, three neural word embeddings prepared with the most widely used approaches, namely Word2vec, FastText, and GloVe. The integrity of the generated embeddings is evaluated using intrinsic and extrinsic evaluation approaches. Given the lack of publicly available benchmark datasets, the paper also provides a first-ever Roman Urdu dataset consisting of 3241 sentiments annotated with positive, negative, and neutral classes. To provide benchmark baseline performance on this dataset, the paper adapts diverse machine learning (Support Vector Machine, Logistic Regression, Naive Bayes), deep learning (convolutional neural network, recurrent neural network), and hybrid approaches. The effectiveness of the generated embeddings is evaluated by comparing the performance of machine learning and deep learning methodologies using 7 and 5 distinct feature representation approaches, respectively. Finally, the paper proposes a novel precisely extreme multi-channel hybrid methodology that outperforms the adapted state-of-the-art machine and deep learning approaches by a figure of 9 in terms of F1-score.

Keywords: Roman Urdu Sentiment Analysis, Pretrained word embeddings for Roman Urdu, Word2Vec, GloVe, FastText
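Since the headline comparison is reported as an F1-score over three sentiment classes, a small worked example may help make the metric concrete. The sketch below assumes macro averaging (an unweighted mean of per-class F1), which the abstract does not specify; the function name `macro_f1` and the toy label lists are hypothetical and are not taken from the paper or its dataset.

```python
from collections import Counter

# The three classes named in the abstract.
LABELS = ("positive", "negative", "neutral")


def macro_f1(gold, predicted, labels=LABELS):
    """Return the unweighted mean of per-class F1 scores.

    Assumption: macro averaging. The paper may use a different
    averaging scheme (e.g. micro or weighted).
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must have the same length")

    tp, fp, fn = Counter(), Counter(), Counter()
    for g, p in zip(gold, predicted):
        if g == p:
            tp[g] += 1      # correct prediction for class g
        else:
            fp[p] += 1      # p was predicted but is wrong
            fn[g] += 1      # g was the true class but was missed

    scores = []
    for label in labels:
        # F1 = 2*TP / (2*TP + FP + FN); defined as 0 when the class never occurs.
        denom = 2 * tp[label] + fp[label] + fn[label]
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Toy, made-up labels purely for illustration (not from the dataset).
gold = ["positive", "negative", "neutral", "positive", "negative", "neutral"]
pred = ["positive", "negative", "positive", "positive", "neutral", "neutral"]
print(round(macro_f1(gold, pred), 3))  # prints 0.656
```

Here the per-class scores are 0.8 (positive), about 0.667 (negative), and 0.5 (neutral), so the macro average is about 0.656. Macro averaging treats the three classes equally regardless of how many examples each has, which is one reasonable choice when class sizes are uneven.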


