Egyptian Dialect Stopword List Generation from Social Network Data

04/13/2015
by   Walaa Medhat, et al.
0

This paper proposes a methodology for generating a stopword list from online social network (OSN) corpora in Egyptian Dialect(ED). The aim of the paper is to investigate the effect of removingED stopwords on the Sentiment Analysis (SA) task. The stopwords lists generated before were on Modern Standard Arabic (MSA) which is not the common language used in OSN. We have generated a stopword list of Egyptian dialect to be used with the OSN corpora. We compare the efficiency of text classification when using the generated list along with previously generated lists of MSA and combining the Egyptian dialect list with the MSA list. The text classification was performed using Naïve Bayes and Decision Tree classifiers and two feature selection approaches, unigram and bigram. The experiments show that removing ED stopwords give better performance than using lists of MSA stopwords only.

READ FULL TEXT
research
11/20/2020

Feature selection using binary grey wolf optimizer with elite-based crossover for Arabic text classification

Text classification is one of the challenging computational tasks in mac...
research
11/21/2014

Falling Rule Lists

Falling rule lists are classification models consisting of an ordered li...
research
05/12/2020

List homomorphism problems for signed graphs

We consider homomorphisms of signed graphs from a computational perspect...
research
02/07/2018

Structure and Stability of Internet Top Lists

Active Internet measurement studies rely on a list of targets to be scan...
research
03/01/2006

Towards a better list of citation superstars: compiling a multidisciplinary list of highly cited researchers

A new approach to producing multidisciplinary lists of highly cited rese...
research
03/07/2020

Frozen Binomials on the Web: Word Ordering and Language Conventions in Online Text

There is inherent information captured in the order in which we write wo...
research
04/30/2023

The Art of the Fugue: Minimizing Interleaving in Collaborative Text Editing

Existing algorithms for replicated lists, which are widely used in colla...

Please sign up or login with your details

Forgot password? Click here to reset