A Comparative Study of Feature Selection Methods for Dialectal Arabic Sentiment Classification Using Support Vector Machine

02/17/2019
by   Omar Al-Harbi, et al.
0

Unlike other languages, the Arabic language has a morphological complexity which makes the Arabic sentiment analysis is a challenging task. Moreover, the presence of the dialects in the Arabic texts have made the sentiment analysis task is more challenging, due to the absence of specific rules that govern the writing or speaking system. Generally, one of the problems of sentiment analysis is the high dimensionality of the feature vector. To resolve this problem, many feature selection methods have been proposed. In contrast to the dialectal Arabic language, these selection methods have been investigated widely for the English language. This work investigated the effect of feature selection methods and their combinations on dialectal Arabic sentiment classification. The feature selection methods are Information Gain (IG), Correlation, Support Vector Machine (SVM), Gini Index (GI), and Chi-Square. A number of experiments were carried out on dialectical Jordanian reviews with using an SVM classifier. Furthermore, the effect of different term weighting schemes, stemmers, stop words removal, and feature models on the performance were investigated. The experimental results showed that the best performance of the SVM classifier was obtained after the SVM and correlation feature selection methods had been combined with the uni-gram model.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/25/2017

Using objective words in the reviews to improve the colloquial arabic sentiment analysis

One of the main difficulties in sentiment analysis of the Arabic languag...
research
12/27/2014

Persian Sentiment Analyzer: A Framework based on a Novel Feature Selection Method

In the recent decade, with the enormous growth of digital content in int...
research
11/20/2020

Feature selection using binary grey wolf optimizer with elite-based crossover for Arabic text classification

Text classification is one of the challenging computational tasks in mac...
research
04/21/2020

A novel embedded min-max approach for feature selection in nonlinear Support Vector Machine classification

In recent years, feature selection has become a challenging problem in s...
research
03/12/2020

TF-IDFC-RF: A Novel Supervised Term Weighting Scheme

Sentiment Analysis is a branch of Affective Computing usually considered...
research
09/20/2017

Identifying Restaurant Features via Sentiment Analysis on Yelp Reviews

Many people use Yelp to find a good restaurant. Nonetheless, with only a...
research
03/28/2023

An Experimental Study on Sentiment Classification of Moroccan dialect texts in the web

With the rapid growth of the use of social media websites, obtaining the...

Please sign up or login with your details

Forgot password? Click here to reset