Persian Sentiment Analyzer: A Framework based on a Novel Feature Selection Method

by Ayoub Bagheri, et al.

Over the past decade, the enormous growth of digital content on the internet and in databases has drawn increasing attention to sentiment analysis among information retrieval and natural language processing researchers. Sentiment analysis aims to use automated tools to detect subjective information in reviews. One of its main challenges is feature selection, which is widely used as the first stage of analysis and classification tasks to reduce the dimensionality of the problem and to improve speed by eliminating irrelevant and redundant features. Little research has been conducted so far on feature selection in sentiment analysis, and work on Persian sentiment analysis is rarer still. This paper considers the problem of sentiment classification using different feature selection methods for online customer reviews in the Persian language. Three of the challenges of Persian text are its wide variety of declensional suffixes, its varying word spacing, and its many informal or colloquial words. We address these challenges by proposing a model for sentiment classification of Persian review documents. The proposed model is based on lemmatization and feature selection and employs the Naive Bayes algorithm for classification. We evaluate the performance of the model on a manually gathered collection of cellphone reviews; the results show the effectiveness of the proposed approaches.
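
To make the pipeline in the abstract concrete, here is a minimal sketch of feature selection followed by Naive Bayes classification. It is not the authors' method: the abstract does not describe the novel feature selection technique, so mutual information between a token's presence and the class label is used as a stand-in. Lemmatization is omitted. The toy English reviews and the names `select_features`, `train_nb`, and `predict` are illustrative assumptions.

```python
import math
from collections import Counter


def tokenize(text):
    # Assumption: lowercase + whitespace split stands in for the paper's
    # lemmatization step, which is not reproduced here.
    return text.lower().split()


def select_features(docs, labels, k):
    """Keep the k tokens with the highest mutual information with the label.

    Mutual information is a swapped-in criterion, not the paper's method.
    """
    n = len(docs)
    classes = sorted(set(labels))
    class_count = Counter(labels)
    df = Counter()    # number of documents containing each token
    df_c = Counter()  # number of documents of class c containing each token
    for tokens, y in zip(docs, labels):
        for t in set(tokens):
            df[t] += 1
            df_c[(t, y)] += 1

    def mi(t):
        score = 0.0
        for c in classes:
            for present in (True, False):
                n_tc = df_c[(t, c)] if present else class_count[c] - df_c[(t, c)]
                n_t = df[t] if present else n - df[t]
                if n_tc == 0:
                    continue  # a zero joint count contributes nothing
                score += (n_tc / n) * math.log((n_tc * n) / (n_t * class_count[c]))
        return score

    # Ties are broken alphabetically so the result is deterministic.
    return set(sorted(df, key=lambda t: (-mi(t), t))[:k])


def train_nb(docs, labels, vocab):
    """Multinomial Naive Bayes with add-one smoothing over the selected vocab."""
    classes = sorted(set(labels))
    prior = {c: math.log(labels.count(c) / len(labels)) for c in classes}
    counts = {c: Counter() for c in classes}
    for tokens, y in zip(docs, labels):
        counts[y].update(t for t in tokens if t in vocab)
    loglik = {}
    for c in classes:
        total = sum(counts[c].values()) + len(vocab)
        loglik[c] = {t: math.log((counts[c][t] + 1) / total) for t in vocab}
    return prior, loglik


def predict(model, tokens):
    prior, loglik = model
    # Tokens outside the selected vocabulary are ignored.
    return max(
        prior,
        key=lambda c: prior[c] + sum(loglik[c][t] for t in tokens if t in loglik[c]),
    )


# Toy data, purely illustrative (not from the paper's cellphone review corpus).
reviews = [
    ("good battery good screen", "pos"),
    ("great camera good price", "pos"),
    ("bad battery poor screen", "neg"),
    ("poor camera bad price", "neg"),
]
docs = [tokenize(text) for text, _ in reviews]
labels = [label for _, label in reviews]

vocab = select_features(docs, labels, k=3)
model = train_nb(docs, labels, vocab)
print(sorted(vocab))
print(predict(model, tokenize("good phone")))
```

In this toy set, tokens such as "battery" appear equally often in both classes, so they carry no information about the label and are dropped before training. This is the dimensionality reduction the abstract refers to.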






