Feature selection using binary grey wolf optimizer with elite-based crossover for Arabic text classification

11/20/2020
by Ali Asghar Heidari, et al.

Text classification is a challenging computational task in the machine learning community because of the growing volume of natural language text documents available in electronic form. Feature selection (FS) is an essential phase of this process because thousands of possible feature sets may be considered in text classification. This paper proposes an enhanced binary grey wolf optimizer (GWO) within a wrapper FS approach to tackle Arabic text classification problems. The proposed binary GWO serves as the wrapper-based feature selection technique. The performance of the proposed method is investigated using different learning models, including decision trees, K-nearest neighbour, Naive Bayes, and SVM classifiers. Three public Arabic datasets, namely Alwatan, Akhbar-Alkhaleej, and Al-jazeera-News, are used to evaluate the efficacy of different BGWO-based wrapper methods. Results and analysis show that the SVM-based feature selection technique using the proposed binary GWO optimizer with an elite-based crossover scheme is more effective on Arabic text classification problems than its peers. Visit http://aliasgharheidari.com
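
To make the wrapper idea concrete, the following is a minimal sketch of a binary GWO searching over feature masks. It is not the paper's implementation. The assumptions are: a leave-one-out 1-nearest-neighbour classifier stands in for the learning models named above, the fitness is a commonly used weighted sum of error rate and selected-feature ratio with a hypothetical weight `alpha=0.99`, and continuous positions are binarized with a sigmoid transfer function. The elite-based crossover scheme is not reproduced because the abstract does not specify it. All function and parameter names are illustrative.

```python
import math
import random

def loo_1nn_error(X, y, mask):
    """Leave-one-out error of a 1-NN classifier restricted to the masked features."""
    cols = [j for j, bit in enumerate(mask) if bit]
    if not cols:
        return 1.0  # assumption: an empty feature subset counts as worst case
    errors = 0
    for i, xi in enumerate(X):
        best_d, best_label = float("inf"), None
        for k, xk in enumerate(X):
            if k == i:
                continue
            d = sum((xi[j] - xk[j]) ** 2 for j in cols)
            if d < best_d:
                best_d, best_label = d, y[k]
        errors += best_label != y[i]
    return errors / len(X)

def fitness(X, y, mask, alpha=0.99):
    """Wrapper fitness (lower is better): weighted error plus feature ratio."""
    return alpha * loo_1nn_error(X, y, mask) + (1 - alpha) * sum(mask) / len(mask)

def binary_gwo(X, y, n_wolves=8, n_iter=20, seed=0):
    """Search for a binary feature mask; needs at least three wolves."""
    rng = random.Random(seed)
    dim = len(X[0])
    score = lambda w: fitness(X, y, w)
    wolves = [[rng.randint(0, 1) for _ in range(dim)] for _ in range(n_wolves)]
    best = min(wolves, key=score)[:]
    for t in range(n_iter):
        leaders = sorted(wolves, key=score)[:3]  # alpha, beta, delta wolves
        a = 2 - 2 * t / n_iter  # decreases linearly from 2 towards 0
        new_wolves = []
        for w in wolves:
            child = []
            for j in range(dim):
                x = 0.0
                for leader in leaders:
                    A = 2 * a * rng.random() - a
                    C = 2 * rng.random()
                    D = abs(C * leader[j] - w[j])
                    x += leader[j] - A * D
                x /= 3  # average of the three leader-guided moves
                prob = 1 / (1 + math.exp(-10 * (x - 0.5)))  # sigmoid transfer
                child.append(1 if rng.random() < prob else 0)
            new_wolves.append(child)
        wolves = new_wolves
        best = min(wolves + [best], key=score)[:]  # keep the best mask seen so far
    return best

# Toy data (made up for illustration): feature 0 separates the classes, feature 1 misleads.
X = [[0.0, 5.0], [0.1, 1.0], [1.0, 5.2], [1.1, 0.9]]
y = [0, 0, 1, 1]
print(binary_gwo(X, y))
```

In a text classification setting, each bit of the mask would correspond to one term in the vocabulary, and the inner classifier would be replaced by whichever learner is being wrapped.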
