Does a Hybrid Neural Network based Feature Selection Model Improve Text Classification?

01/22/2021
by Suman Dowlagar, et al.
Text classification is a fundamental problem in natural language processing. It relies mainly on giving more weight to the relevant features that help classify the textual data. Text can also contain redundant or highly correlated features, which increase the complexity of the classification algorithm. Many dimensionality reduction methods have therefore been proposed for use with traditional machine learning classifiers, and this combination has achieved good results. In this paper, we propose a hybrid feature selection method that obtains relevant features by combining various filter-based feature selection methods with the fastText classifier. We then present three ways of implementing a feature selection and neural network pipeline. We observed a reduction in training time when feature selection methods are used together with neural networks, and a slight increase in accuracy on some datasets.
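To make the idea of a filter-based selection step concrete, the following is a minimal sketch and not the authors' method. It assumes a chi-square filter (the abstract does not name the filters used), a toy four-document corpus, and hypothetical helper names (`chi2_scores`, `select_top_k`, `filter_docs`). The classifier stage is omitted; in a pipeline like the one described, the filtered documents would be passed on to a classifier such as fastText.

```python
from collections import Counter


def chi2_scores(docs, labels):
    """Score each term by its best chi-square association with any class.

    Assumption: chi-square is used here only as one example of a
    filter-based method; the abstract does not say which filters are combined.
    """
    n = len(docs)
    class_count = Counter(labels)
    doc_sets = [set(d.split()) for d in docs]  # document-level term presence
    term_count = Counter(t for s in doc_sets for t in s)
    term_class = Counter((t, y) for s, y in zip(doc_sets, labels) for t in s)

    scores = {}
    for term, n_t in term_count.items():
        best = 0.0
        for cls, n_c in class_count.items():
            a = term_class[(term, cls)]  # in class, term present
            b = n_t - a                  # other classes, term present
            c = n_c - a                  # in class, term absent
            d = n - n_t - c              # other classes, term absent
            denom = (a + b) * (c + d) * (a + c) * (b + d)
            if denom:  # a zero denominator means the term carries no signal
                best = max(best, n * (a * d - b * c) ** 2 / denom)
        scores[term] = best
    return scores


def select_top_k(scores, k):
    """Return the k highest-scoring terms, ties broken alphabetically."""
    return sorted(scores, key=lambda t: (-scores[t], t))[:k]


def filter_docs(docs, keep):
    """Drop every token that is not in the selected vocabulary."""
    return [" ".join(t for t in d.split() if t in keep) for d in docs]


# Toy corpus (hypothetical data, for illustration only).
docs = [
    "the match was a great game",
    "the team won the game",
    "the market fell on weak earnings",
    "the bank raised interest rates",
]
labels = ["sports", "sports", "finance", "finance"]

scores = chi2_scores(docs, labels)
selected = set(select_top_k(scores, 5))
reduced_docs = filter_docs(docs, selected)  # input for the downstream classifier
```

A term such as "the", which appears in every document, scores zero and is dropped, so the classifier is trained on a smaller vocabulary. That smaller vocabulary is one plausible source of the shorter training time the abstract reports.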

research
03/30/2022

IGRF-RFE: A Hybrid Feature Selection Method for MLP-based Network Intrusion Detection on UNSW-NB15 Dataset

The effectiveness of machine learning models is significantly affected b...
research
09/18/2023

Noise-Augmented Boruta: The Neural Network Perturbation Infusion with Boruta Feature Selection

With the surge in data generation, both vertically (i.e., volume of data...
research
06/07/2021

Evaluating Meta-Feature Selection for the Algorithm Recommendation Problem

With the popularity of Machine Learning (ML) solutions, algorithms and d...
research
10/10/2010

Multi-Objective Genetic Programming Projection Pursuit for Exploratory Data Modeling

For classification problems, feature extraction is a crucial process whi...
research
10/06/2020

Identifying Spurious Correlations for Robust Text Classification

The predictions of text classifiers are often driven by spurious correla...
research
12/06/2017

Sparsity Regularization for classification of large dimensional data

Feature selection has evolved to be a very important step in several mac...
research
09/12/2019

A Robust Hybrid Approach for Textual Document Classification

Text document classification is an important task for diverse natural la...
