Feature selection in functional data classification with recursive maxima hunting

06/07/2018
by José L. Torrecilla, et al.

Dimensionality reduction is a key issue in the design of effective machine learning methods for automatic induction. In this work, we introduce recursive maxima hunting (RMH) for variable selection in classification problems with functional data. In this context, variable selection techniques are especially attractive because they reduce dimensionality, facilitate interpretation, and can improve the accuracy of predictive models. The method, a recursive extension of maxima hunting (MH), selects variables by identifying the maxima of a relevance function that measures the strength of the correlation between the functional predictor variable and the class label. At each stage, the information associated with the selected variable is removed by subtracting the conditional expectation of the process. An extensive empirical evaluation illustrates that, in the problems investigated, RMH achieves predictive accuracy comparable to or higher than that of standard dimensionality reduction techniques, such as PCA and PLS, and of state-of-the-art feature selection methods for functional data, such as maxima hunting.
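
The select-then-subtract loop described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation; it rests on several assumptions made only for illustration: trajectories are stored as rows of a NumPy array sampled on a common grid, the relevance function is the absolute Pearson correlation with the class label, and the conditional expectation of the process given the selected point is approximated by a per-point linear least-squares fit. The names `relevance`, `recursive_maxima_hunting`, and `tol` are hypothetical.

```python
import numpy as np


def relevance(R, y):
    """Absolute Pearson correlation of each grid point (column of R) with y.

    Assumption: a simple stand-in for the relevance function of the paper.
    """
    Rc = R - R.mean(axis=0)
    yc = y - y.mean()
    num = np.abs(Rc.T @ yc)
    den = np.sqrt((Rc ** 2).sum(axis=0) * (yc ** 2).sum())
    out = np.zeros_like(num)
    # Columns with zero variance keep relevance 0 instead of producing NaN.
    np.divide(num, den, out=out, where=den > 0)
    return out


def recursive_maxima_hunting(X, y, n_features, tol=1e-10):
    """Greedy sketch: pick the most relevant point, remove its information, repeat.

    X: (n_samples, n_points) discretized trajectories; y: class labels.
    Returns the selected column indices in order of selection.
    """
    X = np.asarray(X, dtype=float)
    y = np.asarray(y, dtype=float)
    R = X - X.mean(axis=0)      # centered residual process
    orig_var = X.var(axis=0)
    selected = []
    for _ in range(n_features):
        rel = relevance(R, y)
        # Ignore points whose residual is (numerically) exhausted.
        rel[R.var(axis=0) <= tol * orig_var] = 0.0
        if rel.max() <= 0.0:
            break               # nothing informative is left
        t0 = int(np.argmax(rel))
        selected.append(t0)
        # Assumption: E[R(t) | R(t0)] is approximated by the linear
        # least-squares fit beta(t) * R(t0), computed for every t at once.
        z = R[:, t0]
        beta = (z @ R) / (z @ z)
        R = R - np.outer(z, beta)
    return selected
```

Calling `recursive_maxima_hunting(X, y, 3)` returns up to three grid indices. Because the residual at a selected point is zero after the subtraction, neighboring points that merely duplicate its information lose relevance, which is the motivation for the recursive correction. The linear fit coincides with the conditional expectation only under additional assumptions, such as a Gaussian process.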

research
06/03/2021

A Subspace-based Approach for Dimensionality Reduction and Important Variable Selection

An analysis of high dimensional data can offer a detailed description of...
research
03/03/2021

Greedy Search Algorithms for Unsupervised Variable Selection: A Comparative Study

Dimensionality reduction is an important step in the development of scala...
research
12/14/2020

Recovery of Linear Components: Reduced Complexity Autoencoder Designs

Reducing dimensionality is a key preprocessing step in many data analysi...
research
12/06/2017

Sparsity Regularization for classification of large dimensional data

Feature selection has evolved to be a very important step in several mac...
research
12/11/2018

Classification of Cervical Cancer Dataset

Cervical cancer is the leading gynecological malignancy worldwide. This ...
research
03/26/2019

Sparse Learning for Variable Selection with Structures and Nonlinearities

In this thesis we discuss machine learning methods performing automated ...
