On Exact Feature Screening in Ultrahigh-dimensional Binary Classification

05/08/2022
by   Sarbojit Roy, et al.
0

We propose a new model-free feature screening method based on energy distances for ultrahigh-dimensional binary classification problems. Unlike existing methods, the cut-off involved in our procedure is data adaptive. With a high probability, the proposed method retains only relevant features after discarding all the noise variables. The proposed screening method is also extended to identify pairs of variables that are marginally undetectable, but have differences in their joint distributions. Finally, we build a classifier which maintains coherence between the proposed feature selection criteria and discrimination method, and also establish its risk consistency. An extensive numerical study with simulated data sets and real benchmark data sets show clear and convincing advantages of our classifier over the state-of-the-art methods.

READ FULL TEXT
research
08/19/2019

Model-free Feature Screening and FDR Control with Knockoff Features

This paper proposes a model-free and data-adaptive feature screening met...
research
01/05/2023

Screening Methods for Classification Based on Non-parametric Bayesian Tests

Feature or variable selection is a problem inherent to large data sets. ...
research
09/12/2020

Multiclass Model for Agriculture development using Multivariate Statistical method

Mahalanobis taguchi system (MTS) is a multi-variate statistical method e...
research
08/19/2019

Model-free Feature Screening with Projection Correlation and FDR Control with Knockoff Features

This paper proposes a model-free and data-adaptive feature screening met...
research
01/10/2018

Strong Sure Screening of Ultra-high Dimensional Categorical Data

Feature screening for ultra high dimensional feature spaces plays a crit...
research
02/15/2023

A model-free feature selection technique of feature screening and random forest based recursive feature elimination

In this paper, we propose a model-free feature selection method for ultr...
research
01/10/2022

SMLE: An R Package for Joint Feature Screening in Ultrahigh-dimensional GLMs

The sparsity-restricted maximum likelihood estimator (SMLE) has received...

Please sign up or login with your details

Forgot password? Click here to reset