Screening Methods for Classification Based on Non-parametric Bayesian Tests

01/05/2023
by   Naveed Merchant, et al.
0

Feature or variable selection is a problem inherent to large data sets. While many methods have been proposed to deal with this problem, some can scale poorly with the number of predictors in a data set. Screening methods scale linearly with the number of predictors by checking each predictor one at a time, and are a tool used to decrease the number of variables to consider before further analysis or variable selection. For classification, there is a variety of techniques. There are parametric based screening tests, such as t-test or SIS based screening, and non-parametric based screening tests, such as Kolmogorov distance based screening, and MV-SIS. We propose a method for variable screening that uses Bayesian-motivated tests, compare it to SIS based screening, and provide example applications of the method on simulated and real data. It is shown that our screening method can lead to improvements in classification rate. This is so even when our method is used in conjunction with a classifier, such as DART, which is designed to select a sparse subset of variables. Finally, we propose a classifier based on kernel density estimates that in some cases can produce dramatic improvements in classification rates relative to DART.

READ FULL TEXT

page 17

page 18

page 19

research
06/09/2023

Variable screening using factor analysis for high-dimensional data with multicollinearity

Screening methods are useful tools for variable selection in regression ...
research
12/30/2017

An ISIS screening approach involving threshold/partition for variable selection in linear regression

In linear regression, one can select a predictor if the absolute sample ...
research
05/08/2022

On Exact Feature Screening in Ultrahigh-dimensional Binary Classification

We propose a new model-free feature screening method based on energy dis...
research
02/04/2020

Accelerating Psychometric Screening Tests With Bayesian Active Differential Selection

Classical methods for psychometric function estimation either require ex...
research
01/04/2019

Fast Multi-Class Probabilistic Classifier by Sparse Non-parametric Density Estimation

The model interpretation is essential in many application scenarios and ...
research
04/22/2013

Bayesian crack detection in ultra high resolution multimodal images of paintings

The preservation of our cultural heritage is of paramount importance. Th...

Please sign up or login with your details

Forgot password? Click here to reset