Testing for Feature Relevance: The HARVEST Algorithm

09/30/2017
by   Herbert Weisberg, et al.
0

Feature selection with high-dimensional data and a very small proportion of relevant features poses a severe challenge to standard statistical methods. We have developed a new approach (HARVEST) that is straightforward to apply, albeit somewhat computer-intensive. This algorithm can be used to pre-screen a large number of features to identify those that are potentially useful. The basic idea is to evaluate each feature in the context of many random subsets of other features. HARVEST is predicated on the assumption that an irrelevant feature can add no real predictive value, regardless of which other features are included in the subset. Motivated by this idea, we have derived a simple statistical test for feature relevance. Empirical analyses and simulations produced so far indicate that the HARVEST algorithm is highly effective in predictive analytics, both in science and business.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/01/2020

A generalised OMP algorithm for feature selection with application to gene expression data

Feature selection for predictive analytics is the problem of identifying...
research
02/25/2023

Online Sparse Streaming Feature Selection Using Adapted Classification

Traditional feature selections need to know the feature space before lea...
research
06/16/2022

Powershap: A Power-full Shapley Feature Selection Method

Feature selection is a crucial step in developing robust and powerful ma...
research
04/05/2023

How good Neural Networks interpretation methods really are? A quantitative benchmark

Saliency Maps (SMs) have been extensively used to interpret deep learnin...
research
07/30/2014

Fast Bayesian Feature Selection for High Dimensional Linear Regression in Genomics via the Ising Approximation

Feature selection, identifying a subset of variables that are relevant f...
research
12/17/2022

An Evolutionary Multitasking Algorithm with Multiple Filtering for High-Dimensional Feature Selection

Recently, evolutionary multitasking (EMT) has been successfully used in ...
research
03/02/2019

FRI - Feature Relevance Intervals for Interpretable and Interactive Data Exploration

Most existing feature selection methods are insufficient for analytic pu...

Please sign up or login with your details

Forgot password? Click here to reset