Finding Significant Combinations of Continuous Features

02/28/2017
by Mahito Sugiyama, et al.

We present an efficient feature selection method that finds all multiplicative combinations of continuous features that are statistically significantly associated with the class variable, while rigorously correcting for multiple testing. The key to overcoming the combinatorial explosion in the number of candidates is a lower bound on the p-value of each feature combination, which lets us massively prune combinations that can never be significant and thereby gain statistical power. This problem has previously been addressed for binary features; here we present the first solution for continuous features. In our experiments, our approach detects true feature combinations with higher precision and recall than competing methods that require prior binarization of the data.
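The pruning idea can be illustrated in the simpler binary-feature setting that the abstract cites as earlier work. The following is a minimal sketch, assuming a one-sided Fisher's exact test and Tarone-style counting of testable hypotheses. The function names and the toy numbers are hypothetical, and this is not the bound the paper derives for continuous features.

```python
from math import comb


def min_p_value(support: int, n_total: int, n_pos: int) -> float:
    """Smallest one-sided Fisher p-value a pattern with this support can reach.

    `support` is the number of samples in which the pattern occurs,
    `n_total` the number of samples, `n_pos` the size of the positive class.
    The bound depends only on these margins, not on the class labels.
    """
    # Most extreme table: as many occurrences as possible in the positive class.
    a = min(support, n_pos)
    return comb(n_pos, a) * comb(n_total - n_pos, support - a) / comb(n_total, support)


def tarone_correction(supports, n_total, n_pos, alpha=0.05):
    """Return (k, alpha / k), where k is the smallest correction factor such
    that at most k patterns can reach a p-value of alpha / k or less."""
    bounds = [min_p_value(s, n_total, n_pos) for s in supports]
    k = 1
    while True:
        testable = sum(b <= alpha / k for b in bounds)
        if testable <= k:
            return k, alpha / k
        k += 1


# Toy example: 7 candidate patterns, 20 samples, 10 of them positive.
supports = [1, 1, 2, 2, 5, 8, 10]
k, threshold = tarone_correction(supports, n_total=20, n_pos=10)
print(f"Bonferroni factor: {len(supports)}, Tarone factor: {k}")
```

Patterns whose lower bound exceeds the corrected threshold can never be significant, so they are dropped from the correction factor without ever being tested. That is where the gain in statistical power comes from in this sketch.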


