Feature selection with test cost constraint

09/25/2012
by   Fan Min, et al.
0

Feature selection is an important preprocessing step in machine learning and data mining. In real-world applications, costs, including money, time and other resources, are required to acquire the features. In some cases, there is a test cost constraint due to limited resources. We shall deliberately select an informative and cheap feature subset for classification. This paper proposes the feature selection with test cost constraint problem for this issue. The new problem has a simple form while described as a constraint satisfaction problem (CSP). Backtracking is a general algorithm for CSP, and it is efficient in solving the new problem on medium-sized data. As the backtracking algorithm is not scalable to large datasets, a heuristic algorithm is also developed. Experimental results show that the heuristic algorithm can find the optimal solution in most cases. We also redefine some existing feature selection problems in rough sets, especially in decision-theoretic rough sets, from the viewpoint of CSP. These new definitions provide insight to some new research directions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/31/2018

A Fuzzy-Rough based Binary Shuffled Frog Leaping Algorithm for Feature Selection

Feature selection and attribute reduction are crucial problems, and wide...
research
12/29/2021

An Efficient and Accurate Rough Set for Feature Selection, Classification and Knowledge Representation

This paper present a strong data mining method based on rough set, which...
research
06/23/2010

A Novel Rough Set Reduct Algorithm for Medical Domain Based on Bee Colony Optimization

Feature selection refers to the problem of selecting relevant features w...
research
05/04/2014

Feature Selection On Boolean Symbolic Objects

With the boom in IT technology, the data sets used in application are mo...
research
11/12/2012

Minimal cost feature selection of data with normal distribution measurement errors

Minimal cost feature selection is devoted to obtain a trade-off between ...
research
10/09/2020

Nonnegative Spectral Analysis with Adaptive Graph and L_2,0-Norm Regularization for Unsupervised Feature Selection

Feature selection is an important data preprocessing in data mining and ...
research
06/27/2017

Unsupervised Feature Selection Based on Space Filling Concept

The paper deals with the adaptation of a new measure for the unsupervise...

Please sign up or login with your details

Forgot password? Click here to reset