Pre-Selection of Independent Binary Features: An Application to Diagnosing Scrapie in Sheep

07/11/2012
by   Ludmila Kuncheva, et al.
0

Suppose that the only available information in a multi-class problem are expert estimates of the conditional probabilities of occurrence for a set of binary features. The aim is to select a subset of features to be measured in subsequent data collection experiments. In the lack of any information about the dependencies between the features, we assume that all features are conditionally independent and hence choose the Naive Bayes classifier as the optimal classifier for the problem. Even in this (seemingly trivial) case of complete knowledge of the distributions, choosing an optimal feature subset is not straightforward. We discuss the properties and implementation details of Sequential Forward Selection (SFS) as a feature selection procedure for the current problem. A sensitivity analysis was carried out to investigate whether the same features are selected when the probabilities vary around the estimated values. The procedure is illustrated with a set of probability estimates for Scrapie in sheep.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/12/2015

Search Strategies for Binary Feature Selection for a Naive Bayes Classifier

We compare in this paper several feature selection methods for the Naive...
research
01/16/2013

Bayesian Classification and Feature Selection from Finite Data Sets

Feature selection aims to select the smallest subset of features for a s...
research
05/23/2019

Naive Feature Selection: Sparsity in Naive Bayes

Due to its linear complexity, naive Bayes classification remains an attr...
research
01/23/2021

Feature Selection Using Reinforcement Learning

With the decreasing cost of data collection, the space of variables or f...
research
04/07/2016

Building Ensembles of Adaptive Nested Dichotomies with Random-Pair Selection

A system of nested dichotomies is a method of decomposing a multi-class ...
research
02/21/2023

Don't guess what's true: choose what's optimal. A probability transducer for machine-learning classifiers

In fields such as medicine and drug discovery, the ultimate goal of a cl...
research
01/12/2020

On Feature Interactions Identified by Shapley Values of Binary Classification Games

For feature selection and related problems, we introduce the notion of c...

Please sign up or login with your details

Forgot password? Click here to reset