Combining Feature and Prototype Pruning by Uncertainty Minimization

01/16/2013
by   Marc Sebban, et al.
0

We focus in this paper on dataset reduction techniques for use in k-nearest neighbor classification. In such a context, feature and prototype selections have always been independently treated by the standard storage reduction algorithms. While this certifying is theoretically justified by the fact that each subproblem is NP-hard, we assume in this paper that a joint storage reduction is in fact more intuitive and can in practice provide better results than two independent processes. Moreover, it avoids a lot of distance calculations by progressively removing useless instances during the feature pruning. While standard selection algorithms often optimize the accuracy to discriminate the set of solutions, we use in this paper a criterion based on an uncertainty measure within a nearest-neighbor graph. This choice comes from recent results that have proven that accuracy is not always the suitable criterion to optimize. In our approach, a feature or an instance is removed if its deletion improves information of the graph. Numerous experiments are presented in this paper and a statistical analysis shows the relevance of our approach, and its tolerance in the presence of noise.

READ FULL TEXT

page 1

page 2

research
04/03/2020

Nearest neighbor representations of Boolean functions

A nearest neighbor representation of a Boolean function is a set of posi...
research
01/30/2022

Empirical complexity of comparator-based nearest neighbor descent

A Java parallel streams implementation of the K-nearest neighbor descent...
research
02/16/2020

Coresets for the Nearest-Neighbor Rule

The problem of nearest-neighbor condensation deals with finding a subset...
research
08/15/2022

Training-Time Attacks against k-Nearest Neighbors

Nearest neighbor-based methods are commonly used for classification task...
research
05/31/2017

Towards Learned Clauses Database Reduction Strategies Based on Dominance Relationship

Clause Learning is one of the most important components of a conflict dr...

Please sign up or login with your details

Forgot password? Click here to reset