Instance-based learning using the Half-Space Proximal Graph

02/04/2021
by   Ariana Talamantes, et al.

The primary example of instance-based learning is the k-nearest neighbor rule (kNN), praised for its simplicity and its capacity to adapt to new, unseen data and to discard old data. The main disadvantages often mentioned are the classification complexity, which is O(n), and the estimation of the parameter k, the number of nearest neighbors to be used. Using an index at classification time removes the former disadvantage, while there is no conclusive method for the latter. This paper presents a parameter-free instance-based learning algorithm using the Half-Space Proximal (HSP) graph. The HSP neighbors simultaneously possess proximity and variety with respect to the center node. To classify a given query, we compute its HSP neighbors and apply a simple majority rule over them. In our experiments, the resulting classifier outperformed kNN for any k on a battery of datasets. This improvement persists even when applying weighted majority rules to both the kNN and HSP classifiers. Surprisingly, when using a probabilistic index to approximate the HSP graph and consequently speed up the classification task, our method could improve its accuracy, in stark contrast with the kNN classifier, which worsens with a probabilistic index.
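The classification rule described above can be illustrated with a minimal sketch. It assumes the commonly stated HSP construction: repeatedly take the nearest remaining candidate as a neighbor, then discard every candidate that is closer to that neighbor than to the query. The function names `hsp_neighbors` and `hsp_classify`, the Euclidean metric, the exhaustive (index-free) scan, and the tie handling are assumptions made for illustration, not details taken from the paper.

```python
import math
from collections import Counter


def hsp_neighbors(query, points):
    """Return indices of the HSP neighbors of `query` among `points`.

    Assumed construction: pick the nearest remaining candidate, keep it
    as a neighbor, and drop every candidate strictly closer to that
    neighbor than to the query. Repeat until no candidates remain.
    """
    candidates = list(range(len(points)))
    neighbors = []
    while candidates:
        # The nearest remaining candidate becomes an HSP neighbor.
        nearest = min(candidates, key=lambda i: math.dist(query, points[i]))
        neighbors.append(nearest)
        # Keep only candidates at least as close to the query as to the
        # newly chosen neighbor (ties are kept, an illustrative choice).
        candidates = [
            i for i in candidates
            if i != nearest
            and math.dist(points[i], query) <= math.dist(points[i], points[nearest])
        ]
    return neighbors


def hsp_classify(query, points, labels):
    """Simple (unweighted) majority vote over the HSP neighbors.

    Assumes `points` is non-empty; vote ties go to the label seen first.
    """
    votes = Counter(labels[i] for i in hsp_neighbors(query, points))
    return votes.most_common(1)[0][0]


# Hypothetical toy data: four labeled points in the plane.
points = [(1.0, 0.0), (2.0, 0.0), (-1.5, 0.0), (0.0, 3.0)]
labels = ["a", "a", "b", "b"]
print(hsp_neighbors((0.0, 0.0), points))         # [0, 2, 3]
print(hsp_classify((0.0, 0.0), points, labels))  # b
```

In this toy example the point (2, 0) is discarded because it lies behind (1, 0) as seen from the query, so the number of neighbors is determined by the geometry of the data rather than by a parameter such as k.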

research
01/27/2013

An improvement to k-nearest neighbor classifier

K-Nearest neighbor classifier (k-NNC) is simple to use and has little de...
research
11/19/2022

A Two-Stage Active Learning Algorithm for k-Nearest Neighbors

We introduce a simple and intuitive two-stage active learning algorithm ...
research
02/25/2019

Adaptive Estimation for Approximate k-Nearest-Neighbor Computations

Algorithms often carry out equally many computations for "easy" and "har...
research
08/20/2019

Multi-hypothesis classifier

Accuracy is the most important parameter among few others which defines ...
research
11/12/2022

Far Away in the Deep Space: Nearest-Neighbor-Based Dense Out-of-Distribution Detection

The key to out-of-distribution detection is density estimation of the in...
research
06/03/2016

A Graph-Based Semi-Supervised k Nearest-Neighbor Method for Nonlinear Manifold Distributed Data Classification

k Nearest Neighbors (kNN) is one of the most widely used supervised lear...
