Finding Relevant Points for Nearest-Neighbor Classification

10/12/2021
by   David Eppstein, et al.
0

In nearest-neighbor classification problems, a set of d-dimensional training points are given, each with a known classification, and are used to infer unknown classifications of other points by using the same classification as the nearest training point. A training point is relevant if its omission from the training set would change the outcome of some of these inferences. We provide a simple algorithm for thinning a training set down to its subset of relevant points, using as subroutines algorithms for finding the minimum spanning tree of a set of points and for finding the extreme points (convex hull vertices) of a set of points. The time bounds for our algorithm, in any constant dimension d≥ 3, improve on a previous algorithm for the same problem by Clarkson (FOCS 1994).

READ FULL TEXT
research
02/04/2023

Reducing Nearest Neighbor Training Sets Optimally and Exactly

In nearest-neighbor classification, a training set P of points in ℝ^d wi...
research
03/07/2022

Improved Search of Relevant Points for Nearest-Neighbor Classification

Given a training set P ⊂ℝ^d, the nearest-neighbor classifier assigns any...
research
08/15/2022

Training-Time Attacks against k-Nearest Neighbors

Nearest neighbor-based methods are commonly used for classification task...
research
02/25/2019

Adaptive Estimation for Approximate k-Nearest-Neighbor Computations

Algorithms often carry out equally many computations for "easy" and "har...
research
10/23/2017

An iterative closest point method for measuring the level of similarity of 3d log scans in wood industry

In the Canadian's lumber industry, simulators are used to predict the lu...
research
12/14/2017

Adaptive kNN using Expected Accuracy for Classification of Geo-Spatial Data

The k-Nearest Neighbor (kNN) classification approach is conceptually sim...
research
02/10/2018

A Continuation Method for Discrete Optimization and its Application to Nearest Neighbor Classification

The continuation method is a popular approach in non-convex optimization...

Please sign up or login with your details

Forgot password? Click here to reset