Reducing Nearest Neighbor Training Sets Optimally and Exactly

02/04/2023
by Josiah Rohrer, et al.

In nearest-neighbor classification, a training set P of points in ℝ^d with given classifications is used to classify every point in ℝ^d: every point receives the same classification as its nearest neighbor in P. Recently, Eppstein [SOSA'22] developed an algorithm to detect the relevant training points, that is, those points p ∈ P such that P and P∖{p} induce different classifications. We investigate the problem of finding a minimum-cardinality reduced training set P' ⊆ P such that P and P' induce the same classification. We show that the set of relevant points is such a minimum-cardinality reduced training set if P is in general position. Furthermore, we show that finding a minimum-cardinality reduced training set for possibly degenerate P can be solved in polynomial time for d=1 and is NP-complete for d ≥ 2.
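
To make the definition of a relevant point concrete, here is a minimal brute-force sketch for the one-dimensional case. It is not the algorithm of Eppstein or of this paper. The helper names `signature` and `relevant_points` are hypothetical, and the sketch assumes distinct integer coordinates and ignores how ties at decision boundaries are classified.

```python
from typing import Hashable, List, Optional, Tuple

Point = Tuple[int, Hashable]  # (coordinate on the real line, class label)


def signature(points: List[Point]) -> Optional[tuple]:
    """Describe the 1-D nearest-neighbor classification induced by `points`.

    Assumption: coordinates are distinct integers. The classification is
    captured by the label of the leftmost region plus every boundary
    where the label changes. Boundaries are stored as twice the midpoint
    (x0 + x1) so that all comparisons stay in exact integer arithmetic.
    """
    if not points:
        return None  # an empty training set induces no classification
    pts = sorted(points, key=lambda p: p[0])
    boundaries = []
    for (x0, c0), (x1, c1) in zip(pts, pts[1:]):
        if c0 != c1:
            # The label switches from c0 to c1 at the midpoint of x0 and x1.
            boundaries.append((x0 + x1, c1))
    return (pts[0][1], tuple(boundaries))


def relevant_points(points: List[Point]) -> List[Point]:
    """Return every p such that P and P without p induce different classifications."""
    full = signature(points)
    return [
        p
        for i, p in enumerate(points)
        if signature(points[:i] + points[i + 1:]) != full
    ]


training = [(0, "a"), (1, "a"), (3, "b"), (4, "b")]
reduced = relevant_points(training)
```

In this example only the two points adjacent to the class boundary are relevant, and those two points by themselves reproduce the classification of the full set. The brute-force check recomputes the signature once per point, so it is meant only as an illustration of the definition, not as an efficient method.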

research
10/12/2021

Finding Relevant Points for Nearest-Neighbor Classification

In nearest-neighbor classification problems, a set of d-dimensional trai...
research
02/16/2020

Coresets for the Nearest-Neighbor Rule

The problem of nearest-neighbor condensation deals with finding a subset...
research
08/25/2017

k-Nearest Neighbor Augmented Neural Networks for Text Classification

In recent years, many deep-learning based models are proposed for text c...
research
06/28/2020

Social Distancing is Good for Points too!

The nearest-neighbor rule is a well-known classification technique that,...
research
10/19/2018

Stochastic temporal data upscaling using the generalized k-nearest neighbor algorithm

Three methods of temporal data upscaling, which may collectively be call...
research
07/24/2019

A graphical heuristic for reduction and partitioning of large datasets for scalable supervised training

A scalable graphical method is presented for selecting, and partitioning...
research
02/05/2019

Analyzing and Improving Representations with the Soft Nearest Neighbor Loss

We explore and expand the Soft Nearest Neighbor Loss to measure the enta...
