Fast and Bayes-consistent nearest neighbors

10/07/2019
by Klim Efremenko, et al.

Research on nearest-neighbor methods tends to focus either on the statistical aspects or on the computational ones: either on, say, Bayes consistency and rates of convergence, or on techniques for speeding up the proximity search. This paper aims to bridge these realms: to reap the advantages of fast evaluation time while maintaining Bayes consistency, and without sacrificing too much in the risk decay rate. We combine the locality-sensitive hashing (LSH) technique with a novel missing-mass argument to obtain a fast and Bayes-consistent classifier. Our algorithm's prediction runtime compares favorably against state-of-the-art approximate NN methods, while maintaining Bayes consistency and attaining rates comparable to minimax. On samples of size n in ℝ^d, our pre-processing phase has runtime O(d n log n), while the evaluation phase has runtime O(d log n) per query point.
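To make the general idea concrete, here is a minimal sketch of a hash-bucket classifier. It is not the algorithm from the paper, whose details are not given in this abstract. It assumes a random-hyperplane (sign of a random projection) hash family, a majority vote inside each bucket, and a fallback to the overall majority label for queries that land in a bucket with no training points. The class name `LSHBucketClassifier` and the parameters `n_bits` and `seed` are hypothetical.

```python
import random
from collections import Counter, defaultdict


class LSHBucketClassifier:
    """Illustrative only: majority vote among training points in the query's hash bucket."""

    def __init__(self, dim, n_bits, seed=0):
        rng = random.Random(seed)
        # Assumption: each hash bit is the sign of a projection onto a random Gaussian direction.
        self.planes = [[rng.gauss(0.0, 1.0) for _ in range(dim)] for _ in range(n_bits)]
        self.votes = defaultdict(Counter)  # bucket key -> label counts
        self.default = None                # label used for empty buckets

    def _key(self, x):
        # O(d * n_bits) work: one dot product per hash bit.
        return tuple(sum(w * xi for w, xi in zip(p, x)) >= 0.0 for p in self.planes)

    def fit(self, X, y):
        # One hash evaluation per training point.
        for x, label in zip(X, y):
            self.votes[self._key(x)][label] += 1
        # Assumption: unseen buckets fall back to the overall majority label.
        self.default = Counter(y).most_common(1)[0][0]
        return self

    def predict(self, x):
        bucket = self.votes.get(self._key(x))
        if not bucket:
            return self.default
        return bucket.most_common(1)[0][0]
```

With `n_bits` chosen on the order of log n, this sketch hashes each of the n training points in O(d log n) time, for O(d n log n) pre-processing, and answers a query with a single hash evaluation in O(d log n). These happen to match the orders of growth quoted above, but the sketch makes no claim to the paper's consistency or rate guarantees.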


