Fast k-Nearest Neighbour Search via Prioritized DCI

03/01/2017
by   Ke Li, et al.
0

Most exact methods for k-nearest neighbour search suffer from the curse of dimensionality; that is, their query times exhibit exponential dependence on either the ambient or the intrinsic dimensionality. Dynamic Continuous Indexing (DCI) offers a promising way of circumventing the curse and successfully reduces the dependence of query time on intrinsic dimensionality from exponential to sublinear. In this paper, we propose a variant of DCI, which we call Prioritized DCI, and show a remarkable improvement in the dependence of query time on intrinsic dimensionality. In particular, a linear increase in intrinsic dimensionality, or equivalently, an exponential increase in the number of points near a query, can be mostly counteracted with just a linear increase in space. We also demonstrate empirically that Prioritized DCI significantly outperforms prior methods. In particular, relative to Locality-Sensitive Hashing (LSH), Prioritized DCI reduces the number of distance evaluations by a factor of 14 to 116 and the memory consumption by a factor of 21.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/01/2015

Fast k-Nearest Neighbour Search via Dynamic Continuous Indexing

Existing methods for retrieving k-nearest neighbours suffer from the cur...
research
05/25/2020

On the Problem of p_1^-1 in Locality-Sensitive Hashing

A Locality-Sensitive Hash (LSH) function is called (r,cr,p_1,p_2)-sensit...
research
03/26/2018

On the Intrinsic Dimensionality of Face Representation

The two underlying factors that determine the efficacy of face represent...
research
06/23/2020

ABID: Angle Based Intrinsic Dimensionality

The intrinsic dimensionality refers to the “true” dimensionality of the ...
research
10/10/2018

Technical Report: KNN Joins Using a Hybrid Approach: Exploiting CPU/GPU Workload Characteristics

This paper studies finding the K nearest neighbors (KNN) of all points i...
research
04/09/2020

k-Nearest Neighbour Classifiers – 2nd Edition

Perhaps the most straightforward classifier in the arsenal or machine le...
research
12/21/2020

Sublinear Maximum Inner Product Search using Concomitants of Extreme Order Statistics

We propose a novel dimensionality reduction method for maximum inner pro...

Please sign up or login with your details

Forgot password? Click here to reset