Nearest Neighbor Search Under Uncertainty

03/08/2021
by   Blake Mason, et al.
0

Nearest Neighbor Search (NNS) is a central task in knowledge representation, learning, and reasoning. There is vast literature on efficient algorithms for constructing data structures and performing exact and approximate NNS. This paper studies NNS under Uncertainty (NNSU). Specifically, consider the setting in which an NNS algorithm has access only to a stochastic distance oracle that provides a noisy, unbiased estimate of the distance between any pair of points, rather than the exact distance. This models many situations of practical importance, including NNS based on human similarity judgements, physical measurements, or fast, randomized approximations to exact distances. A naive approach to NNSU could employ any standard NNS algorithm and repeatedly query and average results from the stochastic oracle (to reduce noise) whenever it needs a pairwise distance. The problem is that a sufficient number of repeated queries is unknown in advance; e.g., a point maybe distant from all but one other point (crude distance estimates suffice) or it may be close to a large number of other points (accurate estimates are necessary). This paper shows how ideas from cover trees and multi-armed bandits can be leveraged to develop an NNSU algorithm that has optimal dependence on the dataset size and the (unknown)geometry of the dataset.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/26/2020

Query Complexity of k-NN based Mode Estimation

Motivated by the mode estimation problem of an unknown multivariate prob...
research
04/05/2017

Comparison Based Nearest Neighbor Search

We consider machine learning in a comparison-based setting where we are ...
research
05/30/2019

Learning Nearest Neighbor Graphs from Noisy Distance Samples

We consider the problem of learning the nearest neighbor graph of a data...
research
06/11/2019

Similarity Problems in High Dimensions

The main contribution of this dissertation is the introduction of new or...
research
03/09/2018

TRAJEDI: Trajectory Dissimilarity

The vast increase in our ability to obtain and store trajectory data nec...
research
11/03/2020

Greedy k-Center from Noisy Distance Samples

We study a variant of the canonical k-center problem over a set of verti...
research
05/12/2021

How to Design Robust Algorithms using Noisy Comparison Oracle

Metric based comparison operations such as finding maximum, nearest and ...

Please sign up or login with your details

Forgot password? Click here to reset