The Role of Local Intrinsic Dimensionality in Benchmarking Nearest Neighbor Search

07/17/2019
by   Martin Aumüller, et al.
0

This paper reconsiders common benchmarking approaches to nearest neighbor search. It is shown that the concept of local intrinsic dimensionality (LID) allows to choose query sets of a wide range of difficulty for real-world datasets. Moreover, the effect of different LID distributions on the running time performance of implementations is empirically studied. To this end, different visualization concepts are introduced that allow to get a more fine-grained overview of the inner workings of nearest neighbor search principles. The paper closes with remarks about the diversity of datasets commonly used for nearest neighbor search benchmarking. It is shown that such real-world datasets are not diverse: results on a single dataset predict results on all other datasets well.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2020

LANNS: A Web-Scale Approximate Nearest Neighbor Lookup System

Nearest neighbor search (NNS) has a wide range of applications in inform...
research
07/11/2023

Quantitative Comparison of Nearest Neighbor Search Algorithms

We compare the performance of three nearest neighbor search algorithms: ...
research
10/09/2019

Scalable Nearest Neighbor Search for Optimal Transport

The Optimal Transport (a.k.a. Wasserstein) distance is an increasingly p...
research
10/02/2021

Tao: A Learning Framework for Adaptive Nearest Neighbor Search using Static Features Only

Approximate nearest neighbor (ANN) search is a fundamental problem in ar...
research
11/19/2018

DeepIR: A Deep Semantics Driven Framework for Image Retargeting

We present Deep Image Retargeting (DeepIR), a coarse-to-fine framework f...
research
01/29/2021

A Comprehensive Survey and Experimental Comparison of Graph-Based Approximate Nearest Neighbor Search

Approximate nearest neighbor search (ANNS) constitutes an important oper...
research
07/15/2018

ANN-Benchmarks: A Benchmarking Tool for Approximate Nearest Neighbor Algorithms

This paper describes ANN-Benchmarks, a tool for evaluating the performan...

Please sign up or login with your details

Forgot password? Click here to reset