Optimal Las Vegas Approximate Near Neighbors in ℓ_p

07/19/2018
by   Alexander Wei, et al.
0

We show that approximate near neighbor search in high dimensions can be solved in a Las Vegas fashion (i.e., without false negatives) for ℓ_p (1< p< 2) while matching the performance of optimal locality-sensitive hashing. Specifically, we construct a data-independent Las Vegas data structure with query time O(dn^ρ) and space usage O(dn^1+ρ) for (r, c r)-approximate near neighbors in R^d under the ℓ_p norm, where ρ = 1/c^p + o(1). Furthermore, we give a Las Vegas locality-sensitive filter construction for the unit sphere that can be used with the data-dependent data structure of Andoni et al. (SODA 2017) to achieve optimal space-time tradeoffs in the data-dependent setting. For the symmetric case, this gives us a data-dependent Las Vegas data structure with query time O(dn^ρ) and space usage O(dn^1+ρ) for (r, c r)-approximate near neighbors in R^d under the ℓ_p norm, where ρ = 1/(2c^p - 1) + o(1). Our data-independent construction improves on the recent Las Vegas data structure of Ahle (FOCS 2017) for ℓ_p when 1 < p< 2. Our data-dependent construction does even better for ℓ_p for all p∈ [1, 2] and is the first Las Vegas approximate near neighbors data structure to make use of data-dependent approaches. We also answer open questions of Indyk (SODA 2000), Pagh (SODA 2016), and Ahle by showing that for approximate near neighbors, Las Vegas data structures can match state-of-the-art Monte Carlo data structures in performance for both the data-independent and data-dependent settings and across space-time tradeoffs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/21/2020

(2+ε)-ANN for time series under the Fréchet distance

We study approximate-near-neighbor data structures for time series under...
research
06/05/2019

Fair Near Neighbor Search: Independent Range Sampling in High Dimensions

Similarity search is a fundamental algorithmic primitive, widely used in...
research
05/10/2021

Near Neighbor Search via Efficient Average Distortion Embeddings

A recent series of papers by Andoni, Naor, Nikolov, Razenshteyn, and Wai...
research
11/13/2020

Kernel Density Estimation through Density Constrained Near Neighbor Search

In this paper we revisit the kernel density estimation problem: given a ...
research
04/08/2019

Subsets and Supermajorities: Unifying Hashing-based Set Similarity Search

We consider the problem of designing Locality Sensitive Filters (LSF) fo...
research
02/18/2019

RACE: Sub-Linear Memory Sketches for Approximate Near-Neighbor Search on Streaming Data

We demonstrate the first possibility of a sub-linear memory sketch for s...
research
09/04/2021

Fast Succinct Retrieval and Approximate Membership using Ribbon

A retrieval data structure for a static function f:S→{0,1}^r supports qu...

Please sign up or login with your details

Forgot password? Click here to reset