Near Neighbor: Who is the Fairest of Them All?

06/06/2019
by   Sariel Har-Peled, et al.
0

In this work we study a fair variant of the near neighbor problem. Namely, given a set of n points P and a parameter r, the goal is to preprocess the points, such that given a query point q, any point in the r-neighborhood of the query, i.e., (q,r), have the same probability of being reported as the near neighbor. We show that LSH based algorithms can be made fair, without a significant loss in efficiency. Specifically, we show an algorithm that reports a point in the r-neighborhood of a query q with almost uniform probability. The query time is proportional to O( dns(q.r) (n,c) ), and its space is O((n,c)), where (n,c) and (n,c) are the query time and space of an LSH algorithm for c-approximate near neighbor, and dns(q,r) is a function of the local density around q. Our approach works more generally for sampling uniformly from a sub-collection of sets of a given collection and can be used in a few other applications. Finally, we run experiments to show performance of our approach on real data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/26/2021

Sampling a Near Neighbor in High Dimensions – Who is the Fairest of Them All?

Similarity search is a fundamental algorithmic primitive, widely used in...
research
11/13/2020

Kernel Density Estimation through Density Constrained Near Neighbor Search

In this paper we revisit the kernel density estimation problem: given a ...
research
06/05/2019

Fair Near Neighbor Search: Independent Range Sampling in High Dimensions

Similarity search is a fundamental algorithmic primitive, widely used in...
research
04/30/2020

A more secure IPv6 neighborhood process

The process of neighborhood establishment in an IPv6 network is made out...
research
06/30/2018

Approximate Nearest Neighbors in Limited Space

We consider the (1+ϵ)-approximate nearest neighbor search problem: given...
research
08/10/2021

Fast and Fair Lock-Free Locks

We present a randomized approach for lock-free locks with strong bounds ...
research
04/08/2023

Analysis of Sampling Strategies for Implicit 3D Reconstruction

In the training process of the implicit 3D reconstruction network, the c...

Please sign up or login with your details

Forgot password? Click here to reset