Similarity Problems in High Dimensions

The main contribution of this dissertation is the introduction of new or improved approximation algorithms and data structures for several similarity search problems. We examine the furthest neighbor query, the annulus query, distance sensitive membership, nearest neighbor preserving embeddings and set similarity queries in the large-scale, high-dimensional setting.

READ FULL TEXT
research
12/06/2018

Confirmation Sampling for Exact Nearest Neighbor Search

Locality-sensitive hashing (LSH), introduced by Indyk and Motwani in STO...
research
08/05/2023

DeDrift: Robust Similarity Search under Content Drift

The statistical distribution of content uploaded and searched on media s...
research
06/22/2021

Practical Near Neighbor Search via Group Testing

We present a new algorithm for the approximate near neighbor problem tha...
research
08/02/2023

Rethinking Similarity Search: Embracing Smarter Mechanisms over Smarter Data

In this vision paper, we propose a shift in perspective for improving th...
research
03/20/2017

Fast Spectral Ranking for Similarity Search

Despite the success of deep learning on representing images for particul...
research
08/07/2023

Semantic Equivalence of e-Commerce Queries

Search query variation poses a challenge in e-commerce search, as equiva...
research
03/08/2021

Nearest Neighbor Search Under Uncertainty

Nearest Neighbor Search (NNS) is a central task in knowledge representat...

Please sign up or login with your details

Forgot password? Click here to reset