Hashing for Similarity Search: A Survey

08/13/2014
by   Jingdong Wang, et al.
0

Similarity search (nearest neighbor search) is a problem of pursuing the data items whose distances to a query item are the smallest from a large database. Various methods have been developed to address this problem, and recently a lot of efforts have been devoted to approximate search. In this paper, we present a survey on one of the main solutions, hashing, which has been widely studied since the pioneering work locality sensitive hashing. We divide the hashing algorithms two main categories: locality sensitive hashing, which designs hash functions without exploring the data distribution and learning to hash, which learns hash functions according the data distribution, and review them from various aspects, including hash function design and distance measure and search scheme in the hash coding space.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2022

Falconn++: A Locality-sensitive Filtering Approach for Approximate Nearest Neighbor Search

We present Falconn++, a novel locality-sensitive filtering (LSF) approac...
research
09/17/2015

Learning to Hash for Indexing Big Data - A Survey

The explosive growth in big data has attracted much attention in designi...
research
06/01/2016

A Survey on Learning to Hash

Nearest neighbor search is a problem of finding the data points from the...
research
11/01/2022

Asymmetric Hashing for Fast Ranking via Neural Network Measures

Fast item ranking is an important task in recommender systems. In previo...
research
09/18/2021

When Similarity Digest Meets Vector Management System: A Survey on Similarity Hash Function

The booming vector manage system calls for feasible similarity hash func...
research
04/05/2023

Unfolded Self-Reconstruction LSH: Towards Machine Unlearning in Approximate Nearest Neighbour Search

Approximate nearest neighbour (ANN) search is an essential component of ...
research
02/11/2019

Deep Hashing using Entropy Regularised Product Quantisation Network

In large scale systems, approximate nearest neighbour search is a crucia...

Please sign up or login with your details

Forgot password? Click here to reset