Hyperplane Arrangements and Locality-Sensitive Hashing with Lift

12/26/2012
by Makiko Konoshima, et al.

Locality-sensitive hashing converts high-dimensional feature vectors, such as those extracted from images and speech, into bit arrays, which allows fast similarity calculation using the Hamming distance. One such hashing scheme maps a feature vector to a bit array according to the signs of the inner products between the feature vector and the normal vectors of hyperplanes placed in the feature space. This hashing can be seen as a discretization of the feature space by hyperplanes. If labels for the data are given, the hyperplanes can be determined with learning algorithms. However, many proposed learning methods do not consider the hyperplanes' offsets. Omitting the offsets decreases the number of regions into which the space is partitioned and weakens the correlation between Hamming distances and Euclidean distances. In this paper, we propose a lift map that converts learning algorithms without offsets into ones that take the offsets into account. With this method, learning methods without offsets discretize the space as if they took the offsets into account. We applied the proposed method to several high-dimensional feature data sets and studied the relationship between the statistical characteristics of the data, the number of hyperplanes, and the effect of the proposed method.
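The sign-based hashing and the role of the offsets can be illustrated with a minimal sketch. It assumes the simplest possible lift, appending a constant coordinate to each feature vector, so that a hyperplane through the origin of the lifted space acts like a hyperplane with an offset in the original space. This is an illustrative assumption and not necessarily the lift map defined in the paper; the names `lift`, `hash_bits`, and `hamming` are hypothetical, and the normal vectors are drawn at random rather than learned.

```python
import random


def lift(x, c=1.0):
    """Append a constant coordinate c to x (an assumed, simplest-case lift)."""
    return list(x) + [c]


def hash_bits(x, normals):
    """Return one bit per hyperplane: 1 if the inner product <w, x> >= 0, else 0.

    Every hyperplane passes through the origin of the space x lives in;
    no offset term is used here.
    """
    return [
        1 if sum(w_i * x_i for w_i, x_i in zip(w, x)) >= 0 else 0
        for w in normals
    ]


def hamming(a, b):
    """Number of positions at which two equal-length bit arrays differ."""
    return sum(p != q for p, q in zip(a, b))


# Two points on the same ray from the origin: an offset-free hyperplane
# cannot separate them, because scaling x never changes the sign of <w, x>.
near, far = [0.5, 0.0], [2.0, 0.0]
print(hash_bits(near, [[1.0, 0.0]]), hash_bits(far, [[1.0, 0.0]]))

# After lifting, the last component of the normal plays the role of an
# offset: [1, 0, -1] encodes the hyperplane x_1 - 1 = 0 in the original space.
lifted_normals = [[1.0, 0.0, -1.0]]
print(hash_bits(lift(near), lifted_normals), hash_bits(lift(far), lifted_normals))

# Random (not learned) hyperplanes in the lifted space, for illustration only.
rng = random.Random(0)
dim, n_bits = 4, 8
normals = [[rng.gauss(0.0, 1.0) for _ in range(dim + 1)] for _ in range(n_bits)]
x = [0.5, -1.0, 2.0, 0.1]
y = [0.4, -0.9, 2.1, 0.0]
print(hamming(hash_bits(lift(x), normals), hash_bits(lift(y), normals)))
```

In this sketch the hashing function itself never handles an offset; the extra coordinate introduced by the lift supplies it, which is the sense in which an offset-free method can partition the original space as if offsets were present.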


research
01/16/2013

Kernelized Locality-Sensitive Hashing for Semi-Supervised Agglomerative Clustering

Large scale agglomerative clustering is hindered by computational burden...
research
12/15/2019

Efficient Bitmap-based Indexing and Retrieval of Similarity Search Image Queries

Finding similar images is a necessary operation in many multimedia appli...
research
02/10/2020

Locality-sensitive hashing in function spaces

We discuss the problem of performing similarity search over function spa...
research
10/18/2016

Using Centroidal Voronoi Tessellations to Scale Up the Multi-dimensional Archive of Phenotypic Elites Algorithm

The recently introduced Multi-dimensional Archive of Phenotypic Elites (...
research
01/26/2022

Rapid solution for searching similar audio items

A naive approach for finding similar audio items would be to compare eac...
research
02/11/2019

Deep Hashing using Entropy Regularised Product Quantisation Network

In large scale systems, approximate nearest neighbour search is a crucia...
research
06/08/2018

A neural network catalyzer for multi-dimensional similarity search

This paper aims at learning a function mapping input vectors to an outpu...
